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The present invention relates to isolated Porphorymonas gingivalis polypeptides and nucleotides. The polypeptides include: an amino 
acid sequence selected from the group consisting of SEQ. ID. NO. 265 to SEQ. ID. NO. 528, SEQ. ID. NO. 531 and SEQ. ID. NO. 532; 
or an amino acid sequence at least 85 %, preferably at least 95 %, identical to an amino acid sequence selected from the group consisting 
of SEQ. ID. NO. 265 to SEQ. ID. NO. 528, SEQ. ID. NO. 531 and SEQ. ID. NO. 532; or at least 40 amino acids having a contiguous 
sequence of at least 40 amino acids identical to a contiguous amino acid sequence selected from the group consisting of SEQ. ID. NO. 265 
to SEQ. ID. NO. 528, SEQ. ID. NO. 531 and SEQ. ID. NO. 532, 
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Porphorymonas gingivalis polypeptides and nucleotides 

FIELD OF THE INVENTION 

5 The present invention relates to P. gingivalis nucleotide sequences, 

P, gingivalis polypeptides and probes for detection of P. gingivalis. The 
P. gingivalis polypeptides and nucleotides can be used in compositions for 
use in raising an immune response in a subject against P. gingivalis and 
treating or preventing or reducing the severity of the condition known as 
10 periodontitis. 

BACKGROUND OF THE INVENTION 

Periodontal diseases are bacterial-associated inflammatory diseases of 

15 the supporting tissues of the teeth and range from the relatively mild form of 
gingivitis, the non-specific, reversible inflammation of gingival tissue to the 
more aggressive forms of periodontitis which are characterised by the 
destruction of the tooth's supporting structures. Periodontitis is associated 
with a subgingival infection of a consortium of specific Gram-negative 

20 bacteria that leads to the destruction of the periodontium and is a major 
public health problem. One bacterium that has attracted considerable 
interest is P. gingivalis as the recovery of this microorganism from adult 
periodontitis lesions can be up to 50% of the subgingival anaerobically 
cultivable flora, whereas P gingivalis is rarely recovered, and then in low 

25 numbers, from healthy sites. A proportional increase in the level of 

P. gingivalis in subgingival plaque has been associated with an increased 
severity of periodontitis and eradication of the microorganism from the 
cultivable subgingival microbial population is accompanied by resolution of 
the disease. The progression of periodontitis lesions in non-human primates 

30 has been demonstrated with the subgingival implantation of P gingivalis. 
These findings in both animals and humans suggest a major role for 
P gingivalis in the development of adult periodontitis. 

P gingivalis is a black-pigmented, anaerobic, asaccharolytic, 
proteolytic Gram-negative rod that obtains energy from the metabolism of 

35 specific amino acids. The microorganism has an absolute growth 

requirement for iron, preferentially in the form of haeme or its Fe(III) 
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oxidation product haemin and when grown under conditions of excess 
haemin is highly virulent in experimental animals. A number of virulence 
factors have been implicated in the pathogenicity of P. gingivalis including 
the capsule, adhesins, cytotoxins and extracellular hydrolytic enzymes. 
5 In order to develop an efficacious and safe vaccine to prevent, 

eliminate or reduce P. gingivalis colonisation it is necessary to identify and 
produce antigens that are involved in virulence that have utility as 
immunogens possibly through the generation of specific antibodies. Whilst it 
is possible to attempt to isolate antigens directly from cultures of P. gingivalis 

10 this is often difficult. For example as mentioned above, P. gingivalis is a 

strict anaerobe and can be difficult to isolate and grow. It is also known that, 
for a number of organisms, when cultured in vitro that many virulence genes 
are down regulated and the encoded proteins are no longer expressed. If 
conventional chemistry techniques were applied to purify vaccine candidates 

15 potentially important (protective) molecules may not be identified. With 

DNA sequencing, as the gene is present (but not transcribed) even when the 
organism is grown in vitro it can be identified, cloned and produced as a 
recombinant DNA protein. Similarly, a protective antigen or therapeutic 
target may be transiently expressed by the organism in vitro or produced in 

20 low levels making the identification of these molecules extremely difficult by 
conventional methods. 

With serological identification of therapeutic targets one is limited to 
those responses which are detectable using standard methods such as 
Western Blotting or ELISA. The limitation here is the both the level of 

25 response that is generated by the animal or human and determining whether 
this response is protective, damaging or irrelevant. No such limitation is 
present with a sequencing approach to the identification of potential 
therapeutic or prophylactic targets. 

It is also well known that P. gingivalis produces a range of broadly 

30 active proteases (University of Melbourne International Patent Application 
No PCT /AU 96/00673, US Patent Nos 5,475,097 and 5,523,390), which make 
the identification of intact proteins difficult because of their degradation by 
these proteases. 
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SUMMARY OF THE INVENTION 

The present inventors have attempted to isolate R gingivalis 
nucleotide sequences which can be used for recombinant production of 
5 P. gingivalis polypeptides and to develop nucleotide probes specific for 
P. gingivalis. The DNA sequences listed below have been selected from a 
large number of P. gingivalis sequences according to their indicative potential 
as vaccine candidates. This intuitive step involved comparison of the 
deduced protein sequence from the P. gingivalis DNA sequences to the 
10 known protein sequence databases. Some of the characteristics used to 

select useful vaccine candidates include; the expected cellular location, such 
as outer membrane proteins or secreted proteins, particular functional 
activities of similar proteins such as those with an enzymatic or proteolytic 
activity, proteins involved in essential metabolic pathways that when 
15 inactivated or blocked may be deleterious or lethal to the organism, proteins 
that might be expected to play a role in the pathogenesis of the organism 
eg. red cell lysis, cell agglutination or cell receptors and proteins which are 
paralogues to proteins with proven vaccine efficacy. 

In a first aspect the present invention consists an isolated antigenic 
20 Porphorj^monas gingivalis polypeptide, the polypeptide comprising; 

an amino acid sequence selected from the group consisting of SEQ. 

ID. NO. 265 to SEQ. ID, NO. 528, SEQ. ID. NO. 531 and SEQ. ID. NO. 

532; or 

an amino acid sequence at least 85%, preferably at least 95%, 
25 identical to an amino acid sequence selected from the group 

consisting of SEQ. ID. NO. 265 to SEQ. ID. NO. 528, SEQ. ID. NO. 
531 and SEQ. ID. NO. 532; or 

at least 40 amino acids having a contiguous sequence of at least 40 
amino acids identical to a contiguous amino acid sequence selected 
30 from the group consisting of SEQ. ID. NO. 265 to SEQ. ID. NO. 528, 

SEQ. ID. NO. 531 and SEQ. ID. NO. 532. 
In an embodiment of the present invention the polypeptide 
comprises; 

an amino acid sequence selected from the group consisting of SEQ. 
35 ID. NO. 386 to SEQ. ID. NO. 528 and SEQ. ID. NO. 532; or 
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an amino acid sequence at least 85%, preferably at least 95%, 
identical to an amino acid sequence selected from the group 
consisting of SEQ. ID, NO. 386 to SEQ. ID. NO. 528 and SEQ. ID. NO. 
532; or 

5 at least 40 amino acids having a contiguous sequence of at least 40 

amino acids identical to a contiguous amino acid sequence selected 
from the group consisting of SEQ. ID. NO. 386 to SEQ. ID. NO. 528 
and SEQ. ID. NO. 532. 
As used herein % identity for polypeptides is to be calculated using the 
10 alignment algorithm of Needleman and Munsch (9) using a standard protein 
scoring matrix (Blosum 50). 

In a preferred embodiment of the present invention the polypeptide 
comprises an amino acid sequence selected from the group consisting of 
SEQ. ID. NO. 386, SEQ. ID. NO. 424, SEQ. ID. NO. 425, SEQ. ID. NO. 434, 
15 SEQ. ID. NO. 447, SEQ. ID. NO. 458, SEQ. ID. NO. 475, SEQ. ID. NO. 498, 
SEQ. ID. NO. 499, SEQ. ID. NO. 500, SEQ. ID. NO. 501, SEQ. ID. NO. 387, 
SEQ. ID. NO. 400, SEQ. ID. NO. 411, SEQ. ID. NO. 419, SEQ. ID. NO. 420, 
SEQ. ID. NO. 427, SEQ. ID. NO. 429, SEQ. ID. NO. 433, SEQ. ID. NO. 437, 
SEQ. ID. NO. 438, SEQ. ID. NO. 443, SEQ. ID. NO. 444, SEQ. ID. NO. 448, 
20 SEQ. ID. NO. 449, SEQ. ID. NO. 452, SEQ. ID. NO. 455, SEQ. ID. NO. 457, 
SEQ. ID. NO. 459, SEQ. ID. NO. 461, SEQ. ID. NO. 462, SEQ. ID. NO. 463, 
SEQ. ID. NO. 467, SEQ. ID. NO. 468, SEQ. ID. NO. 469, SEQ. ID. NO. 482, 
SEQ. ID. NO. 484, SEQ. ID. NO. 485, SEQ. ID. NO. 494, SEQ. ID. NO. 508, 
SEQ. ID. NO. 509, SEQ. ID. NO. 510, SEQ. ID. NO. 520, SEQ. ID. NO. 521, 
25 SEQ. ID. NO. 522, SEQ. ID. NO. 525, SEQ. ID. NO. 526, SEQ. ID. NO. 528, 
SEQ. ID. NO. 389, SEQ. ID. NO. 390 and SEQ. ID. NO. 391. 

In another preferred embodiment of the present invention the 
polypeptide comprises an amino acid sequence selected from the group 
consisting of residue 422 to residue 531 of SEQ. ID. NO. 303, residue 534 to 
30 residue 582 of SEQ. ID. NO. 303, residue 127 to residue 232 of SEQ. ID. NO. 
301, residue 240 to residue 259 of SEQ. ID. NO. 301, residue 139 to residue 
156 of SEQ. ID. NO. 295, residue 160 to residue 178 of SEQ. ID. NO. 295, 
residue 180 to residue 207 of SEQ. ID. NO. 295, residue 221 to residue 257 of 
SEQ. ID. NO. 295, residue 259 to residtie 323 of SEQ. ID. NO. 295, residue 
35 885 to residue 985 of SEQ. ID. NO. 299, residue 147 to residue 259 of SEQ. 
ID. NO. 363, residue 140 to residue 252 of SEQ. ID. NO. 344, residue 247 to 
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residue 356 of SEQ. ID. NO. 353, residue 359 to residue 391 of SEQ. ID. NO. 
353, residue 120 to residue 254 of SEQ. ID. NO. 300, residue 287 to residue 
311 of SEQ. ID. NO. 286, residue 313 to residue 352 of SEQ. ID. NO. 286, 
residue 354 to residue 401 of SEQ. ID. NO. 286, residue 208 to residue 252 of 
5 SEQ. ID. NO. 287, residue 259 to residue 373 of SEQ. ID. NO. 287, residue 5 
to residue 120 of SEQ. ID. NO. 293, residue 123 to residue 139 of SEQ. ID. 
NO. 293, residue 233 to residue 339 of SEQ. ID. NO. 265, residue 67 to 
residue 228 of SEQ. ID. NO. 278, residue 130 to residue 172 of SEQ. ID. NO. 
274, residue 174 to residue 238 of SEQ. ID. NO. 274, residue 99 to residue 

10 112 of SEQ. ID. NO. 274, residue 114 to residue 128 of SEQ. ID. NO. 274, 
residue 26 to residue 69 of SEQ. ID. NO. 285, residue 71 to residue 128 of 
SEQ. ID. NO. 285, residue 130 to residue 146 of SEQ. ID. NO. 285, residue 
620 to residue 636 of SEQ. ID. NO. 327, residue 638 to residue 775 of SEQ. 
ID. NO. 327, residue 397 to residue 505 of SEQ. ID. NO. 301, residue 528 to 

15 residue 545 of SEQ. ID. NO. 301, residue 556 to residue 612 of SEQ. ID. NO. 

301, residue 614 to residue 631 of SEQ. ID. NO. 301, residue 633 to residue 
650 of SEQ. ID. NO. 301, residue 553 to residue 687 of SEQ. ID. NO. 299, 
residue 305 to residue 447 of SEQ. ID. NO. 289, residue 1 to residue 52 of 
SEQ. ID. NO. 364, residue 65 to residue 74 of SEQ. ID. NO. 364, residue 486 

20 to residue 604 of SEQ. ID. NO. 275, residue 158 to residue 267 of SEQ. ID. 
NO. 272, residue 270 to residue 282 of SEQ. ID. NO. 272, residue 163 to 
residue 237 of SEQ. ID. NO. 273, residue 240 to residue 251 of SEQ. ID. NO. 
273, residue 213 to residue 344 of SEQ. ID. NO. 282, residue 183 to residue 
324 of SEQ. ID. NO. 292, residue 327 to residue 341 of SEQ. ID. NO. 292, 

25 residue 352 to residue 372 of SEQ. ID. NO. 292, residue 141 to residue 166 of 
SEQ. ID. NO. 271, residue 168 to residue 232 of SEQ. ID. NO. 271, residue 1 
to residue 13 of SEQ. ID. NO. 302, residue 15 to residue 28 of SEQ. ID. NO. 

302, residue 30 to residue 72 of SEQ. ID. NO. 302, residue 476 to residue 529 
of SEQ. ID. NO. 277, residue 41 to residue 146 of SEQ. ID. NO. 299, residue 

30 149 to residue 162 of SEQ. ID. NO. 299, residue 166 to residue 177 of SEQ. 
ID. NO. 299, residue 192 to residue 203 of SEQ. ID. NO. 299, residue 71 to 
residue 343 of SEQ. ID. NO. 290, residue 346 to residue 363 of SEQ. ID. NO. 
290, residue 36 to residue 240 of SEQ. ID. NO. 331, residue 242 to residue 
270 of SEQ. ID. NO. 331, residue 1 to i-fesidue 192 of SEQ. ID. NO. 375, 

35 residue 266 to residue 290 of SEQ. ID. NO. 375, residue 23 to residue 216 of 
SEQ. ID. NO. 279, residue 220 to residue 270 of SEQ. ID. NO. 279, residue 
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285 to residue 386 of SEQ. ID. NO. 279, residue 84 to residue 234 of SEQ. ID. 
NO. 297, residue 248 to residue 259 of SEQ. ID. NO. 297, residue 261 to 
residue 269 of SEQ. ID. NO. 297, residue 275 to residue 402 of SEQ. ID. NO. 
294, residue 1 to residue 171 of SEQ. ID. NO. 298, residue 403 to residue 417 
5 of SEQ. ID. NO. 307, residue 420 to residue 453 of SEQ. ID. NO. 307, residue 
456 to residue 464 of SEQ. ID. NO. 307, residue 468 to residue 690 of SEQ. 
ID. NO. 307, residue 1 to residue 285 of SEQ. ED. NO. 304, residue 287 to 
residue 315 of SEQ. ID. NO. 304, residue 318 to residue 336 of SEQ. ID. NO. 
304, residue 255 to residue 269 of SEQ. ID. NO. 342, residue 271 to residue 

10 337 of SEQ. ID. NO. 342, residue 347 to residue 467 of SEQ. ID. NO. 281, 

residue 116 to residue 136 of SEQ. ID. NO. 375, residue 138 to residue 357 of 
SEQ. ID. NO. 375, residue 133 to residue 423 of SEQ. ID. NO. 364, residue 
141 to residue 299 of SEQ. ID. NO. 305, residue 202 to residue 365 of SEQ. 
ID. NO. 296, residue 134 to residue 426 of SEQ. ID. NO. 288, residue 1 to 

15 residue 218 of SEQ. ID. NO. 276, residue 1 to residue 246 of SEQ. ID. NO. 
280, residue 444 to residue 608 of SEQ. ID. NO. 364, residue 10 to residue 
686 of SEQ. ID. NO. 283, residue 1 to residue 148 of SEQ. ID. NO. 296, 
residue 1 to residue 191 of SEQ. ID. NO. 287, residue 193 to residue 204 of 
SEQ. ID. NO. 287, residue 209 to residue 373 of SEQ. ID. NO. 287, residue 

20 211 to residue 470 of SEQ. ID. NO. 284, residue 472 to residue 482 of SEQ. 
ID. NO. 284, residue 133 to residue 144 of SEQ. ID. NO. 281, residue 146 to 
residue 336 of SEQ. ID. NO. 281, residue 1 to residue 264 of SEQ. ID. NO. 
303, residue 265 to residue 295 of SEQ. ID. NO. 303, residue 297 to residue 
326 of SEQ. ID. NO. 303, residue 328 to residue 338 of SEQ. ID. NO. 303, 

25 residue 247 to residue 356 of SEQ. ID. NO. 353, residue 358 to residue 391 of 
SEQ. ID. NO. 353, residue 257 to residue 288 of SEQ. ID. NO. 298, residue 
290 to residue 385 of SEQ. ID. NO. 298. residue 245 to residue 256 of SEQ. 
ID. NO. 298, residue 422 to residue 802 of SEQ. ID. NO. 303, residue 803 to 
residue 814 of SEQ. ID. NO. 303, residue 139 to residue 156 of SEQ. ID. NO. 

30 295, residue 160 to residue 340 of SEQ. ID. NO. 295, residue 145 to residue 
361 of SEQ. ID. NO. 282, residue 363 to residue 387 of SEQ. ID. NO. 282, 
residue 398 to residue 471 of SEQ. ID. NO. 282, residue 573 to residue 679 of 
SEQ. ID. NO. 320, residue 27 to residue 168 of SEQ. ID. NO. 291, residue 170 
to residue 183 of SEQ. ID. NO. 291, residue 185 to residue 415 of SEQ. ID. 

35 NO. 291, residue 1 to residue 301 of SEQ. ID. NO. 364, residue 114 to residue 
702 of SEQ. ID. NO. 337, residue 377 to residue 412 of SEQ. ID. NO. 321, 
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residue 413 to residue 772 of SEQ, ID. NO, 321, residue 14 to residue 454 of 

SEQ. ID. NO. 265, residue 129 to residue 614 of SEQ. ID. NO. 268, residue 1 

to residue 930 of SEQ. ID. NO. 300, residue 932 to residue 1046 of SEQ. ID. 

NO. 300, residue 1 to residue 301 of SEQ. ID. NO. 364, residue 1 to residue 
5 42 of SEQ. ID. NO. 381, residue 44 to residue 973 of SEQ. ID. NO. 381, 

residue 1 to residue 93 of SEQ. ID. NO. 358, residue 95 to residue 179 of SEQ. 

ID. NO. 358, residue 181 to residue 227 of SEQ. ID. NO. 358, residue 114 to 

residue 702 of SEQ. ID. NO. 337, residue 1 to residue 659 of SEQ. ID. NO. 

355, residue 661 to residue 907 of SEQ. ID. NO. 355, residue 1 to residue 131 
10 of SEQ. ID. NO. 370, residue 133 to residue 601 of SEQ. ID. NO. 370, residue 

1 to residue 813 of SEQ. ID. NO. 344, residue 377 to residue 412 of SEQ. ID. 

NO. 321, residue 413 to residue 772 of SEQ. ID. NO, 321, and residue 189 to 

residue 614 of SEQ. ID. NO. 364. 

In a second aspect the present invention consists in a n isolated 
15 antigenic Porphoiymonas gingivalis polypeptide, the polypeptide comprising 

an amino acid sequence selected from the group consisting of SEQ. ID. NO. 

386 to SEQ. ID. NO. 528 and SEQ. ID. NO. 532 less the leader sequence set 

out in Table 3. 

In a third aspect the present invention consists in an isolated DNA 
20 molecule, the DNA molecule comprising a nucleotide sequence which 
encodes the polypeptide of the first aspect the pi^esent invention or a 
sequence which hybridises thereto under stringent conditions. 

It is preferred that the isolated DNA molecule comprises a nucleotide 
sequence selected from the group consisting of SEQ. ID. NO. 1 to SEQ. ID. 
25 NO. 264, SEQ. ID. NO. 529 and SEQ. ID. NO, 530, 

In a fourth aspect the present invention consists in a recombinant 
expression vector comprising the DNA molecule of the second aspect of the 
present invention operably linked to a transcription regulatory element. 
The present invention also provides a cell comprising this 
30 recombinant expression vector. 

In a further aspect the present invention consists in a method for 
producing a R gingivalis polypeptide comprising culturing the cell under 
conditions that permit expression of the polypeptide. 

In yet a further aspect the present invention provides a composition 
35 for use in raising an inamune response directed against P. gingivalis in a 
subject, the composition comprising an effective amount of at least one 



BNSOOCID- <WO 9929870A1_L> 



wo 99/29870 PCT/AU98/01023 



polypeptide of the first aspect of the present invention, or at least one DNA 
molecule of the second aspect of the present invention, or both,and a 
pharmaceutically acceptable carrier. It is preferred that the pharmaceutically 
acceptable carrier is an adjuvant. In other aspects the present invention 
5 provides methods of treating P. gingivalis infection in subject comprising the 
administration of the composition to the subject such that treatment of P. 
gingivalis infection occurs. The treatment may be prophylactic or 
therapeutic. 

In yet another aspect the present invention provides an antibody 
10 raised against a polypeptide of the first aspect the invention. The antibody 
may be polyclonal or monoclonal. The present invention also provides 
compositions including these antibodies. It is preferred that these 
compositions are adapted for oral use and may be, for example, dentrifices, 
mouthwashes, etc. 

15 In a still further aspect the present invention provides a nucleotide 

probe comprising at least 18 nucleotides and having a contiguous sequence 
of at least 18 nucleotides identical to a contiguous nucleotide sequence 
selected from the group consisting of SEQ. ID. NO. 1 to SEQ. ID. NO. 121, 
SEQ. ID. NO. 529, and sequences complementary thereto. It is preferred that 

20 the probe further comprises a detectable label. 

The present invention also provides a method for detecting the 
presence of P. gingivalis nucleic acid in a sample comprising: 

(a) contacting a sample with the nucleotide probe under conditions in 
which a hybrid can form between the probe and a P. gingivalis 

25 nucleic acid in the sample; and 

(b) detecting the hybrid formed in step (a), wherein detection of a 
hybrid indicates the presence of a P. gingivalis nucleic acid in the 
sample. 



30 DETAILED DESCRIPTION 



Definitions 

A purified or isolated polypeptide or a substantially pure preparation 
of a polypeptide are used interchangeably herein and, as used herein, mean a 
35 polypeptide that has been separated from other proteins, lipids, and nucleic 
acids with which it naturally occurs. Preferably, the polypeptide is also 
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separated from substances, e.g., antibodies or gel matrix, e.g., 
polyacrylamide, which are used to purify it. Preferably, the polypeptide 
constitutes at least 10, 20, 50 70, 80 or 95% dry weight of the purified 
preparation. Preferably, the preparation contains: sufficient polypeptide to 
5 allow protein sequencing; at least 1, 10, or 100 mg of the polypeptide. 

A purified preparation of cells refers to, in the case of plant or animal 
cells, an in vitro preparation of cells and not an entire intact plant or animal. 
In the case of cultured cells or microbial cells, it consists of a preparation of 
at least 10% and more preferably 50% of the subject cells, 

10 A purified or isolated or a substantially pure nucleic acid, e.g., a 

substantially pure DNA, (are terms used interchangeably herein) is a nucleic 
acid which is one or both of the following: not immediately contiguous with 
both of the coding sequences with which it is immediately contiguous (i.e., 
one at the 5' end and one at the 3* end) in the naturally occurring genome of 

15 the organism from which the nucleic acid is derived; or which is 

substantially free of a nucleic acid with which it occurs in the organism from 
which the nucleic acid is derived. The term includes, for example, a 
recombinant DNA which is incorporated into a vector, e.g., into an 
autonomously replicating plasmid or virus, or into the genomic DNA of a 

20 prokaiyote or eukaiyote, or which exists as a separate molecule (e.g., a cDNA 
or a genomic DNA fragment produced by PGR or restriction endonuclease 
treatment) independent of other DNA sequences. Substantially pure DNA 
also includes a recombinant DNA which is part of a hybrid gene encoding 
additional P. gingivalis DNA sequence. 

25 A "contig" as used herein is a nucleic acid representing a continuous 

stretch of genomic sequence of an organism. 

An "open reading frame", also referred to herein as ORF, is a region of 
nucleic acid which encodes a polypeptide. This region may represent a 
portion of a coding sequence or a total sequence and can be determined from 

30 a stop to stop codon or from a start to stop codon. 

As used herein, a "coding sequence" is a nucleic acid which is 
transcribed into messenger RNA and/or translated into a polypeptide when 
placed under the control of appropriate regulatory sequences. The boundaries 
of the coding sequence are determined* by a translation start codon at the five 

35 prime terminus and a translation stop code at the three prime terminus, A 
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coding sequence can include but is not limited to messenger RNA synthetic 
DNA, and recombinant nucleic acid sequences. 

A "complement" of a nucleic acid as used herein refers to an anti- 
parallel or antisense sequence that participates in Watson-Crick base-pairing 
5 with the original sequence. 

A "gene product" is a protein or structural RNA which is specifically 
encoded by a gene. 

As used herein, the term "probe" refers to a nucleic acid, peptide or 
other chemical entity which specifically binds to a molecule of interest. 

10 Probes are often associated with or capable of associating with a label. A 
label is a chemical moiety capable of detection. Typical labels comprise 
dyes, radioisotopes, luminescent and chemiluminescent moieties, 
fluorophores, enzymes, precipitating agents, amplification sequences, and 
the like. Similarly, a nucleic acid, peptide or other chemical entity which 

15 specifically binds to a molecule of interest and immobilizes such molecule is 
referred herein as a "capture ligand". Capture ligands are typically associated 
with or capable of associating with a support such as nitro-cellulose, glass, 
nylon membranes, beads, particles and the like. The specificity of 
hybridization is dependent on conditions such as the base pair composition 

20 of the nucleotides, and the temperature and salt concentration of the 

reaction. These conditions are readily discernible to one of ordinary skill in 
the art using routine experimentation. 

Homologous refers to the sequence similarity or sequence identity 
between two polypeptides or between two nucleic acid molecules. When a 

25 position in both of the two compared sequences is occupied by the same base 
or amino acid monomer sub unit, e.g., if a position in each of two DNA 
molecules is occupied by adenine, then the molecules are homologous at that 
position. The percent of homology between two sequences is a function of 
the number of matching or homologous positions shared by the two 

30 sequences divided by the number of positions compared x 100. 

The terms peptides, proteins, and polypeptides are used 
interchangeably herein. 

An "immunogenic component" as used herein is a moiety, such as an 
P. gingivalis polypeptide, analog or fragment thereof, that is capable of 

35 eliciting a humoral and/or cellular immune response in a host animal. 
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An "antigenic component" as used herein is a moiety, such as 
P. gingivalis polypeptide, analog or fragment thereof, that is capable of 
binding to a specific antibody with sufficiently high affinity to form a 
detectable antigen-antibody complex. 

As used herein, the term "cell-specific promoter" means a DNA 
sequence that serves as a promoter, i.e., regulates expression of a selected 
DNA sequence operably linked to the promoter, and which effects expression 
of the selected DNA sequence in specific cells of a tissue. The term also 
covers so-called "leaky" promoters, which regulate expression of a selected 
DNA primarily in one tissue, but cause expression in other tissues as well. 

As used herein, the term "control sequence" refers to a nucleic acid 
having a base sequence which is recognized by the host organism to effect 
the expression of encoded sequences to which they are ligated. The nature of 
such control sequences differs depending upon the host organism; in 
prokaryotes, such control sequences generally include a promoter, ribosomal 
binding site, terminators, and in some cases operators; in eukaiyotes, 
generally such control sequences include promoters, terminators and in some 
instances, enhancers. The term control sequence is intended to include at a 
minimum, all components whose presence is necessary for expression, and 
may also include additional components whose presence is advantageous, for 
example, leader sequences. 

As used herein, the term "operably linked" refers to sequences joined 
or ligated to function in their intended manner. For example, a control 
sequence is operably linked to coding sequence by ligation in such a way 
that expression of the coding sequence is achieved under conditions 
compatible with the control sequence and host cell 

A "sample" as used herein refers to a biological sample, such as, for 
example, tissue or fluid isolated from an individual (including without 
limitation plasma, serum, cerebrospinal fluid, lymph, tears, saliva and tissue 
sections) or from in vitro cell culture constituents, as well as samples from 
the environment. 

The practice of the invention will employ, unless otherwise 
indicated, conventional techniques of chemistry, molecular biology, 
microbiology, recombinant DNA, and immunology well known to those 
skilled in the art. Such techniques are described and explained throughout 
the literature in sources such as, J. Perbal, A Practical Guide to Molecular 
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Cloning, John Wiley and Sons (1984), J. Sambrook et al., Molecular Cloning: 
A Laboratory Manual, Cold Spring Harbour Laboratory Press (1989), T.A. 
Brown (editor), Essential Molecular Biology: A Practical Approach. Volumes 
1 and 2, IRL Press (1991), D.M. Glover and B,D, Hames (editors), DNA 
5 Cloning: A Practical Approach, Volumes 1-4, IRL Press (1995 and 1996), and 
F.M. Ausubel et al. (Editors), Current Protocols in Molecular Biology, Greene 
Pub. Associates and Wiley-Interscience (1988, including all updates until 
present). The disclosure of these texts are incorporated herein by reference. 

10 Pharmaceutically Acceptable Carriers 

The antibodies, polypeptides and DNA of the present invention can 
be included in compositions which include a carrier or diluent. These 
compositions include pharmaceutical compositions where the carrier or 

15 diluent will be pharmaceutically acceptable. Pharmaceutically acceptable 
carriers or diluents include those used in compositions suitable for oral, 
rectal, nasal, topical (including buccal and sublingual), vaginal, parenteral 
(including subcutaneous, intramuscular, intravenous, intradermal, 
intrathecal and epidural) administration. They are non-toxic to recipients at 

20 the dosages and concentrations employed. Representative examples of 

pharmaceutically acceptable carriers or diluents include, but are not limited 
to; water, isotonic solutions which are preferably buffered at a physiological 
pH (such as phosphate-buffered saline or Tris-buffered saline) and can also 
contain one or more of, mannitol, lactose, trehalose, dextrose, glycerol, 

25 ethanol or polypeptides (such as human serum albumin). The compositions 
may conveniently be presented in unit dosage form and may be prepared by 
any of the methods well known in the art of pharmacy. 

As will be well understood by those skilled in the art alterations may 
30 be made to the amino acid sequences set out in the Sequence Listings. These 
alterations may be deletions, insertions, or substitutions of amino acid 
residues. The altered polypeptides can be either naturally occurring (that is 
to say, purified or isolated from a natural source) or synthetic (for example, 
by performing site-directed metagenesis on the encoding DNA). It is 
35 intended that such altered polypeptides which have at least 85%, preferably 
at least 95% identity with the sequences set out in the Sequence Listing are 
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within the scope of the present invention. Antibodies raised against these 
altered polypeptides will also bind to the polypeptides having one of the 
sequences set out in the Sequence Listings. The level of % identity is to be 
calculated as set out above. 
5 Protein sequences are homologous if they are related by divergence 

from a common ancestor. Consequently, a species homologue of the protein 
will be the equivalent protein which occurs naturally in another species. 
Within any one species a homologue may exist as numerous allelic variants, 
and these will be considered homologues of the protein. Allelic variants and 
10 species homologues can be obtained by following standard techniques known 
to those skilled in the art. 

An allelic variant will be a variant that is naturally occurring within 
an individual organism. 

15 Mutants, Variants and Homology - Nucleic Acids 

Mutant polynucleotides will possess one or more mutations which 
are deletions, insertions, or substitutions of nucleotide residues. Mutants 
can be either naturally occurring (that is to say, isolated from a natural 
source) or synthetic (for example, by performing site-directed metagenesis on 

20 the DNA). It is thus apparent that polynucleotides of the invention can be 
either naturally occurring or recombinant (that is to say prepared using 
recombinant DNA techniques). 

An allelic variant will be a variant that is naturally occurring within 
an individual organism. 

25 Nucleotide sequences are homologous if they are related by 

divergence from a common ancestor. Consequently, a species homologue of 
the polynucleotide will be the equivalent polynucleotide which occurs 
naturally in another species. Within any one species a homologue may exist 
as numerous allelic variants, and these will be considered homologues of the 

30 polynucleotide. Allelic variants and species homologues can be obtained by 
following standard techniques known to those skilled in the art. 

Antibody Production 

Antibodies, either polyclonal or monoclonal, which are specific for a 
35 polypeptide of the present invention can be produced by a person skilled in 
the art using standard techniques such as, but not limited to, those described 
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by Harlow et al. Antibodies: A Laboratory Manual, Cold Springs Harbor 
Laboratoiy Press (1988), and D. Catty (editor), Antibodies: A Practical 
Approach, IRL Press (1988). 

Various procedures known in the art may be used for the production 
5 of polyclonal antibodies to epitopes of a protein. For the production of 
polyclonal antibodies, a number of host animals are acceptable for the 
generation of antibodies by immunization with one or more injections of a 
polypeptide preparation, including but not limited to rabbits, mice, rats, etc. 
Various adjuvants may be used to increase the immunological response in 

10 the host animal, depending on the host species, including but not limited to 
Freund's (complete and incomplete), mineral gels such as aluminium 
hydroxide, surface active substances such as lysolecithin, plui^onic polyols, 
polyanions, oil emulsions, keyhole lympet hemocyanins, dinitrophenol, and 
potentially useful human adjuvants such as BCG (bacille Calmette-Guerin) 

15 and Corynebacterium parvum. 

A monoclonal antibody to an epitope of a protein may be prepared by 
using any technique which provides for the production of antibody 
molecules by continuous cell lines in culture. These include but are not 
limited to the hybridoma technique originally described by Kohler and 

20 Milstein (1975, Nature 256, 493-497), and the more recent human B-cell 

hybridoma technique (Kesber et al, 1983, Immunology Today 4:72) and EBV- 
hybridoma technique (Cole et al. 1985, Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc. pp. 77-96). In addition, techniques developed for 
the production of "chimeric antibodies" by splicing the genes from antibody 

25 molecule of appropriate antigen specificity together with genes from a 
human antibody molecule of appropriate biological activity may be used 
(Morrison et al. 1984, Proc. Natl. Acad. Sci,, 81:6851-6855; Neuberger et al. 
1984 Nature 312:604-608; Takeda et al. 1985 Nature 31:452-454), 
Alternatively, techniques described for the production of single chain 

30 antibodies (U.S. Patent 4,946,778) can be adapted to produce 4-specific single 
chain antibodies. 

Recombinant human or humanized versions of monoclonal 
antibodies are a preferred embodiment for human therapeutic applications. 
Humanized antibodies may be prepared according to procedures in the 

35 literature (e,g. Jones et al. 1986, Nature 321:522-25; Reichman et al. 1988 
Nature 332:323-27; Verhoeyen et al. 1988, Science 239:1534-36). The 
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recently described "gene conversion metagenesis" strategy for the production 
of humanized monoclonal antibody may also be employed in the production 
of humanized antibodies (Carter et al. 1992 Proc. Natl. Acad. Sci. U.S.A. 
89:4285-89), Alternatively, techniques for generating the recombinant phase 
5 library of random combinations of heavy and light regions may be used to 
prepare recombinant antibodies (e.g. Huse et al. 1989 Science 246:1275-81). 

Antibody fragments which contain the idiotype of the molecule such 
as Fu F(abl) and F(ab2) may be generated by known techniques. For 
example, such fragments include but are not limited to: the F(ab) E2 

10 fragment which can be produced by pepsin digestion of the intact antibody 
molecule; the Fab' fragments which can be generated by reducing the 
disulfide bridges of the F(ab*)2 fragment, and the two Fab fragments which 
can be generated by treating the antibody molecule with papain and a 
reducing agent. Alternatively, Fab expression libraries may be constructed 

15 (Huse et al. 1989, Science 246:1275-1281) to allow rapid and easy 

identification of monoclonal Fab fragment with the desired specificity to a 
protein. 

Adjuvants 

20 "Adjuvant" means a composition comprised of one or more 

substances that enhances the immunogenicity and efficacy of a vaccine 
composition. Non-limiting examples of suitable adjuvants include squalane 
and squalene (or other oils of animal origin); block copolymers; detergents 
such as Tween®-80; Quil® A, mineral oils such as Drakeol or Marcol, 

25 vegetable oils such as peanut oil; Corynebacterium-derived adjuvants such as 
Corynebacterium parvum; Propionibacterium-derived adjuvants such as 
Propionibacterium acne; Mycobacterium bovis (Bacillus Calmetic and 
Guerinn or BCG); interleukins such as interleukin 2 and interleukin-12; 
monokines such as interleukin 1; tumour necrosis factor; interferons such as 

30 gamma interferon; combinations such as saponin-aluminium hydroxide or 
Quil-A aluminium hydroxide; liposomes; ISCOM adjuvant; mycobacterial 
cell wall extract; synthetic glycopeptides such as muramyl dipeptides or 
other derivatives; Avridine; Lipid A; dextran sulfate; DEAE-Dextran or DHAE- 
Dextran with aluminium phosphate; carboxypolymethylene such as 

35 Carbopor EMA; acrylic copolymer emulsions such as Neocryl A640 (e.g. U.S. 
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Pat. No, 5,047,238); vaccinia or animal posvirus proteins; sub-viral particle 
adjuvants such as cholera toxin, or mixtures thereof. 

As used herein, stringent conditions are those that (1) employ low 
5 ionic strength and high temperature for washing, for example, 0.015 M 
NaCl/0.0015 M sodium citrate/0.1% NaDodS04 at 50°C; (2) employ during 
hybridisation a denaturing agent such as formamide, for example, 50% 
(vol/vol) formamide with 0.1% bovine serum albumin, 0,1% Ficoll, 0.1% 
pol5rvinylpyrrolidone, 50 mM sodium phosphate buffer at pH 6.5 with 750 
10 mM NaCl, 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 
5 X SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate 
(pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution, sonicated 
salmon sperm DNA (50 |ig/ml), 0.1% SDS and 10% dextran sulfate at 42*^0 in 
0.2 X SSC and 0.1% SDS 

15 

As will be understood the present invention includes within its scope 
DNA vaccination. Further information regarding DNA vaccination may be 
found in Donnelly et al, Journal of Immunological Methods 176(1994) 145- 
152, the disclosure of which is incorporated herein by reference. 
20 Throughout this specification the word "comprise", or variations such 

as "comprises" or "comprising", will be vmderstood to imply the inclusion of a 
stated element or integer or group of elements or integers but not the 
exclusion of any other element or integer, or group of elements or integers. 

25 Preparation of the P. gingivalis library for sequencing. 

To determine the DNA sequence of P. gingivalis genomic DNA was 
isolated from P. gingivalis strain W50 (ATCC 53978) essentially by the 
method described by Mamur J. ( J. Mol. Biol. 3, 208-218, 1961). Cloning of 

30 DNA fragments was performed essentially as described by Fleischmann et al., 
(Science; 269, 496-512, 1995)(2). Briefly, purified genomic DNA from 
P. gingivalis was nebulized to fragment the DNA and was treated with Bal31 
nuclease to create blunt ends then run twice through preparative 1% agarose 
gels. DNA fragments of 1.6-2.0 kb were excised from the gel and the DNA 

35 recovered. This DNA was then ligated to the vector pUCl8 (Smal digested 
and dephosphorylated; Pharmacia) and electrophoresed through a 1% 
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preparative agarose gel. The fragment comprising linear vector plus one 
insert was excised, purified and this process repeated to reduce any vector 
without insert contamination. The recovered vector plus insert DNA was 
blunt-ended with T4 DNA polymerase, then a final ligation to produce 
5 circular DNA was performed. Aliquots of Epicurian Coli Electroporation- 
Competent Cells (Stratagene) were ti^ansformed with the ligated DNA and 
plated out on SOB agar antibiotic diffusion plates containing X-gal and 
incubated at 37°C overnight. Colonies with inserts appeared white and those 
without inserts (vector alone) appeared blue. Plates were stored at 4''C until 
10 the white clones were picked and expanded for the extraction of plasmid 
DNA for sequencing. 

DNA sequencing 

Plasmid DNA was prepared by picking bacterial colonies into 1.5ml 

15 of LB, TB or SOB broth supplemented with 50-lOOug/ml Ampicillin in 96 
deep well plates. Plasmid DNA was isolated using the QIAprep Spin or 
QIAprep 96 Turbo miniprep kits (QIAGEN GmbH, Germany). DNA was 
eluted into a 96 well gridded array and stored at -20C. 

Sequencing reactions were performed using ABI PRISM Dye 

20 Terminator and ABI PRISM BIGDye Terminator Cycle Sequencing Ready 
Reaction kits with AmpliTaq DNA polymerase FS (PE Applied Biosystems, 
Foster City, CA) using the Ml 3 Universal forward and reverse sequencing 
primers. Sequence reactions were conducted on either a Perkin-Elmer 
GeneAmp 9700 (PE Applied Biosystems) or Hybaid PGR Express (Hybaid, 

25 UK) thermal cyclers. Sequencing reactions were analysed on ABI PRISM 377 
DNA sequencers (PE Applied Biosystems). 

The sequences obtained are set out below. The relationship between 
these sequences is set out in Table 1, The initiation codon was calculated 
using a combination of sequence homology alignment (FASTA), signal 

30 sequence prediction (PSORT, Signal?) or ORF prediction (GeneMark). 
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Table 1: Reference table indicating the relationships of each sequence ID to 
the selected proteins. 



Protein 


DNA 


Amino acid 


DNA sequence ol 


Amino acid 


name 


sequence of 


sequence of 


protein 


sequence of 




complete 


complete ORF 




protein 




ORF 








PGl 


1 


265 


122 


386 


PGlO 


2 


266 


123 


387 


PGlOO 


3 


267 


124 


388 


PGlOl 


4 


268 






PG102 


5 


269 


125, 126 


389, 390 


PG104 


6 


270 


127 


391 


PG105 


7 


271 


128 


392 


PG106 


8 


272 


129 


393 


PG107 


9 


273 


130, 131, 132 


394, 395, 396 


PG108 


10 


274 


133 


397 


PG109 


11 


275 


134, 135 


398, 399 


PGll 


12 


276 


136 


400 


PGl 10 


13 


277 


137 


401 


PGlll 


14 


278 






PG112 


15 


279 


138, 139 


402, 403 


PG113 


16 


280 


140 


404 


PG114 


17 


281 


141 


405 


PG115 


18 


282 


142 


406 


PG116 


19 


283 


143 


407 


PG117 


20 


284 


144 


408 


PG118 


21 


285 


145 


409 


PG119 


22 


286 


146 


410 


PG12 


23 


287 


147 


411 


PG120 


24 


288 


148 


412 
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Prntf^i n 


nNA 




ui\/\. sequence o, 


Amino acid 


name 


seautmcB nf 




Lit ULClIl 


sec|ueiice oi 




comulete 


comtilptp ORP 




Lll VJLcIU 




ORF 








PG121 


25 


289 


149 


413 


PG122 


26 


290 


150 


414 


PG123 


27 


291 


151 


415 


PG124 


28 


292 


152 


416 


PG125 


29 


293 




4-1 7 

TC X / 


PG126 


30 


294 




4.1ft 
*± xo 


PG13 


31 


295 




*t xy 


PG14 


32 


296 






PG15 


33 


297 


JlxJ / 


4-91 
'x^ X 


PG16 


34 


298 






PG18 


35 


299 


159 


47'? 


PG2 


36 


300 


160 161 


A9A 47S 


PG21 


37 


301 


162 


476 


PG22 


38 


302 


163 


477 


PG23 


39 


303 


164 


478 


PG24 


40 


304 


16^ 

X.VJ \J 


47Q 


PG25 


41 


305 






PG27 


42 


306 


1R7 


4^1 


PG28 


43 


307 


IfiR 
±.\j\j 


4*^9 


PG29 


44 


308 


IRQ 


^OO 


PG3 


45 


309 


1 70 


4^4 


PG30 


46 


310 


1 71 


4"^ Pi 


PG31 


47 


311 


172 


4^6 


PG32 


48 


312 


173 


437 


PG33 


49 


313 


174 


438 


PG34 


50 


314 


175, 176 


439, 440 


PG35 


51 


315 


177 


441 


PG36 


52 


316 


178 


442 


PG37 


53 


317 


179, 180 


443, 444 


PG38 


54 


318 


181 


445 



wo 99/29870 



20 



PCT/AU98/01023 



Protein 


DNA 


Amino acid 


DNA sequence of 


Amino acid 


name 


sequence of 
complete 
ORF 


sequence of 
complete ORF 


protein 


sequence of 
protein 


PG39 


55 


319 


182 


446 


PG4 


56 


320 


183 


447 


PG40 


57 


321 


184 


448 


PG41 


58 


322 


185 


449 


PG42 


59 


323 


186 


450 


PG43 


60 


324 


187 


451 


PG44 


61 


325 


188 


452 


PG45 


62 


326 


189 


453 


PG46 


63 


327 


190 


454 


PG47 


64 


328 


191 


455 


PG48 


65 


329 


192 


456 


PG49 


66 


330 


193 


457 


PG5 


67 


331 


194 


458 


PG50 


68 


332 


195 


459 


PG51 


69 


333 


196 


460 


PG52 


70 


334 


197 


461 


PG53 


71 


335 


198 


462 


PG54 


72 


336 


199 


463 


PG55 


73 


337 


200 


464 


PG56 


74 


338 


201, 202 


465, 466 


PG57 


75 


339 


203, 204, 205 


467, 468, 469 


PG58 


76 


340 


206, 207 


470, 471 


PG59 


77 


341 


208, 209, 210 


472, 473, 474 




78 


342 


211 


475 


PG60 


79 


343 


212 


476 


PG61 


80 


344 


213 


477 


PG62 


81 


345 


214 


478 


PG63 


82 


346 


215 


479 


PG64 


83 


347 


216 


480 


PG65 


84 


348 


217 


481 
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Protein 


DNA 


Amino acid 


DNA senuenc6 


Aminn apiH 


name 


sequence of 


sequence of 


protein 


seouence of 




complete 


complete ORF 




u rote in 




ORF 








PG66 


85 


349 


218 


482 


PG67 


86 


350 


219 


483 


PG68 


87 


351 


220, 221 


484, 485 


PG69 


88 


352 


222 


486 


PG7 


89 


353 


223 


487 


PG70 


90 


354 


224 


488 


PG71 


91 


355 


225 


489 


PG72 


92 


356 


226 


490 


PG73 


93 


357 


227 


491 


PG74 


94 


358 


228 


492 


PG75 


95 


359 


229 


493 


PG76 


96 


360 


230 


494 


PG77 


97 


361 


231 




PG78 


98 


362 


232 


496 


PG79 


99 


363 


233 


497 


PG8 


100 


364 


234 235 236 


498 49Q ^00 "SOI 








237 




PG80 


101 


365 


238 


502 


PG81 


102 


366 


102 


366 


PG82 


103 


367 


239 


503 


PG83 


104 


368 


240 


504 


PG84 


105 


369 


241, 242 


505 506 


PG85 


106 


370 


243 


507 


PG86 


107 


371 


244, 245 


508, 509 


PG87 


108 


372 


246 


510 


PG88 


109 


373 


247, 248, 249 


511, 512, 513 


PG89 


110 


374 


250 


514 


PG9 


111 


375 


251, 252, 253 


515, 516, 517 


PG90 


112 


376 


254, 255 


518, 519 


PG91 


113 


377 


256 


520 
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Protein 


DNA 


Amino acid 


DNA sequence of 


Amino acid 


name 


sequence of 
complete 
ORF 


sequence of 
complete ORF 


protein 


sequence of 
protein 


PG92 


114 


378 


257 


521 


PG93 


115 


379 


258 


522 


PG94 


116 


380 


259 


523 


PG95 


117 


381 


260 


524 


PG96 


118 


382 


261 


525 


PG97 


119 


383 


262 


526 


PG98 


120 


384 


263 


527 


PG99 


121 


385 


264 


528 


PG127 


529 


531 


530 


532 



DNA sequence analysis 

5 

DNA files in FASTA format were converted to GGG format files and 
imported into a database. The DNA files were translated into amino acid 
files using the program Flip obtained from ANGIS(Austi'alian Genomic 
Information Service, University of Sydney, Australia). A series of 

10 bioinformatic analyses were performed on the proteins in order to select 
potential vaccine candidates. The programs used were FASTA homology 
searching (1), PSORT (2,3), Signal? (4), TopPred (5), and GeneMark (6). The 
proteins and their bioinformatic results were stored in the custom written 
database for search and retrieval of proteins with the desired characteristics 

15 The FASTA homology results for these proteins were then examined 

for any alignment with a protein suggesting surface location or vaccine 
efficacy. All proteins were searched for homology against a non-redundant 
bacterial protein database compiled by ANGIS using the FASTA algorithm. 
The settings used for the FASTA searches were Ktup = 2, gap creation 

20 penalty = -12, gap extension penalty = -2, width for deriving alignment in 

opt = 16 and the Blosum 50 scoring matrix. Individual FASTA search results 
were examined for significant homology by statistical probability and amino 
acid alignments. The results are set out in Table 2. 
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Protein files were then trimmed to the first, second, third, fourth and 
fifth methionine residues using a protein trimming program (ANGIS). The 
trimmed proteins were then subjected to PSORT analysis for the detection of 
signal sequences and the prediction of cell location. Proteins exhibiting a 
PSORT probability of outer membrane >0.8 were considered to indicate 
surface localisation. A second signal sequence detection program SignalP 
was also performed and, in certain instances, this program detected signals 
not identified with PSORT. All proteins identified by other methods were 
also analysed by PSORT and SignalP. Previously, the C-terminal amino acid 
of bacterial outer membrane proteins has been shown to be important for the 
assembly of the protein on the outer membrane (7). A typical structure 
definition for outer membrane proteins has been determined as the presence 
of a signal sequence at the N-terminus and a tyrosine or phenylalanine at the 
C-terminus. A number of the selected proteins exhibit this characteristic 
structure. The program TopPred was used to determine the presence and 
number of membrane spanning domains (MSDs) and the presence of such 
sequences indicates a preference to be attached to membranes such as the 
outer membrane. The results of PSORT, SignalP and TopPred analyses with 
the C-terminal amino acids of the selected proteins are set out in Table 3. 

The 70 amino acids from the C-terminus of a number of P. gingivcdis 
outer membrane proteins share 50-100% protein sequence identity. These 
proteins included RGPl, RGP2, KGP, HagA, HagC, HagD, prtH and prtT. This 
conserved motif may be involved in the attachment or sorting of proteins to 
the outer membrane. The protein data set was searched using FASTA 
homology as described above and a number of novel proteins were identified 
which demonstrate similar motifs at their C-termini. The results are listed in 
Table 4 

The TonBIII box is a 30 amino acid motif present within TonB outer 
membrane receptors in a wide variety of bacteria. The TonBIII box of 
P. gingivalis (8) was used to search the protein data set for homology by 
FASTA as described above. Those proteins demonstrating significant 
homology are listed in Table 5. 
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Table 4: Percentage identity and percentage similarity of various proteins 
with the 70 amino acids from the C-terminal of the P. gingivalis arginine 
protease 1 (RGPl), arginine protease 2 (RGP2), and the cysteine 
protease/he magglutinin (prtT). 



name 
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Table 5; Percentage identity and percentage similarity of various proteins 
with the TonBEtl box of R gingivalis. 



Protein 
name 


Percent identity 


Percent similarity 


PG2 


46 


71 


PG13 


57 


93 


PG35 


50 


96 


PG47 


39 


71 
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54 


93 i 
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Cloning, expression and purification of recombinant P. gingiVaiis genes. 
PGl 

5 Oligonucleotides to the 5' and 3' regions of the deduced protein were 

used to amplify the gene of interest from a preparation of F. gingivalis W50 
genomic DNA using the TaqPlus Precision PGR System ( Stratagene) and a 
PTC-100 (M) Research) thermal cycler or similar device. The 5' 
oligonucleotide primer sequence was GCGCCATATGCTGGCCGAACCGGCC, 

10 the 3' oligonucleotide primer sequence was 

GCGCCTCGAGTCAATTCATTTCCTTATAGAG. The PGR fragment was 
purified, digested with Nde I, Xho I restriction enzymes (Promega) and 
ligated into the corresponding sites of the plasmid pProEx-l (Gibco-BRL) and 
transformed into J?. co7i ER1793 cells (a gift from Elizabeth Raleigh, New 

15 England Biolabs). A resulting clone expressing the correct insert was 

selected and induced with or without O.lmM IPTG [Promega) for expression 
of the recombinant protein. Expression of the recombinant protein was 
determined by SDS-PAGE analysis and Western Blot using the one of the 
rabbit antisera described above or an anti-hexahistidine antibody (Clontech) 

20 that detects the hexahistidine tag that was fused to the P, gingivalis 

recombinant protein. PGl was purified by disruption of the E. coli cells by 
sonication in binding buffer (Novagen) and solubilisation by the addition of 
sarkosyl (N-Lauroyl sarcosine) to a 1% final concentration. There after the 
preparation was diluted to 0.1% sarkosyl in binding buffer, bound to a 

25 Nickel-nitrilotriacetic acid column (Ni-NTA; Qiagen), after washing bound 
proteins were eluted with IM imidazole in elution buffer (Novagen) 
according to the Qiagen recommendations with 0.1% sarkosyl added to all 
buffers. Following purification samples were dialysed against 500mM NaCl, 
20mM Tris, 0.1% sarkosyl at pH7.4 to remove the imidazole, concentrated as 

30 required and stored at 4°C until used. Purity and antigenicity were assessed 
by SDS-PAGE and Western blot using selected antisera (from those described 
above) and the protein concentration was determined by the BCA assay 
(Pierce). 
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PG2 

The methods used for PG2 were essentially the same as for PGl with 
the following exceptions. The 5' oligonucleotide primer sequence was 
CGCGGTATACATGAAAAGAATGACGC, the 3' oligonucleotide primer 
5 sequence was CGCGAGATCTGAAAGACAACTGAATACC and the PGR 

product was cloned into pGex-stop PIBS(IV) (Patent appHcation W09619496, 
JC Cox, SE Edwards, I Frazer and EA Webb. Variants of human papilloma 
virus antigens) using the BstZ 171 and Bgl II restriction sites. 2% sarkosyl 
was used to solubilise PG2 and 8M urea was added to the solublisation buffer 
10 and to all other buffers. Urea was removed from the purified protein by 
sequential dialysis (4M then 2M then IM then 0.5M then OM urea all in 
50mM Tris, 500mM NaCl, 0.1% sarkosyl, pH7.43. Purified protein was stored 
at 4°C until requii^ed. 

15 PG3 

The methods used for PG3 were essentially the same as for PGl with 
the following exceptions. The 5* oligonucleotide primer sequence was 
GCGCGTATACATGAAGAAATCAAGTGTAG, the 3' oligonucleotide primer 
sequence was GCGCAGATCTCTTCAGCGTACCTTGCTGTG and DNA was 

20 amplified with Pfu DNA polymerase (Stratagene). The PGR product was 
cloned directly into pCR-Blunt and transformed into E. coli 
ToplOF'(InVitrogen) before subcloning into the expression plasmid pGex- 
stop RBS(IV) using the Est Z171 and Bgl II restriction sites and transformed 
into E. coli BL21DE3 (Pharmacia Biotech). The following modifications were 

25 made to the purification of PG3 from the PGl method. Cells expressing the 
recombinant protein were disrupted by sonication in binding buffer and the 
insoluble inclusion bodies concentrated by centrifugation. Inclusion bodies 
were then solubilised in 6M urea (Sigma) in binding buffer and eluted with 
6M urea added to the elution buffer. In some instances 6M guanidine 

30 hydrochloride (Sigma) was used instead of urea for these steps. Urea (or 
guanidine hydrochloride when it was substituted) was removed from the 
purified protein by sequential dialysis against reducing levels of urea (3M 
then 1.5M then 0.5M then OM urea all in 50mM Tris, SOOmM NaCl, 8% 
glycerol, pH7.4). Purified protein was stored frozen at -80''C until required. 

35 Protein concentration was determined by the Coomassie Plus protein assay 
(Pierce). 
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PG4 

The methods used for PG4 were essentially the same as for PG3 with 
the following exceptions. The 5' oligonucleotide primer sequence was 
CTTCTGTATACTTACAGCGGACATCATAAAATC, the 3' oligonucleotide 
5 primer sequence was TTCCAGGAGGGTACCACGCAACTCTTCTTCGAT and 
DNA was amplified with the Tth XL PGR kit (Perkin Elmer). The PGR 
product was cloned into the expression plasmid pGex-stop RBS(IV) using the 
Bst Z171 and Kpn I restriction sites and transformed inioE. coli ER1793. 

10 PG5 

The methods used for PG5 were essentially the same as for PG3 with 
the following exceptions. The 5' oligonucleotide primer sequence was 
TTGCAACATATGATCAGAACGATACTTTCA, the 3' oligonucleotide primer 
sequence was AGCAATCTCGAGCGGTTCATGAGCCAAAGG and DNA was 
15 amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24 (Novagen) using the Nde I and Xho I restriction 
sites and transformed intoE. coli BL21 (Pharmacia Biotech). Removal of 
urea was not proceeded past IM urea as the protein was insoluble at lower 
concentrations of urea. Purified protein was stored at 4^G until required. 

20 

PG6 

The methods used for PG6 were essentially the same as for PG3 with 
the following exceptions. The 5' oligonucleotide primer sequence was 
TAAAGATATGTGGCTCGAACCCATAATTGCTCGG, the 3' oligonucleotide 
25 primer sequence was GGTGGGCGGAAGGTTTGATGGGGGATTGGTAGT and 
DNA was amplified with the Tth XL PGR kit. The PGR product was cloned 
into the expression plasmid pET24a using the Nde I and Hind III restriction 
sites and transformed into E, coli BL21. 

30 PG8 

The methods used for PG8 were essentially the same as for PG3 with 
the following exceptions. The 5' oligonucleotide primer sequence was 
GGGGGTATACATGGAGTTCAAGATTGTG, the 3' oligonucleotide primer 
sequence was CGGGAGATGTGTTTTGTGAAAGGTTTTC and DNA was 
35 amplified with the TaqPlus Precision PGR System. The PGR product was 
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cloned into the expression plasmid pProEx-1 using the Nde I and Xho I 
restriction sites and transformed intofi", co7iERl793. 

PG8A 

5 PG8A is a shortened version of PG8 and has the first 173 amino acids 

removed. The methods used for PG8A were essentially the same as for PG3 
with the following exceptions. The 5* oligonucleotide primer sequence was 
CGCGGTATACATGGAAAACTTAAAGAAC, the 3' oligonucleotide primer 
sequence was CGCGAGATCTGTTTTCTGAAAGCTTTTC and DNA was 
10 amplified with the TaqPlus Precision PGR System. The PGR product was 
cloned into the expression plasmid pGex-stop RBS(rV) using the Est Z171 
and Bgl II restriction sites and transformed into E, coli ER1793. Prior to 
dialysis of the purified protein EDTA (Sigma) was added to a final 
concentration of lOmM. 

15 

PGlO 

The methods used for PGlO were essentially the same as for PG3 with 
the following exceptions. The 5* oligonucleotide primer sequence was 
GGCGGATATCATGGATAAAGTGAGGTATGC, the 3' oligonucleotide primer 
20 sequence was CGCGAGATCTTTTGTTGATACTCAATAATTG and DNA was 
amplified with the TaqPlus Precision PGR System. The PGR product was 
digested with Eco RV and Bgl II and ligated into the expression plasmid 
pGex-stop RBS(IV] using the Est Z171 and Bgl II resb'iction sites and 
transformed into E, coli ER1793. 

25 

PGll 

The methods used for PGll were essentially the same as for PGl with 
the following exceptions. The 5* oligonucleotide primer sequence was 
GGGGGTATAGATGAGAGGAAAGATTTGGCAGATAGTTTGGG, the 3' 

30 oligonucleotide pi'imer sequence was 

GGGCAGATGTGGGGAAGGGGAGTATATCGGC and DNA was amplified with 
Tli DNA polymerase (Promega). The PGR product was cloned into pCR-Blunt 
and transformed into E. coli ToplOF'before subcloning into the expression 
plasmid pGex-stop RBS(IV) using the Est Z171 and Bgl II restriction sites and 

35 transformed into E. coli ER1793. PGll was purified by solubilisation ofE, 
coli cells with 2% sarkosyl in binding buffer [Qiagen) which was diluted to 
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0.1% sarkosyl in binding buffer, bound to a Nickel-nitrilotriacetic acid 
column (Ni-NTA; Qiagen), after washing bound proteins were eluted with 
IM imidazole (0,7% CHAPS (Sigma) in elution buffer; Qiagen) according to 
the Qiagen recommendations. Following purification samples were dialysed 
5 against SOOmM NaCl, 20mM Tris, 0.7% CHAPS, 20% glycerol (Sigma) at 
pH7.4 to remove the imidazole, concentrated as required and stored at 4^C 
until used. 

PG12 

10 The methods used for PG12 were essentially the same as for PGl with 

the following exceptions. The 5* oligonucleotide primer sequence was 
GCGCGTATACATGAATAGCAGACATCTGACAATCACAATCATTGCCGG, 
the 3' oligonucleotide primer sequence was 

GCGCAGATCTGCTGTTCTGTGAGTGGAGTTGTTTAAGTG and DNA was 
15 amplified with Tli DNA polymerase. The PGR product was cloned into pCR- 
Blunt and transformed into E, coli ToplOF'cells before subcloning into the 
expression plasmid pGex-stop RBS(IV) using the Est Z171 and Bgl II 
restriction sites and transformed into E. coli BL21. Purification of the 
recombinant protein was essentially the same as PGll except 0.5% DHPC 
20 (l,2-Diheptanoyl-sn-glycero-3-phosphocholine; Avanti) in 50mM Tris, 50mM 
NaCl, pH8.0 was used to solubilise the inclusion bodies instead of sarkosyl 
and the DHPC was diluted to 0.1% before addition to the Ni-NTA and 0.1% 
DHPC was added to all buffers. 

25 PG13 

The methods used for PG13 were essentially the same as for PG3 with 
the following exceptions. The 5' oligonucleotide primer sequence was 
GCGCCATATGCGGACAAAAACTATCTTTTTTGCG, the 3' ohgonucleotide 
primer sequence was 

30 GCGCCTCGAGGTTGTTGAATCGAATCGCTATTTGAGC and DNA was 
amplified with Tli DNA polymerase. The PGR product was cloned the 
expression plasmid pET24b using the Nde I and Xho I restriction sites and 
ti^ansformed into E. coli BL21. Purification of the recombinant protein was 
essentially the same as PG3 using 6M urea and 1% NOG (n-octyl glucoside; 

35 Sigma) was added to the dialysis buffer. Removal of urea was not proceeded 
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past 2M urea as the protein was insoluble at lower concentrations of urea. 
Purified protein was stored at 4°C until required. 

PGM 

5 The methods used for PG12 were essentially the same as for PGl with 

the following exceptions. The 5' oligonucleotide primer sequence was 
GCGCGGCGCCATGACGGACAACAAACAACGTAATATCG, the 3' 
oligonucleotide primer sequence was 

GCGCCTCGAGTTACTTGCGTATGATCACGGACATACCC and DNA was 
10 amplified with Tli DNA polymerase. The PGR product was cloned the 

expression plasmid pProEx-1 using the Ehe I and Xho I restriction sites and 
transformed into E, coli BL21. Purification of the recombinant protein was 
essentially the same as PG12. 

15 PG15 

The methods used for PG15 were essentially the same as for PG3 with 
the following exceptions. The 5' oligonucleotide primer sequence was 
CAAAAGTATACTAATAAATATGATTCTCAA, the 3' oligonucleotide primer 
sequence was GCTTATGGTACCTTTGGTCTTATCTATTAT and DNA was 
20 amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pGex-stop RBS(IV] using the Bst Z171 and Kpn I 
restriction sites and transformed into E. coli ER1793. 

PG22 

25 The methods used for PG22 were essentially the same as for PGl with 

the following exceptions. The 5' oligonucleotide primer sequence was 
CCGGGGATGGGATGGGAGTGATGAAGGG, the 3' oligonucleotide primer 
sequence was GGCCCTCGAGGGGAACGGGGTGATAGGC and DNA was 
amplified with the TaqPlus Precision PGR System. The PGR product was 

30 cloned into the expression plasmid pET24b using the Bam HI and Xho I 
restiiction sites and transformed into E. coli BL21DE3. Once PG22 was 
purified dialysis was performed in the same manner as for PGl but in the 
presence of IM imidazole. 
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PG24 

The methods used for PG24 were essentially the same as for PG3 with 
the following exceptions. The 5' oligonucleotide primer sequence was 
CGCGGTATACATGAATTACCTGTACATAC, the 3' oligonucleotide primer 
5 sequence was CGCGGGATCCGTTCGATTGGTCGTCGATGG and DNA was 
amplified with the TaqPlus Precision PGR System. The PGR product was 
digested with Bst Z171 and Bam HI and ligated into the expression plasmid 
pGex-stop RBS(IV] using the Bst Z171 and Bgl II restriction sites and 
transformed into E. coli ER1793. Due to the low level of expression of PG24 
10 purification was not proceeded with except on small scale. 

PG24A 

A modified version of PG24 was also cloned and expressed. PG24A is 
the same as PG24 with the predicted N-terminal sequence removed. The 

15 methods used for PG24A were essentially the same as for PG3 with the 
following exceptions. The 5' oligonucleotide primer sequence was 
CGCGCATATGGAGATTGCTTTCCTTTCTTCG, the 3' oligonucleotide primer 
sequence was CGCGCTCGAGTTAGTTCGATTGGTCGTCG and DNA was 
amplified with the TaqPlus Precision PGR System, The PGR product was 

20 cloned into the expression plasmid pProEx-l using the Nde I and Xho I 
restriction sites and transformed into E. coli ER1793. Purification of the 
recombinant protein was essentially the same as PG3 except 8M urea was 
used to solubilise the inclusion bodies and in the buffers used for the Ni-NTA 
column purification. Urea was removed by sequential dialysis (4M then 2M, 

25 then IM then 0.5M then OM urea all in 50mM Tris, 500raM NaCl, 8% 

glycerol, pH7.4). Purified protein was stored frozen at -80*'C until required. 

PG29 

The methods used for PG29 were essentially the same as for PG3 with 
30 the following exceptions. The 5' oligonucleotide primer sequence was 

GCGCGATATCGCTAGCATGAAAAAGCTATTTCTC, the 3' oligonucleotide 
primer sequence was 

GGGCAGATCTCTCGAGTTTGCCATCGGATTGCGGATTG and DNA was 
amplified with Pfu DNA polymerase being used. The PGR product was 
35 cloned into pCR-Blunt (InVitrogen) and transformed into E. coli 

ToplOF'before subcloning into the expression plasmid pGex-stop RBS(IV) 
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using the EcoR V and Bgl II restriction sites and transformed into coli 
BL21. 6M urea was used throughout the purification process. 

PG30 

5 The methods used for PG30 were essentially the same as for PG3 with 

the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was TACGGAATTCGTGACCCCCGTCAGAAATGTGCGC, the 3' 
oligonucleotide primer sequence was 

10 CTATGCGGCCGCTTTGATCCTCAAGGCTTTGCCCGG and DNA was 

amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed intoE, co7j BL21DE3. Expression studies and immunore activity 
studies were carried out on whole E. coli lysates of PG30. 10ml cultures of 

15 recombinant E. coli were grown to an OD of 2.0 (Aeoonm ) in terrific broth and 
the cells were induced with 0.5 mM IPTG and samples taken for analysis at 4 
hours post induction. Purification was not done for these studies. 

PG31 

20 The methods used for PG31 were essentially the same as for PG30 

with the following exceptions. The 5' oligonucleotide primer sequence was 
CGGGGAATTCGCAAAAATCAATTTCTATGCTGAA, the 3' oligonucleotide 
primer sequence was CTATGCGGCCGCTGTATGCAATAGGGAAAGCTCCGA 
and DNA was amplified with the Tth XL PGR kit. The PGR product was 

25 cloned into the expression plasmid pET24a using the Eco RI and Not I 

restriction sites and transformed into E.coli BL21DE3. Expression studies 
and immunoreactivity studies were carried out on whole E. coli lysates. 
Purification was not done for these studies. 

30 PG32 

The methods used for PG32 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was CGCAGAATTCCAGGAGAATACTGTACCGGCAACG, the 3' 
35 oligonucleotide primer sequence was 

CTATGCGGCCGCCTTGGAGCGAACGATTACAACAC and DNA was 
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amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
5 for these studies. 

PG33 

The methods used for PG33 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
10 removed from the recombinant protein. 

The 5* oligonucleotide primer sequence was 

TGCAGAATTCCAAGAAGCTAGTACAGAGAACAAA, the 3' oligonucleotide 
primer sequence was GTATGGGGGGGGTTGGGGTGCAGTGATTAGTACAA 
and DNA was ampUfied with the Tth XL PGR kit. The PGR product was 
15 cloned into the expression plasmid pET24a using the Eco RI and Not I 

restriction sites and transformed intoE. coli BL21DE3. Expression studies 
and immunoreactivity studies were carried out on whole E. coli lysates. 
Purification was not done for these studies. 

20 PG35 

The methods used for PG35 were essentially the same as for PG30 
with the following exceptions. The 5' oligonucleotide primer sequence was 
GGGCGAATTCATGAAAGAAGTAAAGATTATGAGC, the 3' oligonucleotide 
primer sequence was GGGTGGGGGGGGGAAATTGATGTTTGTAGGGAGGA 
25 and DNA was amplified with the Tth XL PGR kit. The PGR product was 
cloned into the expression plasmid pET24a using the Eco RI and Not I 
restriction sites and transformed into£^. coii BL21DE3. Expression studies 
and immunoreactivity studies were carried out on whole coli lysates. 
Purification was not done for these studies. 

30 

PG36 

The methods used for PG36 were essentially the same as for PG30 
with the following exceptions. The 5' oligonucleotide primer sequence was 
AAAGGAATTCTAGAAAAAGATTATTGGGGTAGGA, the 3' oligonucleotide 
35 primer sequence was GTATGCGGGGGGGAAGTGGTGTGGGAGGAGAAAGT 
and DNA was amplified with the Tth XL PGR kit. The PGR product was 
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cloned into the expression plasmid pET24a using theEco RI and Not I 
restriction sites and transformed into E, coli BL21DE3. Expression studies 
and immunoreactivity studies were carried out on whole E. coli lysates. 
Purification was not done for these studies. 

5 

PG37 

The methods used for PG37 were essentially the same as for PG30 
with the following exceptions. The 5' oligonucleotide primer sequence was 
TGGCGAATTCAAACGGTTTTTGATTTTGATCGGC, the 3' oligonucleotide 

10 primer sequence was CTATGCGGCCGCCTTGCTAAAGCCCATCTTGCTCAG 
and DNA was amplified with the Tth XL PGR kit. The PGR product was 
cloned into the expression plasmid pET24a using the Eco RI and Not I 
restriction sites and transformed into E. coli BL21DE3. Expression studies 
and immunoreactivity studies were carried out on whole E. coli lysates. 

15 Purification was not done for these studies. 

PG38 

The methods used for PG38 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
20 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was CGTCGAATTCGAAAAGGTGGCAGTGGTAAACACT, the 3' 
oligonucleotide primer sequence was 

CTATGCGGCCGCCTTGATTCCGAGTTTCGCTTTTAC and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
25 expression plasmid pET24a using the Eco RI and Not I restriction sites and 

transformed into E, coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E, coli lysates. Purification was not done 
for these studies, 

30 PG39 

The methods used for PG39 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was AGCTGGATCCCAAGGeGTCAGGGTATCGGGCTAT, the 3' 
35 oligonucleotide primer sequence was 

GTATGCGGCCGCGAATTCGACGAGGAGACGCAGGT and DNA was 
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amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed intoE*. co?/ BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
5 for these studies. 

PG40 

The methods used for PG40 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
10 removed from the recombinant protein. The 5* oligonucleotide primer 

sequence was GGGTGAATTGAAGAGGGAGAAGGTCCCGACAGAT, the 3' 
oligonucleotide primer sequence was 

CTATGGGGCGGGGAAGTTGAGGATAACGTTAGGGA and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
15 expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. co/i BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E, coli lysates. Purification was not done 
for these studies. 

20 PG41 

The methods used for PG41 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
sequence was GAGTGAATTCGAAAAGGGGTGGGAAACGAGGGTA, the 3' 

25 oligonucleotide primer sequence was 

GTATGGGGGCGGTTGTTGGGGAATGCGCATGGGGTT and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed intoE". coii BL21DE3. Expression studies and immunoreactivity 

30 studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG42 

The methods used for PG42 were essentially the same as for PG30 
35 with the following exceptions. The 5' oligonucleotide primer sequence was 
GTTTGAATTCGGAAATAATAGTGTTTTGGGGAAG, the 3' oligonucleotide 
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primer sequence was 

GAGTGCGGCCGCTTTGCCGGACATCGAAGAGATGGTC and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
5 transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E, coli lysates. Purification was not done 
for these studies. 



PG43 

10 The methods used for PG43 were essentially the same as for PG30 

with the following exceptions. The 5' oligonucleotide primer sequence was 
GCGCGAATTCAAAAAAGAAAAACTTTGGATTGCG, the 3' oligonucleotide 
primer sequence was CTATGCGGCGGCCTTCAAAGCGAAAGAAGCCTTAAC 
and DNA was amplified with the Tth XL PGR kit. The PGR product was 

15 cloned into the expression plasmid pET24a using the Eco RI and Not I 

restriction sites and transformed intoE. coii BL21DE3. Expression studies 
and immunoreactivity studies were carried out on whole E. coli lysates. 
Purification was not done for these studies. 



20 PG44 

The methods used for PG44 were essentially the same as for PG30 
with the following exceptions. The predicted N- terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was AGGCGAATTCTGTAAGAAAAATGCTGACACTACC, the 3' 

25 oligonucleotide primer sequence was 

CTATGCGGCCGCCTTTTTCCCGGGCTTGATCCCGAT and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed intoE. co/z BL21DE3. Expression studies and immunoreactivity 

30 studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 



PG45 

The methods used for PG45 were essentially the same as for PG30 
35 with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
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sequence was GACAGGATCCTGCTCCACCACAAAGAATCTGCCG, the 3' 
oligonucleotide primer sequence was 

CTATGCGGCCGCGAAGGGATAGCCGACAGCCAAAT and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
5 expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 



10 PG46 

The methods used for PG46 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was CTCGGAATTCGGTTATGTGCCGGACGGTAGCAGA, the 3' 

15 oligonucleotide primer sequence was 

CTATGCGGCCGGGAACGGATAGCCTACTGCAATGT and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 

20 studies were carried out on whole E, coli lysates. Purification was not done 
for these studies. 



PG47 

The methods used for PG47 were essentially the same as for PG30 
25 with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
sequence was 

CGCGGAATTCCAAACAGTGGTGACCGGTAAGGTGATCGATTCAGAA, the 
3' oligonucleotide primer sequence was 

30 GTATGCGGGGGGGAAGTTTAGAGGAATACGGGTAGACGAAGTGGGGGC 
and DNA was amplified with the Tth XL PGR kit. The PGR product was 
cloned into the expression plasmid pET24a using the Eco RI and Not I 
restriction sites and transformed intoE. coli BL21DE3. Expression studies 
and immunoreactivity studies were carried out on whole E. coli lysates. 

35 Purification was not done for these studies. 
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The methods used for PG48 were essentially the same as for PG30 
with the following exceptions. The predicted N- terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
5 sequence was TGCTGAATTCCAAAAATCCAAGCAGGTACAGCGA, the 3' 
oligonucleotide primer sequence was 

CTATGCGGCCGCTCGTAACCATAGTCTTGGGTTTTG and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
10 transformed into K co7i BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E, coli lysates. Purification was not done 
for these studies. 

PG49 

15 The methods used for PG49 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
sequence was GAACGGATGCAACGAGGCGGTGGAAGACAGATGC, the 3' 
oligonucleotide primer sequence was 

20 GAGTGCGGCGGCTAATCTCGACTTCATACTTGTACCA and DNA was 

amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E, coli lysates. Purification was not done 

25 for these studies. 

PG50 

The methods used for PG50 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
30 removed from the recombinant protein. The 5* oligonucleotide primer 

sequence was GGTGGGATGCGGGACAGACACTGAGTTGAAGTAC, the 3' 
oligonucleotide primer sequence was 

GTATGCGGGCGGGAAGTTCACTAGGAAGCGCATGT and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
35 expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E, coli BL21DE3. Expression studies and immunoreactivity 
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studies were carried out on whole E, coli lysates. Purification was not done 
for these studies. 

PG51 

5 The methods used for PG51 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was TCTTGAATTCGCGCAAAGTCTTTTCAGCACCGAA, the 3' 
oligonucleotide primer sequence was 

10 CTATGCGGCCGCACTTTTTGGTGGGATCACTCTCTT and DNA was 

amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and inamunoreactivity 
studies were carried out on whole E, coli lysates. Purification was not done 

15 for these studies. 

PG52 

The methods used for PG52 were essentially the same as for PG30 
with the following exceptions. The 5' oligonucleotide primer sequence was 

20 AGAAGAATTCAAACGGACAATCCTCCTGACGGCA, the 3' oligonucleotide 
primer sequence was CTATGGGGCCGCGAAGTCTTTGCCCTGATAGAAATC 
and DNA was amplified with the Tth XL PGR kit. The PGR product was 
cloned into the expression plasmid pET24a using the Eco RI and Not I 
restriction sites and transformed into E. coli BL21DE3. Expression studies 

25 and immunoreactivity studies were carried out on whole E. coli lysates. 
Purification was not done for these studies. 

PG53 

The methods used for PG53 were essentially the same as for PG30 
30 with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GGCTGAATTCGCGAATCCCCTTACGGGCCAATCG, the 3' 
oligonucleotide primer sequence was 

CTATGCGGCCGCGTCGGAAAGGCAGGGGTAATAGG and DNA was 
35 amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 



BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



59 

transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

5 PG54 

The methods used for PG54 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was CGCTGAATTCCAGATTTCGTTCGGAGGGGAACCC, the 3' 

10 oligonucleotide primer sequence was 

CTATGCGGCCGCCTGCTTCACGATCTTTTGGCTCA and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 

15 studies were carried out on whole E, coli lysates. Purification was not done 
for these studies. 

PG55 

The methods used for PG55 were essentially the same as for PG30 
20 with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonvicleotide primer 
sequence was CGAGGGATCCGAGCTCTCTATTTGCGATGGCGAG, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTCTTACCTGACTTGTTGTCACGAAT and DNA was 
25 amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

30 

PG56 

The methods used for PG56 were essentially the same as for PG30 
with the following exceptions. The 5' oligonucleotide primer sequence was 
AAATGGATCCCGAAAAATTTTGAGCTTTTTGATG, the 3' oligonucleotide 
35 primer sequence was CTATGCGGCCGCTTTGATTCGTAATTTTTCCGTATC 
and DNA was amplified with the Tth XL PGR kit. The PGR product was 
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cloned into the expression plasmid pET24a using the Bam HI and Not I 
restriction sites and transformed into E. coli BL21DE3. Expression studies 
and immunoreactivity studies were carried out on whole E, coli lysates. 
Purification was not done for these studies. 

5 

PG57 

The methods used for PG57 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
10 sequence was TGCTGGATCCCAAGAGATCTCAGGCATGAATGCA, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTGGGCCTCTTTATCTCTACCTTTTC and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Bam HI and Not I restriction sites and 
15 transformed into K co7i BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E, coli lysates. Purification was not done 
for these studies. 



PG58 

20 The methods used for PG58 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was CGGTGAATTCGAAACGCCACGAAATACAGAAACC, the 3' 
oligonucleotide primer sequence was 

25 GAGTGCGGCGGCTGAAAGTCCAGCTAAAACGGGCGAA and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E, coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 

30 for these studies. 



PG59 

The methods used for PG59 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
35 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was TGCTGAATTCCAACAAGAGAAGCAGGTGTTTCAT, the 3' 
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oligonucleotide primer sequence was 

GAGTGCGGCCGCTGAAGATGCTCTTATCGTCCAAACG and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
5 transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 



PG60 

10 The methods used for PG60 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
sequence was GGCGGAATTCCAGATGCTCAATACTCCTTTCGAG, the 3' 
oligonucleotide primer sequence was 

15 GAGTGCGGCCGCTGAAGAGGTAGGAGATATTGCAGAT and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
ti-ansformed into E, coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 

20 for these studies. 



PGBl 

The methods used for PG61 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
25 removed from the recombinant protein. The 5* oligonucleotide primer 

sequence was AGCAGAATTGCCCGTCTCGAACAGCGAGATAGAT, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTGAAATGGATTGTCAGACTACCCAG and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
30 expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 
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PG62 

The methods used for PG62 were essentially the same as for PG30 
with the following exceptions. The predicted N- terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
5 sequence was TGCTGAATTCGAGCGGTTTCCGATGGTGCAGGGA, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTGAAGTGAAATCCGACACGCAGCTG and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
10 transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E, coli lysates. Purification was not done 
for these studies. 



PG63 

15 The methods used for PG63 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GGCAGAATTCCAAGAAGCAAACACTGCATCTGAC, the 3' 
oligonucleotide primer sequence was 

20 GAGTGCGGCCGCTGAAAGTGTACGCAACACCCACGCC and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed intoi?. coii BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 

25 for these studies. 



PG64 

The methods used for PG64 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminai signal sequence was 
30 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was TGCTGAATTCCAGAGTCGTCCTGCTCTTAGACTG, the 3' 
oligonucleotide primer sequence was 

GAGTGGGGCCGCTGAAGCGAACACCGAGACCCACAAA and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
35 expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E, coli BL21DE3. Expression studies and immunoreactivity 
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studies were carried out on whole E. coli lysates. Purification was not done 
for these studies, 

PG65 

5 The methods used for PG65 were essentially the same as for PG30 

with the following exceptions. The predicted N- terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GGCCGGATCCATCGGACAAAGCCGCCCGGCACTT, the 3' 
oligonucleotide primer sequence was 

10 GAGTGGGGCCGCTAAAGCGGTAACCTATGCCCACGAA and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into £*. coli BL21DE3, Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 

15 for these studies. 

PG66 

The methods used for PG66 were essentially the same as for PG30 
with the following exceptions. The predicted N- terminal signal sequence was 
20 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was GTTTGAATTGCAAGAGGTTATGAGAGGATGGTGA, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGGGGCTAAAATGAGTGGAGAGCGTGGGCAT and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
25 expression plasmid pET24a using the Eco RI and Not I restriction sites and 

transformed into £*. co7iBL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E, coli lysates. Purification was not done 
for these studies. 

30 PG67 

The methods used for PG67 were essentially the same as for PG30 
with the following exceptions. The predicted N- terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
sequence was GAACGAGGTGGGGGAAGGTGGTATGGGGGGAGGA, the 3' 
35 oligonucleotide primer sequence was 

GAGTGGGGGGGGTATAGCAAGTATTCGTGATGGGAGG and DNA was 
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amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Sac I and Not I restriction sites and 
transformed into£'. co72BL2lDE3. Expression studies and immunoreactivity 
studies were carried out on whole K coli lysates. Purification was not done 
5 for these studies. 



PG68 

The methods used for PG68 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
10 removed from the recombinant protein. The 5* oligonucleotide primer 
sequence was GCTTGCGGCCGCCCTTATGAAAGATTTGCAGAT, the 3' 
oligonucleotide primer sequence was 

GGTGCTCGAGTATACTCAACAAGCACCTTATGCAC and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
15 expression plasmid pET24a using the Not I and Xho I restriction sites and 

transformed into E, coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 



20 PG69 

The methods used for PG69 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was TGCTGAATTGGAGGAAGGGGAGGGGAGTGCCCGA, the 3' 

25 oligonucleotide primer sequence was 

GAGTGCGGGCGGTGAAGCTGTAGCGGGCTTTGAACCA and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed intoE. coli BL21DE3. Expression studies and immunoreactivity 

30 studies were carried out on whole E, coli lysates. Purification was not done 
for these studies. 



PG70 

The methods used for PG70 were essentially the same as for PG30 
35 with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
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sequence was CGGTGGATCCTCGCAAATGCTCTTCTCAGAGAAT, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGGCGCTAAACGAAATATCGATACCAACATC and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
5 expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E, coli BL21DE3. Expression studies and immunore activity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 



10 PG71 

The methods used for PG71 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was TGCTGAATTCCAGAACAATAGCCTCGATGTACAC, the 3' 

15 oligonucleotide primer sequence was 

GAGTGCGGCCGCTATTGCCGGTAGGATTTGGTTGTCC and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E, coli BL21DE3. Expression studies and immunoreactivity 

20 studies were carried out on whole £*. coli lysates. Purification was not done 
for these studies. 



PG72 

The methods used for PG72 were essentially the same as for PG30 
25 with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was TGGTGAATTGGGAGAGCGAGTGGAGAGGGAGAGC, the 3' 
oligonucleotide primer sequence was 

GAGTGGGGCGGGTATGATTGGCTTTCAGAAAAGGTAT and DNA was 
30 amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed TGGTGAATTGGGAGAGGGACTGGAGAGGGACAGC into E, coli 
BL21DE3. Expression studies and immunoreactivity studies were carried out 
on whole E. coli lysates. Purification was not done for these studies. 
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PG73 

The methods used for PG73 were essentially the same as for PG30 
with the following exceptions. The predicted N- terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
5 sequence was CGGTGAATTCCAACAGACAGGACCGGCCGAACGC, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTTAAGAAAGGTATCTGATAGATCAG and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
10 transformed into £. co7i BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 



PG74 

15 The methods used for PG74 were essentially the same as for PG30 

with the following exceptions. The predicted N- terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
sequence was TGCTGAATTCCAAGAAAATAATACAGAAAAGTCA, the 3' 
oligonucleotide primer sequence was 

20 GAGTGCGGCCGCTGAGGTTTAATCCTATGCCAATACT and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
ti'ansformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole coli lysates. Purification was not done 

25 for these studies. 



PG75 

The methods used for PG75 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
30 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was GGCGGGATGCGCTCAGGAGCAACTGAATGTGGTA, the 3^ 
oligonucleotide primer sequence was 

GAGTGCGGGGGCTGTGGAACAAATTGCGC AATGCATC and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
35 expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed intoJJ. coii BL21DE3. Expression studies and immunoreactivity 
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studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG76 

5 The methods used for PG76 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was AGCAGAATTCGGAAACGCACAGAGCTTTTGGGAA, the 3' 
oligonucleotide primer sequence was 

10 GAGTGCGGCCGCTTACCTGCACCTTATGACTGAATAC and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed intoE". co7iBL2lDE3. Expression studies and immunoreactivity 
studies were cari'ied out on whole E. coli lysates. Purification was not done 

15 for these studies. 

PG77 

The methods used for PG77 were essentially the same as for PG30 
with the following exceptions. The predicted N- terminal signal sequence was 
20 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was TGGTGAATTCCAAGAGAAAAAGGATAGTCTCTCT, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTCTTCTTATCGCCATAGAATACAGG and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
25 expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E, coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

30 PG78 

The methods used for PG78 were essentially the same as for PG30 
with the following exceptions. The predicted N- terminal signal sequence was 
removed from the recombinant protein. The 5* oligonucleotide primer 
sequence was GGCAGAATTCCAGGATTCTTCCCACGGTAGCAAT, the 3' 
35 oligonucleotide primer sequence was 

GAGTGCGGCCGCTATGATGATAGTAAAGACTGGTTCT and DNA was 
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amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E, coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
5 for these studies. 



PG79 

The methods used for PG79 were essentially the same as for PG30 
with the following exceptions. The 5' oligonucleotide primer sequence was 
10 TGGTGAATTCGTAGTGAGGCTGGTGGTAATTGTC, the 3' oligonucleotide 
primer sequence was 

GAGTGGGGCCGCTGCCGTCCTGCCTTTCTGCCTGACG and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
15 transformed into co7j BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 



PG80 

20 The methods used for PG80 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GGGGGAATTGCAAAAGGTGGAGTTGGACTAGGAT, the 3' 
oligonucleotide primer sequence was 

25 GAGTGGGGCCGCTGTTGAAAGTCGATTTGACGGGAAG and DNA was 
amphfied with the Tth XL PGR kit. The PGR product was cloned into tlie 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E, coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 

30 for these studies. 



PG81 

The methods used for PG81 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
35 removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GTTTGAATTGGAGGATTTTGTGTATGAAATAGGA, the 3^ 
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oligonucleotide primer sequence was 

GAGTGCGGCCGCTTTGTTTATTACAAAAAGTCTTACG and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
5 transformed into E, coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG82 

10 The methods used for PG82 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GAACGAATTCGAGAAGAACAACTTTACCGAGTCG, the 3' 
oligonucleotide primer sequence was 

15 GAGTGCGGCCGCTGTTGAGTTTCAGCTTTTTAAACCA and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into co7j BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole J?, coli lysates. Purification was not done 

20 for these studies. 

PG84 

The methods used for PG84 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
25 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was TGCTGGATCGCAGAATGATGAGATCTTGGAAGAT, the 3' 
oligonucleotide primer sequence was 

GAGTGGGGGGGGTATTGGGTCCCGGGGCAGTACGTGC and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
30 expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed intoR coii BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E, coli lysates. Purification was not done 
for these studies. 
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PG85 

The methods used for PG85 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
5 sequence was CGGTGAATTCGTACCAACGGACAGCACGGAATCG, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGGCGCTCAGATTGGTGCTATAAGAAAGGTA and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
10 transformed into co/j BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E, coli lysates. Puinfication was not done 
for these studies. 



PG86 

15 The methods used for PG86 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was TGCTGGATCCCAAACGCATGATCATCTCATCGAA, the 3' 
oligonucleotide primer sequence was 

20 GAGTGCGGCCGCTGTGGTTCAGGCCGTGGGCAAATCT and DNA was 
ampUfied with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 

25 for these studies. 



PG87 

The methods used for PG87 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
30 removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GGCGGAATTCCAGAGCTATGTGGACTACGTCGAT, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTATTACTGTGATTAGCGCGACGCTG and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
35 expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed intoJE. coii BL21DE3. Expression studies and immunoreactivity 
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studies were carried out on whole E, coli lysates. Purification was not done 
for these studies. 

PG88 

5 The methods used for PG88 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was AGCAGAATTCGCCGAATCGAAGTCTGTCTCTTTC, the 3' 
oligonucleotide primer sequence was 

10 GAGTGCGGCCGCTCGGCAAGTAAGGCTTTAGTGGGGA and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E, coli BL21DE3. Expression studies and immunoreactivity 
studies were canned out on whole E. coli lysates. Purification was not done 

15 for these studies. 

PG89 

The methods used for PG89 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
20 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was TGCTGAATTCCAATCGAAGTTAAAGATCAAGAGC, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTATTAGTCCAAAGACCCACGGTAAA and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
25 expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed intoE. coif BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

30 PG90 

The methods used for PG90 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GGCAGAATTGGAAACAAGGAGGAAGAGTAGGGGG, the 3' 
35 oligonucleotide primer sequence was 

GAGTGCGGGCGGTTTTTTGTTGTGATAGTGTTTGGGC and DNA was 
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amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
ti^ansformed intof. co/i BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E, coli lysates. Purification was not done 
5 for these studies. 



PG91 

The methods used for PG91 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
10 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was TGCTGAATTCCAGAGGATGGGAGGAGATGATGTC, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTTTCCACGATGAGCTTCTCTACGAA and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
15 expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E, coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E, coli lysates. Purification was not done 
for these studies. 



20 PG92 

The methods used for PG92 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GGCCGAATTCGCCGATGCACAAAGCTCTGTCTCT, the 3' 

25 oligonucleotide primer sequence was 

GAGTGCGGCCGCTTCGAGGACGATTGCTTAGTTCGTA and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed intoE*. coii BL21DE3. Expression studies and immunoreactivity 

30 studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 



PG93 

The methods used for PG93 were essentially the same as for PG30 
35 with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
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sequence was GGCCGAGCTCCAAGAGGAAGGTATTTGGAATACC, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTGCGAATCACTGCGAAGCGAATTAG and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
5 expression plasmid pET24a using the Sac I and Not I restriction sites and 

transformed into E, coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 



10 PG94 

The methods used for PG94 were essentially the same as for PG30 
with the following exceptions. The predicted N- terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GGCGGAGCTCCAAGAGGAAGGTATTTGGAATACC, the 3' 

15 oligonucleotide primer sequence was 

GAGTGCGGCCGCTTTGTCCTACCACGATCATTTTCTT and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 

20 studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 



PG95 

The methods used for PG95 were essentially the same as for PG30 
25 with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GGCCGAGCTCTGTGGAAAAAAAGAAAAACACTCT, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGGTAACTGTCTCCTTGTCGCTCCCCGG and DNA was 
30 amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Sac I and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 
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PG96 

The methods used for PG96 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
5 sequence was TGCTGAGCTCCAAACGCAAATGCAAGCAGACCGA, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTTTTGAGAATTTTCATTGTCTCACG and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Sac I and Not I restriction sites and 
10 transformed into co7i BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli ly sates. Purification was not done 
for these studies. 



PG97 

15 The methods used for PG97 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GGCGGGATCCCAGTTTGTTCCGGCTCGGAGCACA, the 3' 
oligonucleotide primer sequence was 

20 GAGTGCGGCCGCTCTGTTTGATGAGCTTAGTGGTATA and DNA was 
amplified with the Tth XL PGR kit The PGR product was cloned into the 
expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3, Expression studies and inmxunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 

25 for these studies. 



PG98 

The methods used for PG98 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
30 removed from the recombinant protein. The 5' oligonucleotide primer 

sequence was AGGAGAATTCCAAGAAAGAGTCGATGAAAAAGTA, the 3' 
oligonucleotide primer sequence was 

GAGTGGGGGCGGTTAGGTGTGTAACATTAAGTTTTTTATTGAT and DNA 
was amplified with the Tth XL PGR kit The PGR product was cloned into 
35 the expression plasmid pET24a using the Eco RI and Not I restriction sites 
and transformed into £. coli BL21DE3. Expression studies and 
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immunoreactivity studies were carried out on whole E, coli lysates. 
Purification was not done for these studies. 

PG99 

5 The methods used for PG99 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was TGCTGAATTCAAGGACAATTCTTCTTACAAACCT, the 3' 
oligonucleotide primer sequence was 

10 GAGTGCGGCCGCTTCGAATCACGACTTTTCTCACAAA and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed intoE. coii BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 

15 for these studies. 

PGlOO 

The methods used for PGlOO were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
20 removed from the recombinant protein. The 5* oligonucleotide primer 

sequence was GGCAGAATTCCAGTCTTTGAGCACAATCAAAGTA, the 3' 
oligonucleotide primer sequence was 

GAGTGCGGCCGCTGATAGGCAGCTTGATGCTCTTAGC and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
25 expression plasmid pET24a using the Eco RI and Not I restriction sites and 
transformed intoK coli BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

30 PGlOl 

The methods used for PGlOl were essentially the same as for PG30 
with the following exceptions. The 5' oligonucleotide primer sequence was 
TGCTGAATTGAAAGGCAAGGGCGATCTGGTCGGG, the 3' oligonucleotide 
primer sequence was 

35 GAGTGCGGGGGCTTGTGTTGTGGAACTTGGGCGAGTA and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
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expression plasniid pET24a using the Eco RI and Not I restriction sites and 
transformed intof. coii BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

5 

PG102 

The methods used for PG102 were essentially the same as for PG30 
with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
10 sequence was GGGCGAATTCGAGATGGATATTGGTGGAGACGAT, the 3' 
oligonucleotide primer sequence was 

GACTGCGGCGGGTCTCTAGAATGATTTTTTCCACGAA and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Eco RI and Not I restriction sites and 
15 transformed into co/i BL21DE3. Expression studies and immunoreactivity 
studies were carried out on whole E. coli lysates. Purification was not done 
for these studies. 

PG104 

20 The methods used for PG104 were essentially the same as for PG30 

with the following exceptions. The predicted N-terminal signal sequence was 
removed from the recombinant protein. The 5' oligonucleotide primer 
sequence was GAACGGATCGAACGTGTCTGCTCAGTCACCCCGA, the 3' 
oligonucleotide primer sequence was 

25 GAGTGCGGCCGCTTCTGAGCGATACTTTTGCACGTAT and DNA was 
amplified with the Tth XL PGR kit. The PGR product was cloned into the 
expression plasmid pET24a using the Bam HI and Not I restriction sites and 
transformed into E. coli BL21DE3. Expression studies and immunoreactivity 
studies wei^e carried out on whole E. coli lysates. Purification was not done 

30 for these studies. 

Animal antisera and human patient sera. 

Various antisera were raised for detecting the expression and 
35 refolding of the recombinant P, gingivolis proteins. A whole cell antisera was 
raised by injecting New Zealand White rabbits with 3 doses of sonicated 
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P. gingivalis (strain W50) containing approximately 2mg of protein. The first 
dose was given in Freunds complete adjuvant (FCA) and the second and third 
doses were given in Freunds incomplete adjuvant (IF A) at 3 week intervals. 
Doses (1ml) were given intramuscularly into the hind legs and rabbits bled 7 
5 days after the last dose, the blood clotted and serum removed and stored at - 
ZO^'C until required. A second rabbit antisera was produced in a similar 
manner but using a sarkosyl insoluble fraction (each dose was 0.69mg of 
protein) derived from P. gingivalis W50 according to the method of Doidg and 
Trust T. et al 1994 as the immunogen, A third rabbit antisera was produced 

10 in a similar manner to the first only the sarkosyl soluble fraction (Img of 
protein per dose) derived from P. gingivalis W50 cells according to the 
method of Doidg P. and Trust TJ. (1994 Infect Immun 62:4526-33) was used 
as the immunogen. 

A "protected rat serum" pool was also used in these studies and was 

15 obtained from rats immunised with formalin killed whole P. gingivalis cells 
in FLA (strain ATCC 33277; 2 doses of 2x10^ cells, 3 weeks apart). Rats were 
then challenged 2 weeks after their last dose with live P. gingivalis cells 
(strain 33277) given orally as previously described (Klaussen B. et al. 1991, 
Oral Microbiol Immunol 6:193-201) and the serum obtained from these rats 6 

20 weeks after the final challenge inoculation at the time of sacrifice. 

Human sera were obtained from adult patients undergoing treatment 
or assessment for periodontitis at an outpatient clinic. These patients had at 
least 6 teeth with 6mm attachment loss and had P. gingivalis present in their 
sub-gingival plaque as detected using a P. gingivalis specific DNA probe. 

25 Sera was pooled from these patients and compared to a pool of sera from 
periodontally healthy patients. 

Immunization and Murine Lesion Model Protocols 

30 The mouse abscess model was used to assess the efficacy of 

immunising mice with recombinant P. gingivalis proteins in protecting mice 
from formation of a subcutaneous abscess. This model has been used by 
others as a predictor of potential vaccines against periodontal disease (Bird 
PS, et al. 1995 J. Periodontol. 66:351-362. BALB/c mice 6-8 weeks old were 

35 immunised by subcutaneously injecting them with 0.1 ml containing either 
10 or 20|ig of recombinant P. gingivalis protein, 20|ig of E. coli lysate protein, 
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2 X 10^ formalin killed cells of P. gingivalis strain 33277 emulsified in 
incomplete Freund's adjuvant (IFA; Sigma) on day 0. At day 21 mice were 
re-injected with the same dose and then bled 1 week later and evaluated for 
antibody levels. At day 35 mice all mice were challenged with 
5 approximately 2 x 10^ cells of live P. gingivalis (ATCC 33277) by 

subcutaneous injection in the abdomen. Following challenge mice were 
monitored daily for weight loss and the size of the lesion measured for the 
next 10 days. Lesion sizes were measured by length and width and expressed 
as mm^* Groups were statistically analysed using a Kruskal-Wallis one-way 

10 ANOVA and were also individually examined using the unpaired t test or 
Mann-Whitney rank sum test using the Instat statistical package. 

Figure 1 shows the results of one experiment at day 4 after challenge 
(lesions were at maximum size at this time point). Control mice immunised 
with E, coli lysate showed large lesions while mice immunised with killed 

15 cells of R gingivalis strain 33277 were fully protected. This indicates that 
whole cells provide protection against P. gingivalis while E. coli protein 
immunised mice were not protected. Mice given the various PG recombinant 
proteins showed significant levels of protection for PG2, PG22, PG24 and 
PG29 (p<0.05 unpaired t test) while PG8A was not quite significantly 

20 different (p=0,07) compared to the£*. coli conti'ol group. 

Figure 2 shows the results of a separate experiment using 
combinations of recombinant proteins. Mice given PGl + PG2 showed a 
significant level of protection compared to control mice give E. coli lysate 
(p<0.026 unpaired t test). 

25 

Immunoscreening 

Cloned candidates were cultured in 15ml of Terrific broth, induced 
with IPTG and sampled at 4h post-induction. One ml of culture was 

30 removed, pelleted and the cells resuspended in a volume of PBS determined 
by dividing the OD Aooonm of the culture by 8. An aliquot of lysate (lOOjxl) 
was added to 100|jl1 of 2x sample reducing buffer (125mM Tris pH 6.8, 20% 
glycerol, 4% SDS, 80mM DTT, 0.03% bromophenol blue) and boiled for 
lOmin. SDS-PAGE was performed according to the method of Laemmli UK. 

35 1970 (Nature 227:680-685) using 4-20% 1.0mm Tris-Glycine gels (Novex) 

according to the manufacturers recommendations. Proteins were transferred 
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onto Hybond-C Extra nitrocellulose membranes (Amersham) by transblotting 
and the membranes were then blocked for 2h at room tempera tm'e (RT) in 5% 
skim milk in 20mM Tris, 0.5M NaCl, 0.05% Tween-20, pH 7.5 (TTBS). 
Immunoscreening was performed separately with the rabbit 
5 anti-P. gingivalis whole cell serum, the rat protective serum, a pool of human 
periodontal patients serum, and in many cases an anti-T7-Tag antibody HRP 
conjugate (Novagen). Prior to use, the rabbit, rat and human sera were 
diluted 1/5000, 1/1000 and 1/500 respectively in 5% skim milk in TTBS and 
absorbed with lOOfil (for the rabbit serum) or 250|il (for the rat and human 

10 sera) E. coli extract (20mg/ml; Promega) for 6h at RT. 

Membranes were incubated overnight at RT with the absorbed 
antisera, or for 1 hr at RT with 1/5000 diluted anti-T7-Teg conjugate. 
Following 3xlOmin washes with TTBS, HRP-conjugated anti-rabbit (Silenus), 
anti-mouse (Silenus) or anti-human (KPL) antibody, diluted 1/5000 in 5% 

15 skim milk in TTBS, was added for ih at RT. Membranes were washed as 
before, prior to addition of TMB membrane peroxidase substi'ate (KPL) for 
detection of immunoreactive proteins. Results of reactivity for the 
recombinant R gingivalis proteins is shown in Table 7. 

In addition some of the sera (pooled sera diluted 1/1000) from the 

20 mice immunised with P. gingivalis recombinant proteins (prior to challenge) 
were analysed for their reactivity against Western blots of whole native W50 
P. gingivalis proteins using similar techniques as those outlined above. PG2, 
PG8A, PG29 and PG3 all showed bands at a similar molecular weight to that 
of the recombinant PG protein in the native W50 blot. This indicates that PG 

25 proteins are expressed in the W50 strain and that the recombinant proteins 
have at least some identical immunogenicity to the native proteins. 

m-RNA analysis 

30 Hot Phenol RNA Extraction 

P. gingivalis W50 cells (150ml culture) were grown anaerobically to 
mid log phase (OD Acoo=0.18) mixed with 50% glycerol and stored at -70°C 
until RNA extraction. Cells were pelleted by centrifugation at 6000g, and 
35 resuspended in 8ml ASE (20mM NaOAc, 0.5% SDS, ImM EDTA). An equal 
volume of 20mM NaOAc(pH 4.5)-saturated phenol was added and mixed by 



BNSDOCID: <WO 9g29870A1J_> 



PCT/AU98/01023 



wo 99/29870 



PCT/AU98/01023 



shaking for 30 seconds, incubated at 65°C for 5 minutes, followed by a 
further 5 second shaking and repeated incubation. After cooling, 2ml 
chloroform was added and mixed by shaking for 5 seconds, and the mixture 
spun at lOOOOg for 10 minutes at 4°C. The top aqueous phase was transferred 
5 and re-extracted by repeating the phenol and chloroform steps. The aqueous 
phase was transferred again and lOOU RNase inhibitor (RNAsin; Promega) 
were added. RNA was precipitated with 3 volumes 100% ethanol at -2(f C 
overnight. The RNA precipitate was recovered by centrifugation at lOOOOg at 
4°C for 15 minutes, then washed with 100% ethanol, dried and resuspended 
10 in 600^61 sterile, deionised, dH^O with 1/xl of fresh RNase inhibitor. RNA was 
aliquoted and stored at -ZO^'C. The RNA concentration was determined 
spectrophotometrically. A formaldehyde RNA gel confirmed RNA integrity 
(Sambrook J. et al. 1989, Molecular Cloning. A laboratory manual. Cold 
Spring Laboratory Press, New York. 2nd Edition). 

15 

RT-PCR 

The isolated RNA was used as a template for Reverse Transcription 

20 (RT) to produce cDNA, Varying RNA concentrations were used for the RT as 
each RNA transcript was potentially present at different levels. Subsequent 
amplification of the cDNA was performed using Polymerase Chain Reaction 
(PCR). RT-PCR was performed using GeneAmp® RNA PCR Kit (Perkin 
Elmer) according to the manufacturer's protocol with the following exception 

25 to the PCR; 35 cycles were performed as follows: Melt phase 95°C for 30 
seconds. Anneal phase varied between 50-60°C for 30 seconds, Extension 
phase 72'*C for 1 minute. Amplification was performed in a PTC-100 
Programable Thermal Controller (MJ Research Inc.). As a control to 
demonstrate that the amplified product did not arise from contaminating 

30 DNA, Reverse Transcriptase (RTase) was omitted from a parallel tube. The 
PCR products were examined against DNA markers (GIBCO IkB ladder) on a 
1% agarose gel stained with ethidium bromide. 

RT-PCR results are shown in Table 6 using the oligonucleotide 
primers as used in "Cloning, expression and purification of recombinant 

35 P. gingivalis genes" section described above,except for the following changes. 
For PGl the 3' reverse primer used was 
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GCGCCTCGAGATTCATTTCCTTATAGAG, for PG4 the 5' forward primer 
was CTTCTTGTCGACTACAGCGGACATCATAAAATC and the 3' reverse 
primer was TTCCACCTCGAGTTAACGCAACTCTTCTTCGAT, for PG6 the 5' 
forward primer was TAAAGAATTCTGCCTCGAACCCATAATTGCTCCG, for 
5 PGIO the 5' forward primer was CGCGCATATGGATAAAGTGAGCTATGC 
and the 3' reverse primer was 

CGCGCTCGAGTTTGTTGATACTCAATAATTC, for PG13 the 5' forward 
primer was GCCCGGCGCCATGCGGACAAAAACTATCTTTTTTGCG and the 
3' reverse primer was 

10 GCCCGGCGCCTTAGTTGTTGAATCGAATCGCTATTTGAGC. Amplification 
of P. gingivalis transcripts is a Hkely indication that RNA for a specific 
candidate is present and that the protein is produced. However, where there 
is no amplification achieved this does not indicate that this gene is never 
transcribed and may be the result of the culture conditions or the state of the 

15 cells when haivested. 

Table 6. Expression of PG m-RNA with in vitro grown P. gingivalis W50. The 
symbols are + band visible on agarose gel, - no band present on agarose gel, 
ND not detected. 



PG # 


RNA 


Annealing 
temp. °C 


RT-PCR 


PGR (-RT) 


Approx. 
fragment 
size bp 


Expected 
fragment 
size bp 


1 


0.15 


55 


+ 




1300 


1362 


2 


1.0 


50 


+ 




3200 


3051 


3 


0.15 


60 


+ 




720 


690 


4 


2.9 


55 






N.D. 


2000 


5 


0.02 


50 


+ 




1000 


947 


6 


1.0 


55 


+ 




1000 


972 


8A 


0.15 


50 


+ 




1200 


1278 


10 


0.15 


55 


+ 




590 


585 


11 


0.10 


60 






960 


942 


12 


0.02 


60 


+ 




880 


831 


13 


1.0 


50 


+ 




2150 


2274 


14 


0.15 


60 


+ 




1050 


996 
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PG# 


RNA 


Annealing 
temp. °C 


RT-PCR 


PGR (-RT) 


Approx. 
fragment 
size bp 


Expected 
fragment 
size bp 


22 


1.0 


60 






N.D. 


228 


24 


1.0 


55 


+ 


+ 


1150 


1194 


29 


0.15 


60 


+ 




880 


885 



Table 7: Immunoblot results of proteins expressed iiiE.coli against rabbit, rat 
and human antisera. Deduced MW was calculated from amino acid sequence 
5 of the P. gingivalis proteins, some of which had their N-terminal signal 
sequences removed. Apparent MW was determined from SDS-PAGE gels. 
The N- and C-terminal tags add approximately 2.5 KDa to the deduced MW of 
the recombinant proteins. The symbols are -h positive, - negative, +/- weak 
positive, ND not done. 



Protein 


Deduced 


Apparent 




Antisera reactivity 




number 


MW (KDa) 


MW (KDa) 










T7 


i Rabbit \ Rat I 


Human 


[ PGl 


47.5 


63 


ND 






L..PG2 


112.4 


125.7 


ND 


j + 1 - I 




i PG3 


22.6 

- — - 


18.3 


ND 






| PG4 


75 


90.6 


ND 






; PG5.. 


34.9 


43.8 


ND 






PG6 


36.7 


47.1 


ND 






1 PG8 


67.5 


63.1 


ND 


1 h \ ^ [ 




JPG8A 


47.7 


90.6 


ND 






PGIO 


21.3 


25.5 


ND 


1 + 1 - i 




PGll 


36.2 


42.4 


ND 






;PG12 


30.7 


30.6 


ND 






PG13 


84.5 


101 


ND 






PG14 


36 


42.4 




j - ; + i 


+ 


PG22 


8^6 


11.1 


. ND 






PG24A 


47 _ 


63.1 


ND 






PG29 


31.1 


40.9 


ND 


\ + 1 + [ 
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Protein 
number 


Deduced 
MW (KDa) 


Apparent 
MW (KDa) 


Antisera reactivity 








T7 


Rabbit 


Rat 


Human 


PG30 


35.1 


46.9 


+ 


_ 




- 


PG31 


16.7 


- 






- 


- 


PG32 


41.2 


59.5 


•f 




4- 


- 


PG33 


39.9 


52.7 


4- 


4- 




- 


PG35 


92.6 


116.6 




_ 


_ 




PG36 


98.9 


120.2 






_ 


_ 


PG37 


18.8 


23.1 


+ 


4- 






PG38 


16.1 


22.9 


-h 


_ 


_ 


_ 


PG39 


87.9 


116.6 






_ 


_ 


PG40 


76.6 


103.1 


+ 




_ 




PG41 


48.3 


81.1 






4- 


4- 


PG42 


59.3 


73.9 










PG43 


27.1 


50.3 






_ 




PG44 


28.6 


32.3 


+ 




4- 




PG45 


84 


100.6 




_ 


_ 


_ 


PG46 


83 


97.7 






_ 


- 


PG47 


93.7 


42.5 








4- 


PG48 


45.2 


37.9 




_ 




_ 


PG49 


33.3 


64.1 


+ 




4- 


_ 


PG50 


91.9 


113.2 




+ 


_ 


- 


PG51 


19.6 


27.2 






_ 


_ 


PG52 


50.4 


64.4 




+ 


_ 




PG53 


47.4 


45.4 




- 


- 


4- 


PG54 


101.4 


46.7 


4- 


+ 






PG55 


70.4 


68.4 


+ 








PG56 


142.3 












PG57 


100 


134.5 




+ 


4- 


+ 


PG58 


63 


82.9 










PG59 


33.3 


43.6 


-f- 








PG60 


55.6 


77.8 










PG61 


81.5 


107.3 
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Protein 


Deduced 


Apparent 




Antisera reactivity 


number 


MW (KDa) 


MW (KDa) 














T7 




1 Rabbit 


Rat 


i Human 


1 PG62 


51.9 


58.4 


\ 4- 




j 


[ PG63 


29.6 


43.6 


\ + 




I 


i PG64 


[ 

18.5 




\ 26,9 


\ ± 




- 


[ PG65 


25.9 


28.8 






; - i 


1 PG66 


; 22.2 


25.1 






\ + i 


i - ; 


[ PG67 


103.7 


105 







i 


[ PGB8 


133.3 


30.7 






+ 


4; \ 


[ PGeg 


44.4 


50.8 


+ 






\ PG70 


25.9 


30.8 








1 PG71 


88.9 


105.5 


+ 






[ PG72 


- 

40.7 


49.8 






- j 


1 PG73 


40.7_ 


29 


1/:.... 




- i 


i PG74 


22.2 


32.5 


+ 






1 PG75 


40.7 


46.7 






i 


1 PG76 




48.1 


55.6 


+ 








PG77 


29.6 


36.9 










- 


PG78 


33.3 


35.4 


4- 








PG79 


33.3 


- 








_ 


PG80 


25.9 


20.5 


+ 






_ 


PG81 


23 


25,8 








- 


PG82 


44.8 


48.5 


+ 






_ 


PG84 


41.7 


52.4 


+ 






+/- 


PG85 


62.7 


72.4 


4- 








PG86 


21.7 


27.4 


4- 






±/:.. 


PG87 


83 


91.3 


4- 






+ 


PG88 


27 


40.1 


4- 








PG89 


26.2 


29.4 


4- 








PG90 


23 


28.4 


4- 








PG91 


57.2 


85.7 


4- 


4~ 


4- 


+ 


PG92 


83.6 


110,4 


+ 






+ 


PG93 


83.4 


110,4 


4- 






+ 
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X I (JlcIU 




A n x\ n w*ri n 
ivi V V ^ jv d, y 


Antisera reactivity 








"T''7 

1 / 


: ivauDll : SxSiX 


xiuuiaii 








4- 


1 + 1 + i 


+ i 




AAA 

44.4 


5 7.5 


+ 


1 - 1 + 1 


+ 1 


PG98 


33.3 


36 








PG99 


40.7 


55.6 






+ \ 


PGlOO 


29.6 


10.8 


+ 






PGlOl 


14.8 


19.7,14.1 i 








PG102 


59.3 


70.3 


+ 




+ 1 


PG104 


40.7 


57.5 









a. Positive reaction detected with the rabbit antiserum to sarkosyl insoluble 
5 R gingivalis antigen. 

b. Purified protein demonstrated weak positive reaction with the rabbit 
antiserum to whole P. gingivalis. 

It will be appreciated by persons skilled in the art that numerous 
10 variations and/or modifications may be made to the invention as shown in 
the specific embodiments without departing from the spirit or scope of the 
invention as broadly described. The present embodiments are, therefore, to 
be considered in all respects as illustrative and not restrictive. 
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CLAIMS 

1. An isolated antigenic Porphoryxnonas gingivalis polypeptide, the 
polypeptide comprising; 

an amino acid sequence selected from the group consisting of SEQ. 
5 ID. NO. 265 to SEQ. ID. NO. 528, SEQ. ID. NO. 531 and SEQ. ID. NO. 

532; or 

an amino acid sequence at least 85%, preferably at least 95%, 
identical to an amino acid sequence selected from the group 
consisting of SEQ. ID. NO. 265 to SEQ. ID. NO. 528, SEQ. ID. NO. 
10 531 and SEQ. ID. NO. 532; or 

at least 40 amino acids having a contiguous sequence of at least 40 
amino acids identical to a contiguous amino acid sequence selected 
from the group consisting of SEQ. ID. NO. 265 to SEQ. ID. NO. 528, 
SEQ. ID. NO. 531 and SEQ. ID. NO. 532. 
15 2. A polypeptide as claimed in claim 1 in which the polypeptide 

comprises an amino acid sequence selected from the group consisting of 
SEQ. ID. NO. 265 to SEQ. ID. NO. 528, SEQ. ID. NO. 531 and SEQ. ID. NO, 
532. 

3. A polypeptide as claimed in claim 1 in which the polypeptide 
20 comprises an amino acid sequence at least 85%, preferably at least 95%, 

identical to an amino acid sequence selected from the group consisting of 
SEQ. ID. NO. 265 to SEQ. ID. NO. 528, SEQ. ID. NO. 531 and SEQ. ID. NO. 
532. 

4. A polypeptide as claimed in claim 1 in which the polypeptide 

25 comprises at least 40 amino acids having a contiguous sequence of at least 40 
amino acids identical to a contiguous amino acid sequence selected from the 
group consisting of SEQ. ID. NO. 265 to SEQ. ID. NO. 528, SEQ. ID. NO. 531 
and SEQ. ID. NO. 532. 

5. A polypeptide as claimed in claim 1 in which the polypeptide 
30 comprises; 

an amino acid sequence selected from the group consisting of SEQ. 
ID. NO. 386 to SEQ. ID. NO. 528 and SEQ. ID. NO. 532; or 
an amino acid sequence at least 85%, preferably at least 95%, 
identical to an amino acid sequence selected from the group 
35 consisting of SEQ. ID. NO. 386 to SEQ. ID. NO. 528 and SEQ. ID. NO. 

532; or 
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at least 40 amino acids having a contiguous sequence of at least 40 
amino acids identical to a contiguous amino acid sequence selected 
from the group consisting of SEQ. ID. NO. 386 to SEQ. ID. NO. 528 
and SEQ. ID. NO. 532. 
5 6. A polypeptide as claimed in claim 1 in which the polypeptide 
comprises an amino acid sequence selected from the group consisting of 
SEQ. ID. NO. 386 to SEQ. ID. NO. 528 and SEQ. ID. NO. 532. 

7. A polypeptide as claimed in claim 1 in which the polypeptide 
comprises an amino acid sequence at least 85%, preferably at least 95%, 

10 identical to an amino acid sequence selected from the group consisting of 
SEQ. ID. NO. 386 to SEQ. ID. NO. 528 and SEQ. ID. NO. 532. 

8. A polypeptide as claimed in claim 1 in which the polypeptide 
comprises at least 40 amino acids having a contiguous sequence of at least 40 
amino acids identical to a contiguous amino acid sequence selected from the 

15 group consisting of SEQ. ID. NO. 386 to SEQ. ID. NO. 528 and SEQ. ID. NO. 
532. 

9. A polypeptide as claimed in claim 6 in which the polypeptide 
comprises an amino acid sequence selected from the group consisting of 
SEQ. ID. NO. 386, SEQ. ID. NO. 424, SEQ. ID. NO. 425, SEQ. ID. NO. 434, 

20 SEQ. ID. NO. 447, SEQ. ID. NO. 458, SEQ. ID. NO. 475, SEQ. ID. NO. 498, 
SEQ. ID. NO. 499, SEQ. ID. NO. 500, SEQ. ID. NO. 501, SEQ. ID. NO. 387, 
SEQ. ID. NO. 400, SEQ. ID. NO. 411, SEQ. ID. NO. 419, SEQ. ID. NO. 420, 
SEQ. ID. NO. 427, SEQ. ID. NO. 429, SEQ. ID. NO. 433, SEQ. ID. NO. 437, 
SEQ. ID. NO. 438, SEQ. ID. NO. 443, SEQ. ID. NO. 444, SEQ. ID. NO. 448, 

25 SEQ. ID. NO. 449, SEQ. ID. NO. 452, SEQ. ID. NO. 455, SEQ. ID. NO. 457, 
SEQ. ID. NO. 459, SEQ. ID. NO. 461, SEQ. ID. NO. 462, SEQ. ID. NO. 463, 
SEQ. ID. NO. 467, SEQ. ID. NO. 468, SEQ. ID. NO. 469, SEQ. ID. NO. 482, 
SEQ. ID. NO. 484, SEQ. ID. NO. 485, SEQ. ID. NO. 494, SEQ. ID. NO. 508, 
SEQ. ID. NO. 509, SEQ. ID. NO. 510, SEQ. ID. NO. 520, SEQ. ID. NO. 521, 

30 SEQ. ID. NO. 522, SEQ. ID. NO. 525, SEQ. ID. NO. 526, SEQ. ID. NO. 528, 
SEQ. ID. NO. 389, SEQ. ID. NO. 390 and SEQ. ID. NO. 391. 

10. An isolated antigenic Porphor}monas gingivcdis polypeptide, the 
polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ. ID. NO. 386 to SEQ. ID. NO. 528 and SEQ. ID. NO. 532 

35 less the leader sequence set out in Table 3. 
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11. An isolated DNA molecule, the DNA molecule comprising a 
nucleotide sequence which encodes the polypeptide as claimed in any one of 
claims 1 to 10 or a sequence which hybridises thereto under conditions of 
high stringency. 

5 12. An isolated DNA molecule as claimed in claim 11 in which the DNA 
molecule comprises a nucleotide sequence selected from the group 
consisting of SEQ. ID. NO. 1 to SEQ. ID, NO. 264, SEQ. ID. NO. 529 and SEQ. 
ID. NO. 530. 

13. A recombinant expression vector comprising the DNA molecule as 
10 claimed in claim 11 or claim 12 operably linked to a transcription regulatory 

element. 

14. A cell comprising the recombinant expression vector as claimed in 
claim 13. 

15. A method for producing a P. gingivalis polypeptide comprising 
15 culturing the cell as claimed in claim 14 under conditions that permit 

expression of the polypeptide. 

16. A composition for use in raising an immune response directed 
against P. gingivalis in a subject, the composition comprising an effective 
amount of at least one polypeptide as claimed in any one of claims 1 to 10 

20 and a pharmaceutically acceptable carrier. 

17. A composition as claimed in claim 16 in which the composition 
further comprises at least one DNA molecule as claimed in claim 11 or claim 
12. 

18. A composition as claimed in claim 16 or claim 17 in which the 
25 pharmaceutically acceptable carrier is an adjuvant. 

19. A method of treating a subject for P. gingivalis infection comprising 
administering to the subject a composition as claimed in any one of claims 
16 or claim 18 such that treatment of P. gingivalis infection occurs. 

20. A method as claimed in claim 19, wherein the treatment is a 
30 prophylactic treatment. 

21. A method as claimed in claim 19, wherein the treatment is a 
therapeutic treatment. 

22. A composition for use in raising an immune response directed 
against P. gingivalis in a subject, the composition comprising an effective 

35 amount of at least one DNA molecule as claimed in claim 11 or claim 12 and 
a pharmaceutically acceptable carrier, 
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23. A composition as claimed in claim 22 in which the pharmaceutically 
acceptable carrier is an adjuvant. 

24. A method of treating a subject for P. gingivalis infection comprising 
administering to the subject a composition as claimed in claim 22 or claim 23 

5 such that treatment of P. gingivalis infection occurs. 

25. A method as claimed in claim 24, wherein the treatment is a 
prophylactic treatment. 

26. A method as claimed in claim 24, wherein the ti^eatment is a 
therapeutic treatment. 

10 27. An antibody raised against a polypeptide as claimed in any one of 
claims 1 to 10. 

28. An antibody as claimed in claim 27 in which the antibody is 
polyclonal. 

29. An antibody as claimed in claim 27 in which the antibody is 
15 monoclonal. 

30. A composition comprising at least one antibody as claimed in any 
one of claims 27 to 29. 

31. A composition as claimed in claim 30 in which the composition 
adapted for oral use. 

20 32. A nucleotide probe comprising at least 18 nucleotides and having a 
contiguous sequence of at least 18 nucleotides identical to a contiguous 
nvicleotide sequence selected from the group consisting of SEQ. ID. NO. 1 to 
SEQ. ID. NO. 121, SEQ. ID. NO. 529 and sequences complementary thereto. 

33. A nucleotide probe as claimed in claim 32 in which the probe further 
25 comprises a detectable label. 

34. A method for detecting the presence of P. gingivalis nucleic acid in a 
sample comprising: 

(a) contacting a sample with the nucleotide probe as claimed in claim 
32 or claim 33 under conditions in which a hybrid can form between 

30 the probe and a P. gingivalis nucleic acid in the sample; and 

(b) detecting the hybrid formed in step (a), wherein detection of a 
hybrid indicates the presence of a P. gingivalis nucleic acid in the 
sample. 
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PG2 PG8A PG22 PG24 PG29 FK E. coli 

33277 



Group (n=10) 



Figure 1 
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Figure 2 
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{ 2) IHFORMATIOIJ FOR SEQ ID NO:l 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1362 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS : double 
(D} TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SEMSE: NO 



15 (vi) ORIGINAL SOURCE: 

{A} ORGANISM: PORY PHYROHOHAS GIWGIVALIS 

(ix) FEATURE: 

(A) N.AIIE/KEY: misc_feature 
20 (B) LOCATION 1...1362 

(xi) SEQUENCE DESCRIPTION: SEQ ID H0:1 

TTCTGTGTCA TGGCAAAAGT TATAAAAACA AAAAAAGGCC TTGCACTTAA TCTGAAAGGA 60 

25 AAACCGCTGC CCGAGATGCT GGCCGAACCG GCCCAAAGTC CTACTTACGC GGTCGTGCCC 120 

GACGATTTTG AAGGTGTTAT CCCCAAGGTG ACGGCTCGTC CGGGGGATAA GGTGCGTGCC 180 

GGCTCAGCAC TGATGCACCA CAAGGCATAT CCGGAGATGA AGTTTACAAG TCCGGTTAGC 24 0 

GGCGAAGTGA TCGCGGTGAA TCGCGGTGCC AAGCGCAAGG TGTTGAGCAT CGAGGTGAAA 300 

CCGGACGGAC TGAACGAATA CGAGTCATTC CCTGTCGGGG ATCCGTCTGC CCTCTCTGCC 360 

30 GAACAGATCA AGGAGCTTTT ACTGTCGAGC GGTATGTGGG GTTTTATTAA GCAACGTCCT 4 20 

•I'ACGACATAG TGGCTACACC GGATATAGCT CCACGCGACA TTTATATTAC TGCCAACTTT 4 80 

ACTGCACCAT TGGCTCCGGA CTTCGATTTC ATCGTTCGAG GAGAAGAACG CGCCCTGCAG 54 0 

ACTGCCATCG ATGCCTTGGC CAAACTCACG ACAGGAAAGG TGTATGTGGG CCTGAAGCCG 600 

GGTTCATCTC TGGGCTTGCA CAATGCAGAA ATCGTAGAAG TACACGGACC TCATCCGGCA 660 

35 GGTAACGTGG GCGTGCTGAT CAATCATACG AAGCCAATCA ATCGGGGCGA AACGGTGTGG 720 

ACGCTCAAGG CTACCGACCT GATCGTGATC GGACGTTTCC TGCTTACGGG CAAAGCCGAT 7 80 

TTTACCAGAA TGATTGCCAT GACCGGCTCA GACGCTGCAG CTCACGGATA CGTCCGTATT 84 0 

ATGCCGGGTT GCAATGTCTT TGCTTCCTTC CCCGGCCGAC TGACAATAAA GGAATCTCAC 900 

GAGCGTGTGA TCGATGGCAA TGTGCTGACC GGTAAGAAGC TCTGCGAGAA GGAGCCTTTC 960 

40 CTGTCAGCCC GGTGTGACCA GATCACGGTG ATCCCCGAAG GCGACGATGT GGACGAACTC 1020 

TTCGGGTGGG CTGCACCCCG TCTCGATCAG TACAGCATGA GCAGAGCTTA TTTCTCTTGG 108 0 

TTGCAGGGGA AAAACAAAGA GTACGTACTC GATGCCCGGA TCAAGGGTGG CGAACGTGCT 114 0 

ATGATCATGA GCAACGAGTA TGACCGCGTT TTCCCGATGG ACATCTATCC GGAGTATTTG 12 00 

CTCAAGGCTA TTATAGCATT CGACATCGAC AAGATGGAGG ACTTAGGCAT ATATGAAGTG 12 60 

45 GCTCCGGAGG ACTTTGCCAC TTGCGAATTT GTGGATACAT CCAAGATCGA GCTGCAGCGT 132 0 

ATCGTTCGCG AGGGCTTGGA TATGCTCTAT AAGGAAATGA AT 13 62 



(2) INFORMATION FOR SEQ ID NO: 2 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 603 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 
55 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 



. (ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1 . . . 603 

70 (xi) SEQUENCE DESCRI PTIOti : SEQ ID NO: 2 

GAACTAAGCA AGTGTTACAT GGATAAAGTG AGCTATGCTC TGGGATTGAG CATCGGTAAT 60 

AATTTCAAGT CTTCGGGCAT CGACAGCGTC GTTATGGATG ATTTCATGCA AGGTCTGTCT 120 

GATGTACTGG AAGAAAAAGC CCCTCAGCTC TCGTATGACG AGGCCAAGCG CGAAATAGAG 180 

75 GCGTATTTCA TGGATTTGCA GCAGAAGGCT GTCAAACTGA ACAAAGAGGC CGGAGAAGAA 24 0 
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TTCCTCAAGA TAAATGCACA CAAGGAAGGT GTGACGACCT TACCGAGCGG CTTGCAATAC 300 

GAAGTCATTA AGATGGGAGA GGGCCCGAAA CCCACCCTTT CGGACACGGT AACCTGTCAT 360 

TATCACGGTA CGCTCATCAA CGGTATCGTT TTCGATAGCT CTATGGACAG GGGAGAACCG 420 

GCCAGTTTCC CTCTAAGAGG AGTTATAGCC GGCTGGACGG AGATTCTTCA ATTAATGCCT 4 80 

5 GTAGGATCCA AGTGGAAAGT AACTATACCG AGCGATCTGG CGTATGGAGA TCGTGGTGCC 5^0 

GGCGAACATA TCAAACCGGG TAGTACGCTC ATTTTTATAA TCGAATTATT GAGTATCAAC 600 

AAA 603 

10 {2) INFORMATION FOR SEQ ID NO: 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 837 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) UOLECULE TYPE: DNA (genomic) 

20 (iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAI-IE/KEY: misc_feature 

(B) LOCATION 1 . . .8 37 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID WO: 3 

CAAAAAAACA AACGAAAGAT GAAA7W\GCA TTACTTATTG GTGCTGCTCT TTTGGGAGCA 60 

GTCAGTTTTG CAAGTGCTCA GTCTTTGAGC ACAATCAAAG TACAGAACAA TTCAGTACAG 120 

35 CAACCTCGTG AGGAAGCCAC TATTCAGGTT TGTGGAGAAT TGGCAGAGCA AGTTGACTGC 180 

ATTGGGACAG GTAATTCTGC AATCATAGCC GCTGCAGCGA AATTTGAAAG CGATGATCTC 24 0 

GAAAGCTATG TTGGCTGGGA GATCATGAGT GTTGATTTCT TCCCTGGATA TAAAGCGTGC 300 

AAGTACACAT CTGCAGTCTG GGCTGATGAT ATGACCATTT TGGGCCAATC AGAAGATAGT 360 

GATCCCGAAA TGCAGACTAT CAACAATCTT GCTCTCAAGA CTAGTGTCAA GATTGAAGCC 4 20 

40 GGC/^AGAATT ACATAGTTGG TTATATTGCT AATACCGCAG GTGGACATCC TATCGGATGT 4 80 

GATCAGGGCC CTGCCGTTGA TGGTTATGGA GATTTGGTTT CTATATCAGA AGATGGTGGT 54 0 

GCTACTTTCC CTCCGTTCGA ATCTCTTCAT CAAGCAGTTC CTACCTTAAA TTACAACATC 600 

TATGTCGTTG TTCATTTGAA GAAGGGTGAA GGTGTTGAGG CTGTTCTTAC CAACGACAAG 660 

GCTAATGCTT ATGTTCAGAA TGGCGTTATC TATGTAGCCG GAGCTAATGG TCGTCAGGTA 7 20 

45 TCTCTGTTCG ACATGAACGG TAAGGTTGTT TATACCGGCG TTAGCGAAAC GATTGCAGCT 7 80 

CCTCAGAAGG GCATGTATAT CCTCCGTGTA GGTGCTAAGA GCATCAAGCT GGCTATC 837 



50 



60 



65 



(2) INFORMATION FOR SEQ ID NO: 4 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
55 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AIITI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



(ix) FEATURE: 

(A) NA14E/KEY: misc_feature 

(B) LOCATION 1 ... 471 

70 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

CGTGnATTCC TGCCGGAGAA AGCTCTCTAT ATCGGCTGCC GCGTGGAGAC GCAAGAGGGG 60 

CATGCCGTAG GTTTCGGACT GGATGACGGC CCTGCGATGA AAGGCAAGGG CGATCTGGTC 120 

GGGAGCTATC TTCCCGGTGC TGCTCCGATG CCTTTTGTCC CGCTTTCTGA TATTCCGGCT 180 

75 CGTTCGATGG ATGCCAATTT TTATATCTAT TCTCGTATTT CACTGGGTTC GGGCACGCAG 2 40 
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GATGTTCTCC AGCATAGGAT GAAAGTGTAT CCCAACCCTG CCACGACGGA GCTGCATGTG 300 

GAAGCCCTGT CCGCATGGGT GGGCGAGCAG GCTGCGGTAT ATGATATGCG TGGTCGTCGG 360 

GTATCGGCTC GGACGGTGGA TAGCGAGAAG CTGTGCATCG ACATTGCCTC ACTGCCCGTG 4 20 

GGCGTCTATA TGCTGCGCAT CGGCAGCTAC TCGGCCAAGT TCGAGAAGAG A 471 

5 

(2) INFORllATIOri FOR SEQ ID WO : 5 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



20 



60 



70 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHOMAS GINGIVALIS 



(ix) FEATURE: 
25 (A) NAI-IE/KEY: misc_feature 

{B} LOCATION 1 . . , 1686 

(xi) SEQUEIJCE DESCRIPTION: SEQ ID MO: 5 

30 ACAAGAAATG TTTCACTCAT AAAAATGCCA AGAATTATGA AATTAAAAAT TGCACTCAGA 60 

CTGCTGCTGG CGACTTTTGC CATAGTTTTA TTTAGCCCTC TGGCCAAGGC CCAGATGGAT 120 

ATTGGTGGAG ACGATGTATT GATCGAGACG ATGTCCACCC TATCAGGATA TTCAGAGGAT 180 

TTTTATTACA AGATGGCTGT GGCAGACAAT GGATGGATCT ATGTGATGTT GGATTTCTCT 24 0 

CGTATTTATT TTGATGATGT CAGGCTGTAT CGTTCCAAAG ACGGTGGTGC TACTTACCAA 300 

35 AAGTTAGGGT CTTTGGGGTC TTTGGTGCCT TATGACTTCG ATGTCTCGCA TTGCGATTTT 360 

ATTGTAACGG GAAAGGATGA AGATGATATC AATGTTTGGA CAGTCATGAC AGCATTCGAA 4 20 

TATGTAGGTG GTACTATTGG CAATGGCGTT TTGCTGATGC ATCGCCATGA TGCAGATATC 4 80 

AATAATACAG AGTGTGTGTA CAAGAAGGAT TTCCCTAATA ATAGACTGAT GGGTGTAGCC 54 0 

ATCGCCTCCA ACTACCGTGC GCCCTCTCCT TACGGTTTGG GGGGCGATCC TTTTGCTCTC 600 

40 GCTGTCGCCG TTAGTGGCTC CGGAAGCGAT CACAGCTTCT TGGACTATAT TTTTTCGTTA 660 

GATGGTGGAG TACACTTTGA GCAAAAGCGT ATTTACACAA GACCCCAAAA ACTGACTATC 7 20 

AATAGAGTAG ACCTTTCATT AGGCAGTACA TCTCCTTCTC TTGGATTTAA TACTTGGCCA 7 80 

CTAATGGGAG TCGTATTCGA AATGAATAAG AACCTTGATG GCTTCGACAT TGGTTTCATT 84 0 

TCCAACTTTG TGGACTATGA TCCCCGCTAT GCGTGGTCTG AACCGATAAT AATAGAAGAA 900 

45 GACTGTGGAT GGACTGATTT TAATCCTTTG GGAGCACTAA GTATAGAGAT CCAAATGATG 960 

TTGGATGACA ATTCGGATAA TACCGTGGGT GGAGAACGCT CCCATAACTT CCTGATCACT 102 0 

TACCCGGGCC ATTACGTATA TCCGAAGCAA TCTTTCAATT ATTCTCCCGG ACATACACCG 1080 

ACAAAGAAAG ATCTGGTCTT TAAACACTGT ATAGGTATTC CGGCTTTGGC ATACGATAAG 114 0 

GAAGGCGATC GTTATCTGAC TACTTTTCAA GATCACAATC TAATGAGATA CAGATGGATC 12 00 

50 AAATACGATG ACATTAACTC TTTTTATGGT TGGAGTTGGC CATATGTATA TGCAAAAGAA 12 60 

GCTAAAGATA AAAAGAGGCG CCGTCCGCAA GTAGCACTCA ATCCTACCAA TGGAAAGGCT 1320 

TGTTGGGTAT GGCATACTCG CAAGAGCCCA TATGATGAAA CCAAACCACA TCCTACTCCT 1380 

GTAATTATTA AACATTTCCT ATGGTCCGAT ACGGAGTGGG TACATGCTCT GGACGTGGGG 14 4 0 

GACGTATTGC AGAAGGAGGG TAGCATGAAG CTCTACCCCA ATCCTGCCAA AG7VATATGTT 1500 

55 CTGATCAACC TACCCAAAGA AGGGGGGCAC GAGGCAGTCG TATACGACAT GCAGGGCCGA 1560 

ATCGTGGAGA AAGTTTCATT TTCAGGGAAA GAATATAAGC TGAATGTGCA GTATCTGTCC 1620 

AAAGGTACGT ACATGCTGAA AGTTGTAGCG GATACGGAGT ATTTCGTGGA AAAAATCATT 1680 

GTAGAG 168 6 



(2) INFORI1ATION FOR SEQ ID NO: 6 



U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 3 base pairs 
65 (B) TYPE: nucleic acid 

(C) STRAIIDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: UNKNOWN 

75 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: POR V PHYROMONAS GIUGIVALIS 

(ix) FEATURE: 

(A) HAilE/KEY: misc_feature 
5 IB) LOCATIOll l...ai7 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

CAAATGAAAC GATTACTCCC CTTTCTCCTT TTAGCAGGAC TCGTAGCCGT AGGAAACGTG 60 

10 TCTGCTCAGT CACCCCGAAT CCCTCAAGTG GATGTACACA CTCGCATCGC AAGAAATGCC 120 

CGTTATCGAC TGGACAAGAT CAGTGTCCCG GATTCTCGTC AGATATTCGA TTACTTCTAT 180 

AAAGAAGAAA CGATACCCAC TAAAATACAA ACGACCACAG GAGGTGCAAT TACAAGCATC 2-^0 

GATTCGCTTT TCTATGAAGA CGACAGGTTG GTTCAGGTGC GCTATTTTGA CAATAACCTT 300 

GAATTAAAAC AAGCGGAGAA GTATGTATAC GACGGTTCTA AGCTGGTCCT TCGAGAAATT 360 

15 CGCAAGTCGC CGACAGACGA AACGCCAATA AAGAAAGTTA GCTATCACTA TCTCTGTGGC ^20 

AGCGATATGC CTTTTGAGAT TACGACAGAG ATGAGCGATG GCTATTTTGA AAGCCATACG 4 80 

CTTAACTATC TGAATGGAAA GATTGCCCGA ATAGATATCA TGACTCAACA GAACCCATCG 5^0 

GCCGAATTGA TCGAAACGGG TAGAATGGTA TATGAGTTTG ATGCCAATAA TGATGCTGTA 600 

CTGCTTCGTG ACAGTGTATT TCTTCCTCTT CAAAACAAGT GGGTAGAAAT GTTTACTCAC 660 

20 CGTTATACAT ACGACAATAA GCATAATTGT ATTCGTTGGG AACAAGACGA ATTCGGCACC 72 0 

CTCACCCTTG CCAACAACTT CGAATACGAC ACCACTATCC CTCTGTCGTC TGTATTGTTC 7 80 

CCCACGCATG AGGAGTTCTT CCGTCCTCTT CTTCCCAATT TTATGAAGCA TATGCGTACG Q4 0 

AAGCAAACGT ATTTCAATAA CTCCGGAGAA GGCTTGTCAG AGGTATGCGA TTACAACTAC 900 

TTCTATACCG ATATGCAGGG TAATGCACTG ACCGATGTTG CCGTGAACGA ATCGATCAAG 960 

25 ATTTATCCTC GTCCTGCCAC GGATTTTCTG CGTATAGAAG GTTCGCAACT GCTTCGCCTT 102 0 

TCGCTATTCG ACATGAACGG GAAGCTCATC AGAGCTACCG AATTGACAGG CGATTTGGCC 1080 

ATTATCGGAG TTGCATCTCT TCCGAGAGGC ACTTACATCG CAG7WVTAAC TGCTGCAAAC ll^lO 

AGCAAAACCA TACGTGCAAA AGTATCGCTC AGA 1173 



30 



40 



(2) It I FORMATION POR SEQ ID NO : 7 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1284 base pairs 
35 {B} TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: HO 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISIl: PORY PHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
50 (B) LOCATION 1...1284 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 

AAGAGGAATC CCTTACCTTT GACGGCATCA AATCGAAAGA TTTTTATAAA AATGAGACAG 60 

55 CATTTATCTC TATTTCCTTT TATCTTGTTT CTGCTTCTTG CCTTCTCTTA TGTCGGTTGC 120 

AG/U\CAGTCC GACAAACACC TAAGCAGTCG GAACGGTACG TCGTAGTCCT GTCTTTGGAC 180 

GGCTTCCGAC CGGACTATAC CGATCGGGCA CGTACACCGG CGTTGGATCG GATGGCACAG 2 40 

GAGGGATTGA GCGGGTCGCT CCAACCATGC TTCCCCTCGC TTACATTTCC CAATCATTAC 300 

AGCATGGCTA CGGGGCTTTA CCCCGATCAT CACGGTATCG TAGCCAATGA GTTTGTGGAT 3 60 

60 TCGCTACTGG GCATCTTTCG TATATCCGAC CGAAAAGCCG TGGAGACCCC CGGATTTTGG 4 20 

GGCGGCGAGC CGGTTTGGAA TACGGCCGCA CGCCAAGGCA TCCGTACCGG TGTCTACTTT ^80 

TGGGTAGGAT CCGAAACGGC TGTGAACGGA AATCGGCCGT GGCGGTGGAA AAAATTCTCC 54 0 

TCCACCGTTC CGTTTCGTGA CCGTGCCGAC TCCGTCATCG CGTGGCTCGG ACTGCCCGAA 600 

AAGGAGCGAC CGCGCTTGCT CATGTGGTAC ATCGAGGAGC CGGATATGAT CGGACACAGC 660 

65 CAAACGCCCG AAAGCCCGCT GACACTGGCA ATGGTAGAGC GGTTGGACAG TGTGGTCGGC 7 20 

TATTTCCGCA AGCGGTTGGA CTCTCTGCCC ATAGCCGCAC AGACCGACTT CATCATAGTA 7 80 

TCCGATCACG GTATGGCCAC GTACGAAAAT GAGA/itATGTG TC7U\TCTGTC GCATTATCTG 84 0 

CCTGCGGACA GTTTCCTCTA CATGGCCACC GGGGCCTTCA CCCACTTGTA CCCGAAGCCC 900 

TCCTATACCG AGCGAGCCTA TGAGATCCTG CGGGCCATTC CACATATATC GGTTTACCGC 960 

70 AAGGGGGAGG TGCCCAAGCG TTTGCGCTGT GGCACCAATC CTCGTTTGGG CGAACTGGTC 1020 

GTGATTCCGG ACATAGGCTC CACCGTCTTT TTCGCAATAA ATGAAGACGT TCGTCCGGGA 108 0 

GCGGCACATG GCTATGACAA CCAAGCACCG GAAATGCGGG CTTTACTCCG GGCTGTCGGA 114 0 

CCCGATTTCC GTCCGGGCAG TAGGGTGGAA AACCTGCCGA ATATCACCAT CTATCCGCTC 1200 

ATATGCAGGC TGTTGGGTAT AGAGCCTGCA CCCAACGATG CGGACGAAAC GTTGCTGAAC 12 60 

75 GGCCTGATCC GAGACAAACG ACCA 128 -J 
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(2) INFORMATION FOR SEQ ID NO: 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) tlOLECULE TYPE: DJJA (genomic) 

(iii) HYPOTHETICAL: NO 

15 (iv) AMTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

20 FEATURE: 

(A) NAJ'IE/KEY: misc^feature 

(B) LOCATION 1 ... 846 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 



CTTTCCCGTG GAGPTTTTCC CCTGATGTAT GGCCGAAGAG GAAGCATTCG TGCCTCTTCA 60 

GGGCATAGGG ACAAAATTTT TAAGAATACA ATTATCAGAT TTATCACAAT GAAAGTAGGT 120 

TTGTTCATCC CCTGTTATGT CAATGCAGTG TATCCGGAAG TGGGTATCGC CACGTACAAA ISn 

CTGCTGAAGA GTTTGGACAT AGATGTCGAC TACCCGATGG ATCAGACATG TTGCGGCCAG 2 40 

dU CCTATGGCCA ATGCCGGATT CGAACAGAAA GCTCAAAAGC TGGCTTTGCG ATTCGAAGAG 300 

CTGTTCGAGT CGTATGATGT AGTCGTAGGG CCATCGGCCA GTTGCGTTGC TTTCGTGAAA 36n 

GAAAACTATG ATCATATCCT CAGACCGACA GGACATGTCT GCAAGTCGGC AGCCAAGGTT 4'^r\ 

CGGGATATAT GCGAGTTCTT GCACGATGAC CTGAAGATCA CCAGCCTCCC CTCCCGATTC 4 80 

GCCCATAAGG TGAGCCTGCA CAACAGTTGC CACGGTGTGC GCGAACTGCA TCTGTCCACC 54 0 

d5 CCCAGTGAAG TGCACCGACC GTACCACAAC AAGGTGCGCC GGCTATTGGA GATGGTGCAG 600 

GGCATAGAGG TATTCGAGCC GAAGCGAATA GACGAATGCT GCGGTTTCGG CGGTATGTAC 660 

TCGGTGGAGG AGCCGGAGGT ATCCACCTGT ATGGGGCATG ACAAGGTGCT GGATCACATA 7 20 

TCCACAGGTG CGGAGTACAT CACAGGGCCG GACAGCTCGT GCCTCATGCA TATGCAGGGA 7 80 

GTGATAGACA GAGAGAAATT GCCGATCAAG ACAATTCATG CAGTAGAAAT TTTAGCAGCA 84 0 

40 AACTTA 



(2) IlfFORMATION FOR SEQ ID NO: 9 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 753 base pairs 

(B) TYPE: nucleic acid 
(CI STRANDEDNESS: double 

„^ (D) TOPOLOGY: circular 

50 

(ii) MOLECULE TYPE: DNA (genomic') 

(iii) HYPOTHETICAL: NO 

55 ( iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIMGIVALIS 

60 (i>:) FEATURE: 

(A) NAI4E/KEY: misc_feature 

(B) LOCATION 1. . .753 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: P 



846 



CCTCTGAAAA AACGAATGGA TATTGTAAGT ATGGCCGATA AAGCTCTTGT AGTGGAGATG 60 

AGAGATGTGA CGCTCTGTCA GGAGGAAAAC GTCATTTTTC AAAATTTGAA TCTGACCCTT 12 0 

TCCGCCGGAG ACTTCGTCTA TCTGATAGGC TCAGTGGGAT CGGGGAAGAG CACTTTGCTG 180 

AAGGCTTTGT ATGCTGAGGT GCCTATCTCT GCCGGTTATG CCCGCGTGAT AGATTATGAT 24 0 

/U CTGGCAAAGT TGAAACGGAA GCAGTTGCCC TATCTGCGCA GGAATTTGGG CATTGTGTTT 300 

CAGGATTTCC AGTTGCTGAA CGGACGTACT GTTGCGGAGA ATTTGGATTT CGTTTTGCGA 360 

GCTACGGACT GGAAAAACCG AGCCGATCGC GAGCAGCGTA TCGAGGAGGT TTTGACCCGT 4^0 

GTGGGAATGT CTCGGAAGGC TTATAAGAGA CCGCACGAAC TGTCCGGAGG GGAGCAACAA 4 8*^ 

CGTGTGGGTA TAGCCAGAGC TTTGCTGGCG AAGCCTGCGT TGATCCTGGC CGACGAACCC 54 0 

/O ACAGGCAACC TCGATTCGGT GACCGGATTG CAGATCGCTT CTCTGCTCTA CGAAATCAGT (=^00 
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AAGCAGGGCA CTGCAGTACT TATGAGCACG CACAACAGCA GCCTGCTGTC GCATCTGCCG 660 

GCACGGACAT TGGCCGTTCG TAAGAATGGC GATGCCTCCT CTTTGGTCGA GCTGAGTGCA 72 0 

GATGCTGTTT CAAGAAAAAA TACGGAAATA GAT 7 53 

5 

(2) INFORMATION FOR SEQ ID HO: 10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 714 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRAUDEDHESS: double 

(D) TOPOLOGY: circular 



15 



(ii) MOLECULE TYPE: DKA (genomic) 

( i i i ) HY POTHETICAL : NO 

(iv) ANT I -SENSE: HO 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
25 (B) LOCATION 1...714 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 10 

ACCAGGCATT GTCCGGCTTG TCGCTCTTCC TTTCACCTCA TAAAAACAAG TAAAACAATG 60 

30 ATTGAAATCA GCAACCTCAC CAAGGTTTTC AGAACAGAAG AAATAGAGAC GGTAGCCCTC 120 

GATGGCGTAT CGCTCAAAGT GGACAAAGGC GAATTTATCG CCATAATGGG GCCTTCGGGA 180 

TGCGGTTVAGT CCACTCTGCT CAATATCCTC GGCCTTCTCG ACAATCCCAC TTCCGGTATC 24 0 

TACAAGCTCG ATGGGGCAGA AGTGGGCAAC CTCCGGGAAA AAGACAGGAC TGCCGTCCGT 300 

AAGGGCAATA TCGGCTTCGT ATTCCAGAGC TTCAACCTCA TCGAAGAGAT GACGGTAAGC 360 

35 GAGAACGTGG AGTTGCCGCT CGTCTATCTG GGTGTGAAGG CCTCCGAGCG GAAAGAGCGA 4 20 

GTGGAGGAGG CACTGCGCAA GATGAGCATC AGCCACCGGG CCGGCCACTT CCCCAATCAG 4 80 

CTCTCCGGAG GACAACAGCA GCGCGTGGCT ATCGCCCGTG CCGTGGTGGC CAATCCGAAG 54 0 

CTCATCCTCG CCGATGAACC CACGGGTAAC CTCGACTCCA AAAACGGAGC CGATGTCATG 600 

GAACTGCTCA GAGGTCTCAA TCGCGAAGGT GCAACCAT CG TCATGGTGAC GCACTCCGAG 660 

40 CACGATGCAC GTAGTGCCGG CCGCATCATC AATCTGTTCG ACGGTAAGAT TCGC 714 

(2) INFORMATION FOR SEQ ID NO: 11 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1812 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIIDEDHESS: double 

(D) TOPOLOGY: circular 

50 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

55 (iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAI^IISM: PORYPHYROMONAS GINGIVALIS 

60 (ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 ... 1812 



65 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 



AGCACAGAAA CAAATAGCAA ATCTGAAATG AAAGAATTTT TCAAAATGTT TTTCGCCTCG 60 

ATCCTCGGGG TTATAACGGC AGGAATCATC TTGTTCTGTA TCTTTCTATT TATCTTTTTC 120 

GGCATCGTAG CCGGTATTGC CTCCAAGGCA ACGGGAGGAA CCATTCCGAA GATCGAAGCA 180 

AACTCCATCC TACATATANA CA.IVTTCTTCT TTCCCTGAGA TCGTATCGGC CAATCCCTGG 24 0 

70 AGCATGCTCA CAGGCAAAGA CGAGTCCGTA TCGCTCTCAC AGGCAGTCGA AGCCATCGGC 300 

CAAGCCAAAA ATAATCCCAA CATAACCGGT ATCTTCCTCG ATCTGGACAA CCTTTCCGTC 360 

GGTATGGCAT CGGCAGAGGA ATTGCGTCGC GCGTTGCAGG ATTTCAAGAT GTCGGGCAAG 4 20 

TTCGTCGTAT CCTATGCCGA CAGATACACC CAAAAGGGTT ACTACCTCTC CAGTATTGCA 4 80 

GACAAACTCT ACCTCAATCC GAAAGGAATG TTGGGGCTTA TCGGGATTGC GACCCAAACA 54 0 

75 ATGTTCTACA AAGATGCCCT CGACAAATTC GGCGTGAAGA TGGAGATCTT CAAGGTAGGC 600 
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ACCTACAAGG CAGCCGTAGA GCCATTCATG CTCAACAGGA TGAGCGATGC CAATCGCGAA 660 

CAAATCACCA CATACATAAA CGGGCTTTGG GACAAGATCA CATCCGATAT TGCAGAGTCG 7 20 

CGCAAGACGG CAATGGATTC CGTGAAAATG TTTGCCGACA AAGGCGAAAT GTTCGGTCTT 7 80 

GCCGAGAAAG CGGTGGAGAT GAAGCTCGTG GATGAGCTGG CTTACCGTAC CGATGTGGAG 8^0 

5 AAAGAACTCA AA/kAGATGTC CCAACGCGGA GAGAAAGATG AACTTCGGTT CGTATCGCTT 900 

TCTCAGGTTC TGGCCAATGG CCCGATGAAC AAAACGAAAG GCAGTCGGAT CGCCGTTCTC 960 

TTTGCCGAAG GTGAAATAAC GGAAGAAATA ATAAAGAAGC CGTTCGACAC TGACGGTAGC 1020 

TCCATCACAC AAGAACTCGC CAAAGAAATC AAGGCAGCAG CCGATGACGA TGATATCAAA 1080 

GCCGTAGTAC TTCGTGTCAA TTCTCCGGGA GGTAGTGCTT TCACTTCCGA ACAGATATGG 114 0 

10 AAGCAGGTAG CCGATCTCAA GGCCAAAAAG CCTATCGTGG TCTCCATGGG CGACGTAGCA 1200 

GCCTCGGGCG GATACTACAT AGCCTGCGCA GCCAACAGTA TCGTGGCAGA GCATACGACT 1260 

CTGACCGGCT CCATCGGCAT ATTCGGCATG TTCCCGAACT TCGCGGGCGT AGCCAAGAAG 1320 

ATAGGAGTGA ATATGGACGT CGTACAGACA TCCAAGTATG CAGACTTGGG CAACACCTTC 1380 

GCTCCGATGA CGGTCGAAGA TCGTGCCCTC ATCCAACGCT ACATAGAGCA GGGCTACGAC 14 40 

15 CTCTTCCTCA CTCGCGTATC GGAAGGCCGC AACCGCACCA AGGCACAGAT CGACAGCATC 1500 

GCTCAAGGCC GTGTATGGCT CGGCGACAAA GCTCTTGCAC TCGGTTTGGT GGATGAGCTT 1560 

GGAGGTTTGG ACACAGCTAT CAAACGGGCC GCGAAGCTGG CTCAGCTCGG TGGCAACTAC 1620 

AGCATAGAGT ATGGCAAGAC CAAGCGCAAC TTCTTCGAAG AGTTGCTCTC CTCATCAGCA 1680 

GCGGATATGA AGTCTGCCAT CCTGAGTACC ATTCTCTCCG ATCCGGAAAT AGAAGTTCTG 17^0 

20 CGCGAACTCC GCTCCATGCC GCCCCGTCCT TCGGGCATAC AGGCACGTCT CCCCTATTAC 1800 

TTCATGCCGT AC 1812 



25 



35 



40 



65 



(2) IN FORM AT I or I FOR SEQ ID HO: 12 



(i) SEQUEIICE CHARACTERISTICS: 

(A) LEMGTH: 97 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDIJESS : double 
30 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DMA (genomic) 

( i i i ) H Y POT H ET I CAL : MO 

(iv) AJITI-SEMSE: NO 

(vi) ORIGIIIAL SOURCE: 

(A) ORGANISM: PORY PHYROMOMAS GIMGIVALIS 



(ix) FEATURE: 

(A) MA]-1E/K:EY: misc_feature 

(B) LOCATION 1..-97 2 

45 (xi) SEQUEt>JCE DESCRIPTION : SEQ ID frO:12 

CTAACGTTGT TTTGTTGCAA CTATTTCAAA CAGATGAGAG CAAACATTTG GCAGATACTT 60 

TCCGTTTCGG TTCTCTTTTT CTTCGGGACA GCGATCGGAC AGGCTCAGAG TCGAAACCGT 120 

ACATACGAGG CTTATGTGAA ACAGTACGCC GACGAAGCTA TCCGACAGAT GAGCCGCTAC 180 

50 AATATACCGG CAAGCATCAC CATAGCACAG GCTTTGGTGG AGACAGGAGC CGGAGCCAGT 24 0 

ACACTGGCCA GCGTACACAA CAATCACTTC GGGATCAAAT GCCACAAATC GTGGACGGGC 300 

AAGCGCACCT ATCGTACCGA CGATGCGCCG AACGAATGCT TCCGCAGCTA TTCGGCCGCT 360 

CGCGAATCGT ATGAAGATCA TTCCCGATTT CTGCTCCAAC CACGCTATCG TCCCCTGTTC 4 20 

AAACTCGACA GAGAAGACTA TCGGGGCTGG GCTACGGGGT TGCAACGCTG TGGCTATGCC 4 80 

55 ACCAATCGGG GCTATGCCAA TCTGCTGATC AAGATGGTGG AGCTGTATGA GCTATATGCT 54 0 

TTGGATCGCG AGAAGTACCC CTCATGGTTC CACAAGTCTT ACCCCGGGTC CAACAAAAAA 600 

TCCCATCAAA CGACCAAGCA GAAGCAGAGC GGACTCAAGC ACGAAGCTTA CTTCAGCTAC 660 

GGACTGCTCT ACATCATAGC CAAGCAAGGC GATACCTTCG ATTCTTTGGC CGAAGAGTTC 7 20 

GACATGAGAG CCTCCAAACT GGCCAAATAC AACGATGCTC CCGTGGATTT CCCGATCGAA 7 80 

60 AAGGGCGATG TGATCTATCT GGAGAAAAAG CACGCATGCT CCATCTCCAA ACACACACAG 84 0 

CACGTAGTGC GTGTGGGCGA TTCGATGCAC AGTATCTCCC AACGCTATGG CATCCGGATG 900 

AAGAACCTCT ACAAGCTCAA CGACAAGGAT GGCGAATATA TACCCCAAGA GGGCGATATA 960 

CTGCGCTTGC GC 972 



(2) III FORMATION FOR SEQ ID NO: 13 



(1} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1599 base pairs 
70 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



75 



(ii) MOLECULE TYPE: DMA (genomic) 
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(iii) HYPOTHETICAL: NO 

(iv) AUTI-SEMSE: NO 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORV PHVROMOIfAS GINGIVALIS 

(ix) FEATURE: 

(A) fJAI^IE/KEY: misc_feature 
10 (B) LOCATION 1...159 9 

(;-ci) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

AGGATCCCCG ACGAGCAGAC CGGACGTATC ATGGACGGAC GTCGATATTC GGATGGCCTC 60 

15 CATCAGGCTA TCGAAGCCAA AGAGCATGTG AAAGTAGAGG CTGCGACACA GACATTTGCA 120 

ACTATCACTT TGCAGAACTA TTTCCGCATG TATCATAAGC TGGCAGGGAT GACCGGTACT 180 

GCTGAAACTG A^\GCGGGAGA GCTTTGGGAC ATCTACAAAC TGGACGTTGT AGTTATTCCG 24 0 

ACAAACAAGC CTATCGCCCG TAAGGATATG AATGATCGTA TCTATAAGAC GGCACGTGAA 300 

AAATATGCAG CAGTTATCGA AGAGATTGTA CGTCTTGTCG AAGAGGGCAG ACCTGTACTT 360 

20 GTCGGTACTA CTTCGGTGGA AATATCCGAA TTGTTGAGCC GTATGTTACG CTTGCGTGGC 4 20 

ATCCAACACA ATGTACTCAA TGCCAAATTG CATCAGAAGG AGGCCGAGAT TGTAGCTCAG 4 80 

GCCGGTCAGA AAGGAACTGT TACCATCGCA ACGAACATGG CCGGTCGTGG TACCGACATC 54 0 

AAGCTCTCTG CCGAGGTTAA GAAAGCCGGG GGTTTGGCTA TCATTGGTAC GGAAAGGCAC 600 

GAATCCAGAC GAGTGGACAG ACAGCTTCGT GGTCGTTCCG GCCGTCAGGG TGATCCCGGT 660 

25 TCGTCCATAT TCTATGTTTC CCTTGAAGAT CATCTGATGC GCCTCTTTGC CACAGAAAAG 720 

ATTGCATCAT TGATGGATCG TTTAGGTTTC AAGGAAGGAG AAGTGCTCGA AAACAACATG 7 80 

CTGAGTAAGT CCGTGGAGCG TGCTCAAAAG AAGGTGGAAG AGAACAACTT CGGTATCCGT 84 0 

AAACATCTGC TTGAGTACGA TGATGTAATG AATTCGCAGC GTGAAGTCAT TTATACCCGT 900 

CG CCGTCATG CTTTGATGGG AGAGCGTATC GGTATGGATG TACTCAATAC CATATACGAC 960 

30 GTATGTAAGG CTCTGATTGA CAATTATGCA GAAGCCAATG ATTTCGAAGG CTTCAAGGAA 1020 

GATCTGATGC GTGCACTCGC GATAGAATCT CCTATCACGC AAGAAATATT CAGAGGTAAG 1080 

AAAGCAGAAG AGCTGACCGA TATGCTTTTC GATGAAGCTT ACAAGTCTTT CCAACGTAAG 114 0 

ATGGATCTGA TCGCAGAAGT GGCCCACCCT GTGGTTCATC AGGTATTCGA GACCCAAGCC 1200 

GCCGTGTACG AGCGCATTCT AATCCCCATT ACGGATGGTA AACGTGTCTA TAACATAGGA 12 60 

35 TGCAATTTGC GTGAAGCGGA TGAAACTCAA GGG7WVAGCA TCATCAAAGA AT TT GAG AAA 132 0 

GCTATCGTAC TGCATACTAT CGATGAGTCT TGGAAAGAAC ATCTGCGTGA GATGGACGAG 1380 

CTTCGTAATT CCGTTCAGAA TGCCAGCTAC GAAAACAAAG ATCCACTACT TATCTATAAA 1440 

CTCGAATCTT ACGAACTGTT CCGCAAGATG GTAGAAGCCA TGAACCGTAA GACCGTAGCG 1500 

ATCCTAATGC GTGCTCGGAT ACCGGTACCG GAGGCTCCTT CCCAAGAAGA GCTGGAACAC 1560 

40 AGGCGGCAAA TAGAAATCCG ACATGCAACC CAACAACGT 15 99 

(2) INFORMATION FOR SEQ ID NO: 14 

45 {i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

50 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

55 (iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

60 (ix) FEATURE: 

(A) NAIIE/KEY: misc_f eature 

(B) LOCATION 1 ... 2160 



65 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14 



AAATCTTGCC GTGTTATTGG CCAGACGAGG CGATACGGAT GCTGCCCTGT CGGACTACGA 60 

CCGTGCCATC AAAGCCTATC CGGAGTTTGC CGATGCCTAT TTCAATCGCG GCCTGCTGTT 120 

GCTTTCGCGC GGAAAGGCCA AAGAAGGCAT CGCCGATCTG AGTCGGGCAG GCGAATACGG 100 

GCTCTACAAG GCGTACAACA TCATCAAACG AATGAGCACG AAGTCATGAT CTCCGTCAAT 240 

70 AACCTGACTG TCGATTTCGG CACCCGTCTG CTCTTCGATC AGGTATCATT CGTCATCAAC 300 

AGGCGCGACC GTATCGCTCT TGTAGGGAAG AACGGTGCCG GCAAGAGTAC GCTGCTCAAG 360 

CTGATTGCCG GCATGGAAGA ACCGACATCC GGACACATAG CACGCCCCAA GGGGATCCGC 4 20 

ATAGGCTATC TGCCGCAGGT GATGCGTTTG CAGGACGGAC ACACGGTTTA CGAAGAGGTC 4 80 

GAGCAGGCTT TCAACGATAT TCGCC7WVTA GAGGAAGAGA TACGGCGTCT GTCCGATGAA 54 0 

75 ATGGCCGGAC GTACGGACTA CGAATCGGAT GACTATATCC GACTGATAGA GCATTATACG 600 
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TXATATGAGCG AGACCCTCTC TCTCATGCAG CAGGGCAACT ATCATGCTGC GATCGAACAG 660 

ACATTGATCG GTCTGGGCTT CGGCCGAGAG GACTTCCACC GCCCCACAGC CGATTTCAGC 7 20 

GGAGGATGGC GTATGCGGAT AGAGCTGGCC AAACTTCTGC TCCAACGCCC CGAAGTTTTG 7 80 
CTGCTCGACG AGCCGACCAA TCACCTCGAC ATCGAATCCA TCGGCTGGCT GGAGCAGTTC 8^0 
5 ATCGCCACCA ATGCAGGAGC CGTTATCCTG GTGTCGCACG ACAGGGCATT CATCGACAAT 900 
ACCACGACAC GCACTATCGA AATAGAACTG GGACATATAT ACGACTACAA GACCAACTAC 960 

AGCCACTATG TGGAGCTGCG CGAAGAGCGG CTGCGACAGC AGATGCGTGC CTACGAGAAT 102 0 

CAGCAGAAGA TGATCCGCGA TACGGAGGAC TTCATCGTVAC GATTCAGATA CAAGGCCACG 1080 

AAGTCCGTAC AGGTACAGAG CCGGATCAAA CAGTTGGAGA AAGTAGAGCG CGTGGAGATA 11^0 

10 GACGAGCGGG ATCGTTCGGC ATTTCACTTC CGCTTTATCC CGGCACAGCC TTCCGGCAGT 1200 

TATCCGCTAA TAGTGGATGA TTTGGCCAAG GCTTATGGCG ATCACCAGGT GTTTTCCGGA 1260 

GCTACATACA CCATCGAAAG AGGCGAAAAG GTGGCTTTCG TAGGCAAAAA CGGTGCCGGC 1320 

AA7W\GTACCA TGGTCAAGTG TATCATGGGA GAGCTGACAG ACTACACCGG CAAGCTCGAA 1380 

CTGGGGCACA ACGTGCAGCT GGGCTACTTT GCCCAAAACG AAGCCCAAGA GCTAAGAGGG 14^0 

15 GATCTCACGG TATTCGACAC GATAGACCGT GAGGCCGTGG GCGACATCCG TCTGCGCCTG 1500 

AACGATTTGC TCGGGGCTTT TCTCTTCGGG GGCGAAGCAT CGGAAAAGAA AGTAAGTGTC 15 60 

CTGAGTGGAG GAGAACGAGC ACGATTGGCT ATTATCAGGC TTTTGCTACA GCCGGCTAAC 162 0 

TTCCTTATTC TCGATGAGCC GACCAATCAC CTCGATATGC GCTCGAAGGA TGTACTGAAA 1680 

GAGGCGATCA AGAACTTCGA TGGGACTGTC ATCGTAGTAT CTCACGACCG TGAGTTCCTC 17 4 0 

20 GATGGGCTTG TCAGCAAGGT GTATGAATTT GCAGATGGAC AGGTGAACGA ACACCTCGGA 18 00 

GGTATATACG ACTATCTCCG GACCCGCCGT ATGCAGACGC TGACAGAGCT GGAGCGAACC 18 60 

ACTACGATCG AAACAAAAAC CACACGGGAG GCTATACCTG AAACGGAAGC CAAAGCGGAC 1920 

TACCGTCGGC AAAAGGAGGT AGCCAAACAG CTGCGCACGT TGGAGCGAAC CGTAGCAACC 1980 

TGCGAGGAGC GGATCGGAAA ATTGGAGTCG GAATTACAGG CAATAGAGAT GCTACTGCAA 204 0 

25 GATCCGAAAC ATGCGACTGA CGCGAATCTG TTCGAGCGAT ACGCCGGCAT GAAACAAGAA 2100 

CTCGAAAAGG CCATGGAGGA CTGGGAACAG GCTTCCGAAG CTTTATCCGA AGCCCAAGGA 2160 



30 



35 



(2) IMFORHATION FOR SEQ ID NO: 15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: circular 



40 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) AHTI -SENSE: NO 



45 



50 



55 



60 



65 



70 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

Ux) FEATURE: 

(A) NAI IE/KEY: misc_feature 

(B) LOCATIOIJ 1...1158 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 15 

TATTTGAAGC TATTAATACT ACAAATAGCC TTGATGAATT TCTTAAAAA.A AGAACCGTTT 60 

AAAATATTCT CTATGATTTA TCTGCTGTTA GATACAATAA CAAACCGTGC CGGTACAGAA 12 0 

CGCGCCGTGA TCAACTTGGC TAACAACCTG CATGCCAATG GTCATCGCGT ATCATTAGTC 180 

AGCGTTTGTA CAAAAGAAGG AGAGCCTTCC TTCCAAGTAG AA7W\GGAAT AGAAGTACAC 2^0 

CATCTCGGAA TTAGGCTTTA TGGCAATGCA TTAGCCCGCA AAACAGTATA TTTCAAGGCT 300 

TATCGAAGGA TAAAAGCCCT ATACAAGAAG CGTGAACCGG TTTTATTGAT AGGGACTAAT 360 

ATTTTTATCA ATACAATTTT GTCTCAGATC AGTAACAGAG GCAGAATATT TACGATCGGA 4 20 

TGCGAACATA TCTCTTATGA TATTGCCCGC CCTATTACAA AACGCATAAG GGGGTTTCTG 4 80 

TATTCAGGGC TTGATGCCGT TGTAGCACTG ACAAAAAGAG ATCAGCAATC GTTCGAGGCA 54 0 

ATCTTACGTG GACGCTCTAA AGCATATGTC ATACCCAATC AAGTTTCATT TACTACAGTC 600 

CAAAGAGATG CTACTACTCA CAAACAAATG TTGGCGATTG GCAGGCTTAC CTACCAGAAG 660 

GGTTTTGAAT TCATGATAGA AGATGCATCA CGAGTGCTGC GAGAAAGGCC TGATTGGAAG 7 20 

CTTATCATAG TCGGAGATGG CGAAAATGAA TCGATGCTAC GTAAAGAAAT TGCATCTCGC 7 80 

AATATGGAGT CGCAAJ\TAGA T^TACATCCA TCTACACCGG AAATTCGCAA ATACTACGAA 84 0 

TCATCTGCTA TTTATCTAAT GACGTCCCGT TTCGAAGGAC TACCAATGGT ACTTCTCGAA 900 

GCAGAAGCAT ATGCACTACC TATAATCTCA TACGATTGTC CGACCGGCCC GAGGGAACTG 960 

ATCGAAAACG GTCGCAATGG TTTCCTTGTG CCAATGGAAG CACATGAAGA CTTCGCGGAT 1020 

AAGTTACGCT TATTGATGGA TGATGAAACT CTTCGTAAGA AAATGGGACA AGAATCAGAG 1080 

TTGATGGTCA AATCCTACTC TCCGGCAAAT ATCTATGAAT GTTGGAAGAA ACTATTCGTC 114 0 

GAAATCGGCT ACATGAPvT 1158 



75 



(2) INFORMATION FOR SEQ ID NO: 16 
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10 



15 



55 



70 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1965 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA (genomic) 
(iii) HYPOTHETICAL: HO 
(iv) AUTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIMGIVALIS 

(ix) FEATURE: 

(A) NAI-IE/KEY: misc_feature 

(B) LOCATION i...l965 

20 ixi) SEQUENCE DESCRIPTION: SEQ ID N0:16 

ACAACGAAAG AAAACGAGAC AACAACAAAA AACGAATATA GAATTATGGG AAAAATCATT 60 

GG7VATTGACT TAGGCACAAC GAACTCTTGT GTCTCTGTAT TGGAAGGTAA CGAACCTATC 120 

GTTATTACAA ACAGTGAGGG CAAGCGCACA ACGCCCTCGG TAGTGGCTTT TGTGGATGGT 180 

25 GGCGAGCGTA AGGTGGGCGA TCCGGCCAAG CGTCAGGCCA TCACCAATCC GACCAAGACG 240 

ATATACTCTA TCA7VACGCTT CATGGGCGAA ACTTACGATC AGGrTTCCAG AGAAGTGGAG 300 

AGAGTGCCAT TCAAGGTAGT ACGTGGGGAC AATAATACTC CGCGCGTAGA TATAGACGGT 360 

CGTCTCTATA CGCCGCAGGA AATTTCGGCC ATGATCCTTC AGAAGATGAA GAAGACGGCC 4 20 

GAAGACTACC TCGGTCAGGA AGTAACGGAG GCCGTGATCA CTGTGCCCGC ATACTTCAAC 4 80 

30 GACGCTCAAC GTCAGGCAAC GAAAGAAGCA GGAGAGATCG CCGGCCTGAA AGTTCGCCGT 54 0 

ATTGTGAACG AGCCTACGGC AGCTTCTCTG GCCTACGGTC TGGACAAGTC CAATAAGGAT 600 

ATGAAGATCG CTGTCTTCGA CTTGGGTGGC GGTACCTTCG ATATCTCTAT CTTGGAATTG 660 

GGCGACGGCG TTTTCGAAGT GAAATCGACC AACGGTGATA CGCACCTCGG AGGAGACGAC 7 20 

TTCGACCACG TGATCATTGA CTGGCTGGCA GAAGAGTTCA AGTCTCAGGA AGGTGTGGAT 7 80 

35 CTTCGCCAGG ATCCTATGGC TATGCAGCGT CTGAAAGAAG CTGCCGAAAA AGCCAAGATA 84 0 

GAGCTCTCCA GCACTTCATC TACGGAGATC AACCTCCCCT ATATCATGCC GGTGAACGGC 900 

ATCCCCAAGC ACTTGGTGAT GACGCTTACA AGGGCTAAGT TCGAGCAGTT GGCCGATCGT 960 

CTGATTCAGG CATGTGTGGC ACCCTGCGAA ACGGCCTTGA AAGATGCCGG TATGTCACGT 1020 

GGCGATATCG ATGAAGTGAT TCTCGTAGGT GGTTCCACAC GTATTCCTGC TATTCAGGAG 1080 

40 ATTGTGGAGA AGATCTTCGG TAAGGCTCCG TCCAAGGGTG TGAATCCCGA CGAAGTGGTA 114 0 

GCTGTGGGTG CCGCTATTCA AGGCGGTGTT CTGACCGGTG AGGTAAAGGA TGTCTTGCTG 1200 

TTGGACGTTA CCCCCTTGTC GCTCGGTATC GAGACTATGG GAGGCGTGAT GACTCGCTTG 1260 

ATCGATGCCA ATACCACTAT CCCGACGAAG AAGAGCGAAA TCTTTACCAC AGCAGTGGAC 1320 

AATCAACCTT CGGTAGAGAT TCATGTACTT CAGGGTGAGC GTTCTTTGGC TAAGGACAAT 1380 

45 AAGAGCATCG GCCGTTTCAA CTTGGACGGT ATTGCTCCGG CGCCCCGTCA GACACCGCAG 1440 

ATCGAAGTAA CGTTTGACAT CGATGCCAAC GGTATCCTGA ATGTAACGGC TCATGACAAA 1500 

GCTACCGGCA AGAAGCAGAA TATCCGCATC GAAGCCTCCA GCGGTTTGTC CGATGATGAG 1560 

ATCAAGCGCA TGAAGGAAGA GGCGCAGGCC AATGCCGAAG CAGATAAGAA AGAGAAAGAA 162 0 

CGTATCGACA AGATCAATCA GGCCGACAGC ATGATCTTCC AGACGGAAAA GCAGTTGAAG 1680 

50 GAGTTGGGAG ACAAATTCCC GGCCGACAAG AAGGCTCCGA TCGATACCGC TCTCGACAAA 17 4 0 

CTGAAAGAAG CACACAAAGC ACAGGATGTA GCTGCTATCG ATACAGCCAT GGCCGAACTG 18 00 

CAAACCGCTC TTTCCGCAGC GGGCGAAGAG CTTTACAAGA ATGCCGGAGC AGCCCAAGGT i8 60 

GGCGCACAAC CCGGTCCGGA CTTCGGCGGT GCTCAAGGTC CCTCTGCCGG TGATCAGCCC 1920 

TCTGACGACA AGAACGTCAC AGACGTAGAC TTCGAGGAAG TGAAG 1965 



{2) IH FORMATION FOR SEQ ID NO: 17 



(1) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1401 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

65 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) AIJTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORG At J ISM: PORYPHYROMONAS GINGIVALIS 



(i>:) FEATURE: 
75 (A) NAilE/KEY: misc feature 
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{B) LOCATION 1...14 01 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

5 AAGTGGGCAC GTACAACTAC TTTAAGGATA TCGAACGCAA CAATCTATAA AACTATGCGC 60 

TACGACTTAG CTATCATCGG TGGAGGGCCG GCCGGTTATA CGGCTGCCGA ACGTGCTGCC 120 

AAAGGTGGCC TGAAAACCCT CCTAATTGAG AAGAATGCTC TCGGTGGTGT ATGCCTCAAC 180 

GAAGGATGTA TACCGACCAA GACGCTACTC TACTCGGCCA AAGTGCTACA TCAAATTGCT 240 

ACGGCATCTA AATATGCAGT AAGTGGAACG GCCGATGGAC TTGACCTCGG CAAGGTGATT 300 

10 GCCAGAAAAG GTAAAATCAT TCGCAAGCTG ACTGCAGGCA TCCGTTCACG CCTGACAGAG 360 

GCCGGAGTAG AGATGGTGAC GGCAGAAGCT ACCGTAACGG GATGCGATGC AGACGGCATC 4 20 

ATCGGCATTA CTGCGGGCGA AGCACAGTAC AAAGCTGCCA ACCTGCTACT ATGTACCGGT 4 80 

TCGGAGACGT TTATTCCACC CATCCCCGGA GTGGAGCAGA CAGAGTATTG GACAAACCGT 54 0 

GAAGCTCTAC AGAACAAAGA GATTCCGACC TCTCTCGTCA TCATCGGTGG TGGAGTGATC 600 

15 GGAATGGAGT TCGCTTCTTT CTTCAACGGT ATCGGTACGC AAGTGCACGT GGTGGAGATG 660 

CTGCCGGAAA TACTCAACGG TATCGATCCC GAACATGCAG CTATGCTACG CGCTCACTAT 7 20 

GAAAAAGAAG GAATCAAATT CTACCTCGGG CACAAAGTAA CATCGGTTCG CAAGGGAGCT 7 80 

GTTACGGTAG AATACGAAGG AGAAAGCAAA GAGATCGAAG GAGAACGTAT CCTGATGAGT 84 0 

GTGGGACGTC GCCCCGTGCT GCAAGGATTC GAGTCGCTCG GATTGGTGCT TGCCGGCAAA 900 

20 GGTGTAAAGA CTAATGAGAG GATGCAAACT TCCCTGCCCA ATGTCTATGC TGCAGGTGAT 960 

ATTACAGGCT TCTCGCTTTT GGCACATACG GCTGTACGGG AAGCAGAGGT AGCAGTAGAT 102 0 

CAGATTTTGG GCAAAACAGA CGAAACGATG AGCTACCGTG CCGTACCAGG TGTGGTGTAC 1080 

ACCAATCCCG AGGTCGCCGG TGTGGGAGAG ACGGAAGAAT CGCTTCGCAA AGCAGGACGT 114 0 

GCCTACACTG TTCGTCGCCT TCCTATGGCC TTCTCCGGTC GATTTGTAGC AGAAAACGAA 1200 

25 CAAGGCAATG GAGAGTGCAA ACTACTACTT GATGAAGAGA ACCGCTTGAT CGGAGCACAC 1260 

CTCATTGGCA ATCCGGCCGG CGAACTCATC GTAACCGCTG CCATGGCCAT CGAGACCGGC 1320 

ATGACGGATC GACAAATCGA ACGAATCATA TTCCCTCATC CGACTGTAGG CGAAATCCTA 1380 

AAAGAAACTC TCGCCGGAGG T 14 01 

30 

(2} IMFORI'lATION FOR SEQ ID NO: 18 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEIfGTH: 2835 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRAl^JDEDHESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

40 

(iii) HYPOTHETICAL: NO 



(iv) AMTI -SENSE: NO 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHOMAS GINGIVALIS 



(XX) FEATURE: 

(A) NAIIE/KEY: misc^feature 
50 (B) LOCATIOM l..,2835 



(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 18 



CCTAAGATAC TTATGGAATT GAAAAGATTT 

55 ATTCCGATGA AGCTTTCTGC CCAACAGGCT 
GTCGGTAAGT TGGACAACGG ATTGACTTAT 
CGTGCGGATT TCTTTATCGC ACAAAAGGTA 
GGTTTGGCTC ACTTCTTGGA ACACATGGCT 
AACTTGATCA ACTATCTCGA AACGATCGGT 

60 ACCGGATTCG ACAAGACGGA ATATACGATA 
ATCGACTCCT GCTTGCTTAT CCTGCATGAT 
GAGATCGACG AGGAGCGCGG TGTGATCCAG 
CTTCGTATGT TCGAGGCTAT ACTTGCCAAG 
ATGCCCATCG GTCTGATGGA CGTCGTGCTC 

65 TATAAGAAAT GGTATCGTCC CGACCTGCAA 
GACTATGTGG AGAACAAGAT CAAAGAACTC 
GCAGAGCGTA TCTATACGCC GGTAGAGGAC 
GATGCTGAGG CTACTACCAC GCAGCTCTCC 
GAAGTGCGAG GATCGATATT CGGACTTGTG 

70 GCCGTGAATG AGCGTCTGTC CGAGATTACT 
GGAGCTTTCT TCTCTAACTT CATGTACATC 
GCCACGGTTC GTGAGGGTGA AGCGGAGAAA 
AGCCTCCGTC AGTTCGGTAT CACCAAAGGC 
AAGCGATACG AGAATCAATA CAACGAAAGA 

75 GAATACTCCA CCTACTTCAC CGATGGCGGC 



TTATCACTTG GTCTTCTGCT TGTGGGATTC 60 

CAGCCACTCC CTACAGATCC GGCTGTTCGT 120 

TTCATCCGTC ACAACGAGAA CCCGAAAGAT 180 

GGTTCTATTC TTGAAGAAGA TAGCCAGTCC 24 0 

TTCAACGGTA CGAAGAACTT CCCCGGTAAG 3 00 

GTACGTTTCG GTCAGAACCT GAACGCTTCT 360 

ATGGATGTGC CGACTACACG TCAGGGAATC 42 0 

TGGAGTAACA ATATTACCCT CGACGGGCAT 4 80 

GAAGAGTGGC GTGCTCGTCG CGATGCCAAC 54 0 

GCTATGCCGG GTAATAAATA TGCAGAACGC 600 

AACTTCAAGC ATGATGAGCT GCGCAACTAT 660 

GGTCTGGTGA TCGTGGGAGA TATCGATGTG 72 0 

TTCAAGGACG TTCCTGCTCC CGTGAATCCA 7 80 

AACGATGAGC CTATCGTAGC CATTGCTACC 84 0 

ATCAGCTTCA AGAGCGACCC CACTCCTCAA 900 

GAAGACTATA TGAAACAGGT GATCACTACA 96 0 

CACAAGCCTA ACGCTCCTTT CCTCAGTGCA 102 0 

ACCCAGACTA AGGACGCATT CAATTTTGTT 108 0 

GCGATGAACG CATTGGTGGC AGAGATAGAA 114 0 

GAATACGATC GTGCACGCAC GAATGTGCTC 1200 

GACAAGCGTA AGAACAATGC TTATGCCAAT 1260 

TATATCCCGG GTATTGAGGT GGAATATCAG 132 0 
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10 



15 



20 



25 



ACGGTGAATG 
ATGATCGATC 
AAGATTCCGA 
GAAGCCAAGA 
AAGATCGTTT 
GGCATCA7\AG 
CTCAGCCCGG 
TTCATGAACG 
GGTCGCTCTG 
ACGACTGTAG 
CGCAAGGATC 
CAGGAAGCCA 
AATCCGATGA 
ATGGCTTTCT 
AATCTGGATG 
CTCAAGCGTG 
GATTGCAAGT 
GGAAATGTGG 
GATCAGGTGT 
TTCGGCGGTC 
GATCCTGCTC 
AAGGAGGGCC 
AAAGAAAGTC 
GGAAATGACT 
CAAAAGTTTG 
GTTGCAAAGG 



CTTTTGCTCC 
CGGTGAAGAA 
GCGAAGCAGA 
AAGACGAAGT 
CCGAGAAGAA 
TATACCTCAA 
GTGGTATCCT 
TGGGTGGCTT 
CTTCCGTATC 
AAGATATGGA 
CCGAAGCGTT 
ACCCGATGGC 
TGAAACCCAT 
ACAATGAGCG 
AAGCCAAGAT 
GCGATAAGAT 
TCGAGAAGGA 
AATATACGCT 
ACACGGCTAC 
TCGAGCAATA 
GTGCCGAGGA 
CCAATGTGGA 
TGCGTGAGAA 
TCATCACAGA 
CGGCAGACCT 
CTCAA 



TCAGGTTCCT 
TGCTGTCGTT 
CTTCCTCGCT 
CTCCGACCAA 
AGATCAGAAG 
GAAGACCGAT 
CTCCGGAAAG 
GGGCAACTTC 
TCCCTCTTTG 
AACTTTCTTC 
CAAGGCCACA 
TGCGCTTATG 
GAAAGCTGCT 
ATTCGCTGAT 
GAAGCCATTG 
GAATAAGGCT 
AATGGATACT 
CAAGAACAGT 
CGTTCGCGAG 
TCCTCAGCCC 
AATGAATGCT 
ATACTTTAAG 
TCGTTTCTGG 
CTACGAATCC 
CTTGAAGCAG 



CTGGAAGCAT 
ACCCTCACCG 
GCTTTCAAAG 
AAATTGATGG 
TTCGGTACCA 
TTCAAATCAA 
CATGCTCCCA 
GATGCTATCC 
TCTCTGCTCA 
CAGTTGATCT 
CAGGAAAAGT 
GACTCTATCC 
GACGTGGAGA 
GCCGGCGACT 
ATCGAAACTT 
CAGGTACCGG 
CCTTCGACTA 
CTCCTGCTGG 
AAGGAAGGCG 
AAGGCTCTGA 
ATCGTTTTTG 
AAGACTATCG 
CTCGAAGCCA 
GTACTGAACG 
CAGAATCGGG 



TCAATCAGGC 
GTCCTTCAAA 
CTGCTCGTCA 
AGAAAGCTCC 
CGGAACTTAC 
ACGAAATCCT 
ACCAATCTGT 
AGCTGGATAA 
GTG/VAGGTCT 
ATCTCCAAAT 
TGTACAATAA 
GTCATACCAT 
AAGTAAATTA 
TTATGTTCTT 
ATCTTGCTTC 
CTGCCCGTTC 
CTATATTCGA 
AAGTCTTCTC 
GTGCATACAG 
TGCAGATCTA 
CTGAGTTGGA 
AAAACCTGAA 
TGAAGGCGTC 
GTCTTACTCC 
TTGTTGTCAT 



TATTGCCCAA 
GGCTGAAGCC 
GCAGAAAGTA 
TAAGGCCGGA 
CCTTAGCAAT 
GATGAGTGCT 
GATGAATTCG 
GGTGCTGACA 
TTCGGGCAAA 
GACTGCtAAC 
CTTGAAAAAT 
GTACGGCGAT 
CGATCAGGTA 
CTTTATCGGT 
ATTGCCCAAC 
GGGAAAGATC 
TGTCGTGTCC 
AGCCGTAATG 
TGTGGCTGCA 
TTTCCCCACG 
GAAGCTTGCC 
TAAGCAGCAC 
TTTCTTCGAA 
TGCTG.AATTG 
GATGGCTCCT 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2835 



30 



35 



40 



(2} IIIFORIIATION FOR SEQ ID HO: 19 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2058 base pairs 

(B) TYPE: nucleic acid 

(C) STRAWDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) AIITI-SENSE: NO 



45 



50 



55 



60 



65 



70 



75 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(i:-:) FEATURE: 

(A) NAI-IE/KEY: misc_feature 

(B) LOCATION i...2058 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 



TACACTATGA 
AAAAAATTCC 
GATGCTACGC 
GACCTCCGCG 
GGCGTAGGGA 
GCGGAAGAGT 
CTCGGATTTT 
CGAGAAGATG 
CCTGCGGACA 
GAGTTCCTCA 
GTGCCGATCA 
GAGGACAATC 
GACGAAGACT 
TGGATCCACC 
ATCAAAAACA 
GTCACCGATG 
GATTCGCCGG 
AAGAAGATCT 
AACGACCGCC 
ATGCTGACGG 
GACGGACACA 
AAGGACGGAC 
AAACGTGCAT 
ATCGTGAGCC 



GTAAGA/^GG 
TGTACAGCGA 
AGAAGCTGAA 
TAACGGTCAG 
TGACCGAAGA 
TTCTTGAAAA 
ACTCGGCTTT 
CTACGGCGGT 
AGGCTGACCG 
AAAAAGAAAA 
TTTTCGGCAA 
AGATCAACGA 
ATAAGGAATT 
TCAATGTGGA 
ACTTGGATCT 
AAGTACAGGG 
ATATTCCCCT 
CGTCTCATAT 
CCACATTCGA 
ATGAGAAGTT 
AGTACACGTT 
AGGTAGTGTA 
CCGACAAAGG 
TGCTGGAGCA 



AACAATCGGG 
CCATGAGATA 
AACGCTTACA 
CGTGGATGAA 
GGAGGTGGAG 
GTACAAAGAC 
CATGGTGTCC 
GAAATGGAGC 
TGGCACCGAC 
GATAGAGGGG 
GAAGCAGG7U\ 
CACACATCCT 
TTACCGTTCG 
CTATCCGTTC 
GCAGCGC/iAC 
TATCGTGCCG 
CAACGTATCG 
CACCAAGAAG 
GGAGAAATGG 
CTATGAGCGT 
CGACGAATAC 
TCTCTATGCT 
CTACAGCGTG 
AAAGTTGGAG 



GTAACGAGCG 
TTCCTGCGTG 
TCCGTCGGCG 
GTGGCACGCA 
AAGTACATCA 
GACAAGGCCG 
GAGCGAGTGG 
TGCGACGGAT 
ATCGTGATGC 
CTCCTCGGCA 
TGGAAAGACG 
GCCTGGACCA 
CTCTATCCCA 
AATCTGACAG 
AAGATTCAGC 
GAGTTCCTCA 
CGCTCCTATC 
GTGGCAGACC 
GATAGTCTGA 
GCAGCCAAAT 
CGAACGCTCG 
ACGGACAAGC 
ATGCTGTTGG 
AAGACACACT 



ACAATATATT 
AGATCGTCTC 
AATTCAAAGG 
CGATCACGGT 
ATCAGATTGC 
CCATTATCGG 
ACGTGATCAC 
CGCCCGAATA 
ACATCGATGA 
AATACTGTAA 
GCT^AGATGCA 
AATVAGCCTGC 
TGTCCGAAGA 
GTATCCTCTA 
TCTACTGCAA 
CCCTCCTGCA 
TGCAGAGCGA 
GTCTGGAAGA 
AGCTCTTCGT 
TCTTCCTTTT 
TCGAAGGTGT 
ATGGACAGTA 
ATGGTCAGTT 
TTGTCCGTGT 



CCCCGTCATC 
CAATGCCGTG 
CGAGACGGGT 
CAGCGACCGC 
TTTCTCCAGT 
CCACTTCGGA 
GCGCTCTTTC 
CACGCTCGAA 
GGAGAATAGC 
GTTCCTTACC 
AGATACGGAC 
CGACCTCAAG 
GCCTCTCTTC 
TTTCCCGAAG 
TCAGGTTTAC 
CGGGGTCATC 
TGCCAATGTG 
AATTTTCAAA 
CGAATACGGT 
CACCGATATG 
ACAGACGGAT 
CAGCCACGTG 
GGATCCGCAT 
CGATAGCGAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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ACGATCAACA ATCTGATCCG CAAGGAGGAA AGAGCCGAAG TGAAACTGTC CGATACGGAG 1500 

CGCGCCACTC TCGTGAAGCT GTTCGAAGCA CGCCTGCCAC GGGACGAGAA GAAGCACTTC 1560 

AATGTAGCTT TCGAATCGCT CGGAGCCGAA GGTGAAGCCA TCCTTATCAC ACAAGCCGAA 162 0 

TTCATGCGCC GTATGCGCGA TATGGCACAG CTGCAGCCGG GAATGAGCTT CTACGGCGAA 1680 

5 CTCCCCGATT CGTACAATCT GGTACTTAAT ACCGATCATC CGCTCATCGA CAGGGTACTC 17^0 

TCCGGTGAGA AAGAATCGGT AGAGCCTTCG CTCACAGAGC TTAGAGCGAA AATCGCCGAG 1800 

CTGAAAGCGG AAGAGGCCAA GCTGCTCGAT GAGGAAAAAG GGAAGAAACC GGAGGAAATC 1860 

CCTGTTGCCA CGAAGGAAGC CAAGGAGAAC AACGCCGTCG AAGAGGCCAA AACCGAAGGC 1920 

AGTATCAACG ATCAACTGAC CAAATATGCT CAGGACAACG AGCTGATAGG TCAGCTCATC 1900 

10 GACTTGGCTC TGCTCGGAAG CGGATTGCTG ACGGGAGAGG CTTTGGCCGA ATTCATTCGT 2 04 0 

CGCAGCCAGC GTCTTCTC 2 058 



15 



25 



30 



65 



(2) IITFORIIATIOI-J FOR SEQ ID MO: 20 



(i| SEQUENCE CHARACTERISTICS: 

(A) LEIJGTH: 1446 base pairs 

(B) TYPE: nucleic acid 
{CI STRAriDEDNESS: double 

20 (D) TOPOLOGY: circular 



(ii) molecule: TYPE: DMA (genomic) 

(iiil HYPOTHETICAL: NO 

(iv) AMTI-SEMSE: MO 

(vi) ORIGIHAL SOURCE: 

(A) ORGAI4ISM: PORY PHYROMONAS GIMGIVALIS 



(IX) FEATURE: 

(A) IWIE/KEY: misc_feature 

(B) LOCATION 1 ... 1446 

35 (xi) SEQUENCE DESCRIPTIOH: SEQ ID NO: 20 

GACATTCGAC AGAAAAGACC GTGCTTCAAC GCCAATTTGT ACTTTTACCG TGCAATGGAA 60 

AAACTGATCG ATATTTTGGT CGTAGACGAT GATGTGGCAG TCTGTGCCGC ACTGCGTCTG 120 

GTGCTCAAGC GAGCGGGCTA TAATCCCGTT ATAGCCAACA GTCCCGACGA AGCTTTGTCC 180 

40 ATAATGCGGA ATCCTGATGG CGGCTGTAAG CCGGCTGTGA TTCTGATGGA TATGAATTTC 24 0 

TCCCTTTCGA CCTCCGGCAG GGAAGGATTG G7\ACTACTGG AGAAGATGCA GATATTCACT 300 

TCCTGCCCTG TCATACTGAT GACGGCTTGG GCTTCGATTC CACTGGCAGT GGAGGGAATG 360 

AGGCTTGGAG CTTTCGACTT CATAGGCAAG CCATGGGACA ACGATCGGCT CCTTCGTACC 4 20 

ATAGATACGG CCTTGCATCT GGCTGCTCCC TCAGCTGTGG CGAATCCATC GGAACAGTCT 4 80 

45 GACAGAGATA CAGCCCGTCA GCCGAAAGCT ACAGTCCAAG AGAATGACCC CTGTGCCCAT 54 0 

ATCATAGGCC GGAGCGATGC CATCTGTAAG ATCAAGGAAC GGATACGCCG CATAGCTCCC 600 

ACCCATGCCT CTGTGCTGAT CACGGGCGAG AGCGGTACGG GCAAAGAGTT GATAGCCGAA 6 60 

GCTCTGCACC GTGGGAGCAA ACGAGCCTCA GCCCCATTCG TCAAGGTCAA TTTGGGTGGG 7 20 

ATTCCCGAAA GTTTGTTCGA AAGTGAGCTG TTCGGACATA AGAAAGGAGC TTTTACCAAT 7 80 

50 GCTTTTTCCG ACAGGAAAGG ACGGTTCGAG CTGGCTGATG GCGGCACGAT CTTTCTGGAC 84 0 

GAAAT AGGCG AACTACCGGT CGGCAACCAA GTAAAACTGC TGCGAGTGCT ACAGGAACAG 900 

ACATTCGAGC CGTTGGGCGA GAGCGTCTCC CACCGAGTGG ACATCCGTGT GGTATCGGCT 960 

ACGAATGCTT CCTTGGAGCG AATGGTAGCC GAAGGACGTT TCAGAGAGGA CCTCTACTAT 1020 

CGAATCAACC TGATACATCT GCATCTGCCT CCGCTGCGTG AGCGTCAGGA GGATATACAG 1080 

55 CTGCTGGTGG AAGCCTTCAG TGAAGCCTTT GCCCAATCGA ACGGATTGCC CCATGCCGTT 114 0 

TGGAGTGCGG AAGCTATGCG ACGTATCTGT GCCATGCCCC TACCGGGCAA TGTACGCGAA 1200 

CTGAAAAACG TAGTGGAGCG TACGCTATTG CTCTCGGGAT CGAGAGAAAT CAGTGCCCGG 12 60 

GATGTGGCTG ACTTCGGTTC GCAGGTGACG GCAGCAGACC ACTCCGACGA ACGGGCTTTG 1320 

ACCGACATGG AGGAAGCTGC TATCCGAGAG ACGCTGACTA AATACAACGG CAACGTTAGT 1380 

60 CGTGCTGCAC GAGCCTTGGG ATTGAGCCGG GCAGCTCTTT ACCGGCGAAT GGAGAAATAC 14 40 

GGACTG 1^46 



(2) lUFORMATIOH FOR SEQ ID HO; 21 



ti) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 789 base pairs 
(B| TYPE: nucleic acid 
(C) STRAMDEDNESS: double 
70 (D) TOPOLOGY: circular 



75 



(ii) MOLECULE TYPE: DIIA (genomic) 
(iii) HYPOTHETICAL: HO 
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(iv) AMTI-SEHSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

5 

(ix) FEATURE: 

(A) NAIIE/KEY: misc_f eature 

(B) LOCATION 1 . , .789 

10 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 21 

AGATCTCTGC AATCATTTCA AAATAA7\AAA CACTCAAGTA TGCTTAAGAT AAAGAACCTC 60 

CACGCCACAG TACAGGGCAA AGAGATATTG AAAGGAATCA ATCTGGAGAT CAATGCCGGA 120 

GAGATTCATG CTATCATGGG GCCGAACGGA TCGGGGAAAA GTACGCTCTC TTCCGTTTTG 180 

15 GTGGGACATC CCTCCTTTGA AGTCACGGAA GGAGAGGTGA CATTCAATGG AATCGACCTG 24 0 

CTCGAACTCG AACCGGAAGA ACGTGCACAC CTCGGACTCT TTCTCAGTTT CCAATATCCG 300 

GTCGAGATCC CGGGCGTCAG CATGGTGAAT TTCATGAGGG CAGCTGTCAA TGAACATAGG 360 

AAAGCGATCG GAGCAGAACC CGTATCGGCA AGCGACTTCC TCAAGATGAT GCGAGAGAAG 4 20 

CGTGCCATTG TGGAGCTGGA CAACAT^TTG GCCAGCCGTT CTGTGAACGA AGGCTTCTCC 4 80 

20 GGTGGAGAAA AAAAGAGGAA CGAAATCTTC CAAATGGCTA TGCTCGAACC CAAGCTGGCT 54 0 

ATTTTGGACG AAACCGATAG CGGGCTCGAT ATCGACGCTC TCCGCATCGT AGCAGGCGGG 600 

GTAAACCGAC TCCGCTCTCC GGAGAATGCT GCTATTGTGA TCACACACTA TCAGCGTTTG 660 

CTCGAGTACA TCAAGCCGGA CTTCGTACAC GTCCTTTACA AGGGGCGCAT CGTCAAGTCG 7 20 

GGAGGAGCCG AGCTGGCTCT CACGCTCGAA GAAAAAGGCT ACGACTGGAT CAAGGAAGAG 7 80 

25 ATAGGAGAA 7 8 9 

(2} INFORMATION FOR SEQ ID NO: 22 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1386 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

35 

(ii) MOLECULE TYPE: DHA (genomic) 

(iii) HYPOTHETICAL: NO 

40 (iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

45 (ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION l..,138 6 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22 



AGCATGGCTA AGGAGAAAAC GATCTACGTC TGCCGTTCGT GCGGAACCAA ATACGCCAAA 60 

TGGCAAGGCA ACTGCAATGC CTGTGGAGAG TGGAACTGCA TTGATGAGGA GAAGGTGCCG 120 

GCACCGGCAT CGGGCAAGCA TGCAGCCAAG AGTTTTATGC CTCGGGAGCA GGACAACCGG 180 

CCAAGACTCT TACAGGATGT GGAGTCCGGC GATGAAGAGC GTATTCGCCT CGGCGATGAA 24 0 

55 GAGTTCGACC GCGTACTGGG TGGAGGAATT GTCAAAGGAG CATTTGTCCT GCTTGGGGGC 300 

GAGCCGGGAA TCGGTAAGTC CACGCTTATC CTCCAGACGG TGCTGCGTCT GCCGCAGTTG 360 

CGCACGCTCT ATGTGTCGGG CGAAGAAAGT GCCCGACAAC TGAAGATGCG CGCCGAACGA 4 20 

CTGGGGCAAG CCATGAATGG GTGCTACGTA TACTGCGAAA CGAATATAGA GAGGATACTC 4 80 

TCCCGTGCAG AAGAACTCAC ACCCGATCTC CTCGTGATAG ACTCTATACA GACGGTCTAT 54 0 

60 ACCGAGGAAA TGGAAAGCTC GGCCGGCAGC GTGGGGCAGA TCCGCGAATG TGCCGCCTTA 600 

CTGCTCAAAT ACTGCAAGAC TACGGGTATC CCCGTCATCG TCATCGGACA CATCACCAAA 660 

GAAGGTAGCA TAGCCGGACC GAAGGTGCTG GAGCATATAG TGGATACGGT GCTTCTCTTC 7 20 

GACGGGGATA AGCATCATCT CTACCGGATA CTCCGAGGAC AGAAGAACCG CTATGGCAGT 7 80 

ACTTCCGAGC TGGGGATATA CGAGATGCGG CAGGACGGTC TGCGTGGCGT GGAGAATCCG 84 0 

65 AGCGAACATC TCATCACACG CAATAGGGAA GACCTCAGTG GCATAGCCAT AGCCGTAGCG 900 

ATGGAGGGCA TTCGCCCGAT ACTCATCGAA GCGCAGGCTT TGGTCAGCTC GGCCATTTAT 960 

GCCAATCCGC AGCGTTCGGC CACGGGCTTC GATATTCGGC GGATGAACAT GCTCTTAGCC 102 0 

GTACTGGAGA AACGTGCCGG CTTCAAGCTC ATACAGAAGG ATGTGTTTCT GAACATTGCC 1080 

GGAGGTATCA AAATAGCCGA TCCGGCTACG GATCTGGCCG TTATCTCGGC AGTGCTGGCG 114 0 

70 TCGAGTCTGG ACATCGTTAT CCCGCCGGCC GTATGCATGA CGGGCGAGGT CGGACTCTCC 12 00 

GGAGAGATAC GTCCCGTGAG CCGCATCGAG CAGCGCATAA CGGAAGCGCG TCGCATAGGG 1260 

TTCAAAGAGA TATTGGTACC GGCCGATAAT TTCCGGCAGG AGGATGCCGG CCGCTTCGGT 1320 

ATTCGGCTCG TGCCGGTCAG .AAAGGTGGAG GAAGCCTTCC GCCATCTGTT CTCGAAAGGA 1380 

AGAGAA 138 6 

75 
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(2) lUFORI iATTOi; FOR SEQ ID NO: 23 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 1119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: MO 

(iv) AIITI-SEHSE: MO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHONAS GIUGIVALIS 

(ix) FEATURE: 

(A) IIAIIE/KEY: misc_feature 

(B) LOCATION 1. , .1119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

GGGTCTTGTG GAAGTAGCCC AGCAATTGGC CGATCAGGGT GTTCGCGTTG TGATCGCCGG 
ATTGGACATG GACTTTCGAC GTCAGCCTTT CGGACCTATG CCGGGCTTGT GTGCCATAGC 
CGACTCCGI'G ACCAAAGTTC ATGCCGTGTG TGTGGAATGC GGCCGATTGG CCAGCTATTC 
TTTCCGTCGT GTCCAAGGCG ATCAGCAAGT GATGCTGGGC GAACTGAACG AATACAGTCC 
CCTCTGCAGA ACCTGCTACA GGAAATGCAG TTCTCCCCCA CAAACAGAAG AAATCCATTC 
GACAATATGA ATAGCAGACA TCTGACAATC ACAATCATTG CCGGCCTCTC CCTCTTTGTA 
CTGACATTGG GCGGCTGCTC CGTAGCCCAA CAAGATACGC AGTGGACTCT CGGCGGAAAG 
CTCTTTACTT CGGCGTGGAT ACAACGTTCG GCCGAATATC AAGCGCTTTG CATTCAGGCA 
TACAACATCG CTACGGAAAG AGTGGACGCT CTACCGGCAG AACGTAAACA AGGAGATAGG 
CCTTATGCCA TCGTAACGGA CATAGACGAA ACCATTTTGG ACAATACGCC TAACTCCGTG 
TATCAGGCTC TCAGGGGCAA GGATTATGAT GAAGAGACTT GGGGGAAATG GTGTGCACAG 
GCCGATGCCG ACACACTGGC AGGAGCTTTG TCTTTCTTCC TCCATGCAGC GAACAAGGGG 
ATCGAGGTCT TTTACGTCAC CAACCGCAGA GACAATCTGC GCGAAGCAAC TCTTCAGAAC 
CTTCAGCGTT ACGGATTCCC CTTTGCCGAT GAAGAACATT TGCTTACGAC CCATGGGCCA 
TCCGACAAAG AACCCCGTCG GCTCAAAATA CAAGAACAGT ATGAAATAGT ATTGCTCATA 
GGAGACAACT TGGGCGACTT CCACCACTTC TTCAATACGA AAGAAGAGTC CGGACGCAAA 
CAGGCTCTGG GCCTGACAGC CGGGGAGTTT GGCCGGCACT TCATCATGCT GCCCAATCCC 
AACTACGGAT CTTGGGAACC GGCATGGTAC GGCGGGAAGT ATCCGCCACT GCCCGAAAGA 
GACAAAGCAC TTAAACAACT GCACTCACAG AACAGCAGA 



(2) INFORIIATION FOR SEQ ID NO: 2^ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1278 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORVPHYROr-IOUAS GINGIVALIS 

(ix) FEATURE: 

(A) NAI'IE/KEY: misc_feature 

( B) LOCATION 1 ... 1278 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

CGTTGGGGTT TTTCTAATTT TGTGGCCATG AGCACCAATA TAGATGTACA ACAGATCAAA 
CAGCGTTTCG GCATCATCGG TAGCAGTCCG CTGATGGAAC ATGCCATACG AGTGGCAGCA 
CAGGTGGCTC CTACCGACAT GTCCGTCCTC GTGACGGGGG AGAGCGGTTC GGGGAAAGAG 
TTCTTCCCAC AGATAATCCA CTACTACAGC GCCCGGAAAC ATCATAGCTA CATTGCAGTC 
AATTGCGGAG CCATCCCCGA AGGAACCATC GATTCCGAGC TGTTCGGACA CCGCAAAGGT 
TCCTTTACCG GAGCCGTATC GGATCGCAAG GGGTACTTCG AAGAAGCATC CGGCGGCACG 
ATCTTTCTGG ACGAAGTGGG CGAACTGCCT TTGCCCACGC AGGCGAGGCT GCTGAGGGTG 
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CTGGAGACGG GCGAGTTCAT CCCCGTAGGA GCCAGCCAGT CGCAGAAGAC GGATGTCCGT 4 80 

ATCGTAGCGG CGACGAATGT GAACCTCAAG GAGGCGGTAG CGAACGGGAA GTTCCGGGAA 54 0 

GACCTCTTCT TCCGGCTCAA TACGGTACCG ATCGAGGTGC CTGCGCTGCG TATGCGACCG 600 

GACGACGTGC CCTTGCTTTT TCGCCGATTC GCCGCCGACA GCGCCGAGAA GTATCGGATG 66 0 

5 CCTCCGCTGC GCCTATCGGA CGAAGCCCGT ACCATATTAA TGCGTTACCG CTGGCCCGGC 720 

AATGTGCGAG AGCTGCGCAA TATTKACCGAC AGGCTGAGCA TCCTGGAGGA GGAGCGGACG 780 

GTATCGGCAG AGACCATCAC TCGCTACCTG GACGCTGAGG GGATGCAAGA CCTCCACCCC 84 0 

GTCGTGATCC GACGGAACGA AACGACCGAA GCGGACA7\AC AAATCCCCCA TTACGAGCGC 900 

GAAATCATCT ACCAGGTGCT ATACGATATG AAGAAAGAGA TAGCCGATTT GAAGGGGATG 960 

10 ATGAACCGCC TGGCGCACCA CGAACAGCCC TCATGGCCTG TAGGGTCGGA CGTCTGGGGC 1020 

AACGACGACA AGCGCACCGC AGATCCGAAG TGGGGCGTCA GCACGCACAA GGCCCCCATC 108 0 

GCGAACGCGG CAGAACCCGT GGAGCCGATA CAGGAAGCCA GCGAATACAC CGAGGATCCG 114 0 

GTTTCGCTGG AGGAGGTAGA GAAGAAAATG ATTTCCCTTG CATTGGAACG CCACGGCGGA 1200 

AGGCGCAAGC AGACAGCCGA GGAACTGAAG ATTTCGGAGC GGACACTATA CCGTAAAATC 1260 

15 AAGGAGTATG GACTGGAA 1278 

(2) IMFORMATION FOR SEQ ID NO: 25 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1959 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 

(D) TOPOLOGY: circular 

25 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

30 (iv) ANTI-SENSE: NO 

{vi} ORIGINAL SOURCE: 

( A ) ORGAt ll SM : FORY PHYRa-lOMAS GINGIVALI S 

35 (ix) FEATURE: 

(A) NAIIE/KEY: mi sc_ feature 

(B) LOCATION 1 . . .195 9 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID HO:25 



AAAAATCTTC CGAGGTACGC TCCGGCATTT GGCAGAAATC GTAAAAGGGA ACAACCTGAC 60 

CCTGACAACG ATGATAGTCG TGGGAGAAGC CATAGACAAT CGGGAAGGGC TGTCACGGTT 120 

GTATGCCGAC GAATTCAAAC ACCTCTTCCG CACATGATCC TACTCTTCGG CGGTACTACG 180 

GAAGGCCGTG CCGCAGCTCG CGTGCTGGAT GAAGCGGGAA GTCCGTTTTT CTACTCCACC 2 40 

45 AAAGGCAATC TGC7\AGAGAT CCAGAGTAGC CACGGCCATC GTCTGACAGG AGCCATGACG 300 

GTTGCCGACA TGGTTTCGTT TTGTCGGAAA GAAGAGATCC GACTGATCGT GGACGCCGCT 360 

CATCCTTTCG CCGAAGAATT GCACGCTTCA GTGGCAGAAG CCTCTGAACA AACAGGTATC ^120 

CCCGTAGTAA GATACGAGAG ACAATACCCT CCACGCGAAG AAGGTATCGT CTGGTGTGCA 4 80 

AACTACGATA CGGCTGCCGA GCGGATGCTT GGCGATGGCG TGCAGCGTCT GCTGATGCTG 540 

50 ACAGGAGTGA ATACGATCCC CAAGCTGGCT GCTTTCTGGA AAGAGCGCAC CACCTTTTGC 600 

CGCATATTGA AGCGAGACGA ATCGGTTGCT TTGGCAGAGA AGAACGGCTT TCCTGCGGAG 660 

CGCATCGTTT TCTTCGAACC GCATGCGGAC GAGGAGCTGA TGCAAGCCGT TCGCCCCGAT 7 20 

GCCATTATCA CAAAAGAAAG CGGAGAGAGC GGTTACTTCC GAGAAAAGAT AGAAGCTGCC 7 80 

CGACGGATGG GCATCCGTAT ATATGCCGTC GTACGTCCCC CTTTGCCTCC TTCATTCATT 840 

55 CCCGTAGGCG GGCCTGTCGG TTTGAGACGG GCGGTAGAAC GCCTCGTGCC GGGATTCTTT 900 

TCACTCCGAA GCGGATTCAC TACCGGCACC ACAGCTACCG CTGCAGTAGT AGCAGCCATG 960 

TACCGATTGA TGGGGCTTGG CTCTCTCGCC GAAGCTCCCG TAGAATTGCC TTCGGGCGAA 1020 

ATAGTCAGTC TGCCCATAGC GGAAATTCGA GAGGAAGAAG ATGCTGTCGT ATCCGCAGTC 1080 

CTGAAAGATG CAGGTGATGA TCCGGATGTG ACCAATGGCA TGGCGGTATG CGCTACGATC 114 0 

60 AGGCTCAATC CCGAACATGA GGAAGTCCGC TTCCTGCAGG GTGAAGGGGT GGGGGTAGTG 12 00 

ACGCTCCCCG GCCTCGGTCT GGAGGTCGGA GGTCCGGCTA TCAACCTCGT ACCTCGACGA 12 60 

ATGATGACAG CAGAGGTACG CCGACTCTAT GCGCAGGGAG GTGTGGATAT TACGATTAGC 1320 

GTACCCGAAG GCCGAGAGGC TGCTACCCAG ACATTCAATC CCCGACTCGG CATACGGGAC 1380 

GGCATCTCTA TTATCGGAAC ATCGGGAGTC GTGAAACCTT TTTCGGCCGA AGCGTTCGTT 14 40 

65 GGTGCCATCC GTAAGCAAGT GGGTATTGCC ACCGCCTTGG GAGCCAATCA TATCGTCCTC 1500 

AATTCGGGAG CCAAGAGTGA GCGTTATGTA AAAGGAGCCT ATCCGGCACT CATTCCACAG 1560 

GCCTTTGTGC AGTATGGCAJV TTTCGTCGGC GAATCACTCA GTTGTGTAGC TTCCTTCCCT 1620 

TCTGTCCGTT CGGTAACGGT AGGAATCATG CTCGGCAAAG CAGTGAAACT CGCCGAAGGC 168 0 

TATCTGGATA CGCACAGTAA AAAGGTAGTG ATGAATCGGG ATTTCCTGCA CGAACTGGCT 17 4 0 

70 CGTCAGGCAG GTTGTTCGGA AGACATCCAT GCCATAATAG ACAGCCTGAA TTTGGCTCGT 1800 

GAGCTATGGA CTATGCCGAG TGCGGAGGAC AGCGATCGAC TGCTACGAA.A GATTGCCGAA 18 60 

CGATCTTGGG AAACTTGCCG CCCATCGGTA CCATCGGCCG AATTAGAACT CCTGCTGATC 1920 

GATGAGTCCG GAGCGATTCG TTTTCGTATC GGTGGAGAA 1959 

75 
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(2) IlJFORtlATIOIi FOR SEQ ID NO: 26 



(i) SEQUEflCE CHARACTERISTICS: 

(A) LEIIGTH: 1353 base pairs 
5 (B) TYPE: nucleic acid 

(C) ST RAIIDEDl- J ESS: double 

(D) TOPOLOGY: circular 

(ii) nOLECULE TYPE: DWA (genomic) 

10 

(iii) HYPOTHETICAL: UO 



(iv) AlITI-SEliSE: MO 

15 (vi) ORIGIMAL SOURCE: 

(A) ORGAEIISM: PORYPHYROI-IONAS GI HG I VAL IS 

(ix) FEATURE: 

(A) NAIIE/KEY: misc_feature 
20 (B) LOCATION 1...1353 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 



CAACAAAGAC CACATAGAAT TATTATGTTG 

25 CCCGAAAACA AGTTGTCGGC AGGCAAGCCC 

GTCATCCCTC TTGGTCAGCA CATCGGTGCA 
GAAGTTAAGG TCGGGACTAT CATTGCTCAG 
TCATCTGTGT CGGGTAAGGT GCTGAAGATC 
AAGCCCGCAG TCTTCATTAG CGTAGAAGGT 

30 CCAGCCATCG TCAAAGAATG CAATCTGGAT 
GCCGGTATTG TGGGTCTTGG CGGTGCTACC 
CCGGGCAACA AAGCTGAGAT CCTGATCATC 
AGCGACCATG TCCTTATGCT GGAGCACGGC 
ATGAAAGCCA TTCAGGTAAA CAAGGCCGTC 

35 ATTGCTCACC TCACCAAACT GGCCACTGCA 

GTGCAATATC CTCAAGGCGG TGAGAAGCAG 
AAAAGCGGTG CCTTGCCTAT CAGCACAGGT 
GCCGTGTACG AAGCAGTACA GAAG7\ACAAG 
GGAAAAAAAC TGTCTCGTCC GTCTAACCTC 

40 TTGATCGAAG CAGCAGGTGG CTTGCCGGAG 
ATGATGGGAC GCGCTCTGCT GTCACCGGAT 
TTGATTCTCG ATAGAGAAGA GGCAGTTCGC 
AAGTGCGTCG GAGTGTGTCC GATGGGACTC 
TATAAGAGCT GGGAAACAGC GGAAAAAGGC 

45 TGCAGCTTCA CCTGTCCGGC CAACCGTCCT 
ACTGTGATGG GTATCCAAAG AGCACGTAAG 



AGGACTTTCC GAATCGGTGG TATTCACCCC 60 

GTAGAGGTGT TGCCTATCCC CTCACAGGTA 120 

CCGGCAACTG CCACGGTCAA GAAAGGGGAT 180 

GCCGGAGGAT TCGTATCAGC TAATATCCAC 240 

GATAACGTAT ACGACTCAAG CGGCTATCCC 300 

GACGAATGGG A.AGAGGGCAT CGATCGCTCA 360 

GCAAAAGAAA TCGTAGCCAA AATTTCTGCA 4 20 

TTCCCTACCC ATGTGAAGCT GTCCCCTCCT 4 80 

AACGCCGTAG AGTGCGAGCC TTATCTGACG 54 0 

GAAGAGATCA TGATCGGCGT GAGTATCCTG 600 

ATCGGAGTTG AGAATAATAA GAAAGATGCT 660 

TATCCGGGCA TAGAGGTAAT GCCGTTGAAG 7 20 

CTGATCGATG CAGTGATCCG CAAGCAGGTA 7 80 

GCCGTAGTAC AAAACGTGGG TACGGTATTC 84 0 

CCTCTGGTCG AGCGCATCGT GACGGTTACA 900 

CTCGTTCGTA TAGGTACTCC TATTGCGGCT 960 

AATACGGGCA AGATCATCGG CGGAGGTCCG 1020 

GTGCCTGTGA CCAAAGGCAG CTCCGGAGTA 1080 

AAGCCTATGC GCGACTGTAT CCGATGCGCC 114 0 

AATCCGGCTT TCCTTATGCG CGACACCTTA 1200 

AACGTGGTTG ACTGTATCGA ATGCGGTTCG 1260 

CTGCTGGATT ATATCCGCCA AGCCAAGAAG 1320 

CAA 1353 



(2) IN FORMATION FOR SEQ ID NO: 27 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1467 base pairs 

(B) TYPE: nucleic acid 

(C) STRAUDEDNESS : double 

(D) TOPOLOGY: circular 



55 



60 



(ii) IIOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: HO 
(iv) ANTI -SENSE: NO 



( V i ) OR I G 1 1 1 A L S OU RCE : 

(A) ORGAtJlSM: PORY PHYROf-lONAS GINGIVALIS 



65 

(ix) FEATURE: 



(A) NAIIE/KEY: mi sc_f eat ure 

(B) LOCATION 1. . .1-3 67 



70 (;:i) SEQUENCE DESCRIPTION: SEQ ID NO: 27 



ATGAACTCTC AAAAGAAAGA GGCTTTTAAT ATGAAAAGAA TACAACTAAC TCTTATCGCT 60 

CTCTTCGCCG CTGTTGCCGG TTTGGTCGCT CAAAATGCTT ACGAGGGAGT AATTTCATAT 120 

AAAATTTCGT TGGACAAAAC CGGAAACAAG GTTGTACTGA ATGGTGCGGC AGATATGAGT 18*^ 

AATTTAAAGC TCAAGAGCAC TCAGATGATC ATTGTTACGC CTATTCTTCG TTCAGAAGAT 24 0 
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GGTACCAGCC GGGTGGAATT TCCTTCGGTA GTCATTACAG GCCGCAATAG AACAAAAGCT 300 

CTCAAGCGTG AAATCGCATT TAGTTCGGCT TTGCCCCAAG CAAAACATGC AGCTCAATAC 3 60 

ATTCGCCGTC ATAATGGGAA GAGCGAGCAG TTTGCTTTTA CAGGAGAACA TGCTTATGCA 4 20 

TCATGGATGA TGGATGCCAA GTTTGTGGTT CGTGAGGAGG TACGAGGTTG TGCTAAATGC 4 80 

5 CCTGTAGGTC TCTCGAGTAA TATTGTTCCT TTTGATCCAC TCTTCAATCC GGCAGAGGCT 54 0 

CCTTATTTGT TGGCACACAT TACTCCGGCA GAAGAAGTGG AAAAACAGCG AGAGTCCAGC 600 

TTCGATGCTT ATATCAACTT CAAAGTCAAT AAGGCAGATG TCCTTCCTGA GTATCGCAAC 660 

7\ATAAGGCGG AGTTAGAGAA AATCAAAGAA TTTGTAAGCA CCGTTAAGGC TAATCCAAAC 72 0 

TATTCGGTCA ATAAAATGAT CATCGAAGGG TTTGCTTCTC CCGAGGCTTC AATAGCCCAC 78 0 

10 AATAAGGCTT TGTCGGAGCG CCGTGCTAAA AGACTCGCGG AAGAATTGGT GCGTAAGTAT 84 0 

GGCAAAACAT TGCCGAATAT AACCACTGAA TTCGGCGGTG AAGATTGGAA GGGGCTGAAA 900 

CTGGCTATCG AAAAGAGTGA TATAGCCGAT CGTGACCGCG TATTGGAGAT AATCAACTCC 960 

GATAAATATG CCGATGATGA TGCACGTGAA CAGGCTCTGA AGCAACTTTC GTCTTATCGT 102 0 

TATATCTTGG ATCAGATCTA TCCGAATTTG CGTCGCAATA CGATAACCAT GGGGTATATC 108 0 

15 GTTCGTGATT ATACCCTCGA AGAAGCTCGT GAAATCATTA AGACTGCTCC GAAAGAACTT 114 0 

AGTGAGGCCG AAATGTACCG TGTGGCAATG TCTTATCCTG AGGGGCACCA AGAGCGTTTG 1200 

TTTGCTCTGA ATACGACCCT TAAGTATTTC CCTGAAAGTG TAACGGGCCG AATCAATTTG 12 60 

GCTGTAGCCG CTTTTAATGG TGGAGACGTT CAACAGGCAA TTGCTCTGTT GAGTCCGATT 1320 

CAGACAGAAA AGGGTGTAAG CAATATCCTT GGAGCTGCTT ATGCTCGTAC GGGAGATTTT 138 0 

20 GCTCGTGCCG AAACCTTCTT CCGTAAGGCC GTTGCAGAAG GAGATGCAAA TGCGCAGCGC 14 4 0 

AACCTCGATA TGCTGCTTGG CAAAAAG 14 67 



25 



35 



40 



70 



75 



(2) IMFORIIATION FOR SEQ ID MO: 20 



( i ) SEQUEI JCE CHARACTERISTICS : 

(A) LENGTH: 1152 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 
30 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DIIA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AHTI-SEMSE: MO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 



(ix) FEATURE: 

(A) NAI-1E/KEY: misc_feature 

(B) LOCATIOM 1 ... 1152 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28 

GACATGGCAG AAAAAAGAGA CTATTACGAA GTCCTCGGTG TATCGAAGAA. TGCCACCGAC 6 0 

GATGAACTGA AAAAAGCATA TCGCAAGAAG GCTATCCAAT ACCATCCTGA TAAGAACCCC 120 

GGTGACAAGG AGGCCGAAGA GCACTTCAAA GAGGTAGCTG AAGCCTACGA CGTATTGAGC 180 

50 GATCCGCAGA AGCGCAGTCA ATATGACCAG TTCGGCCATG CCGGATTGGG CGGAGCTGCC 24 0 

GGTGGAGGTT TCAGCGGAGG CGGTATGTCC ATGGAGGATA TTTTCAGTCG CTTCGGTGAT 300 

CTATTCGGTG GGTTCGGCGG TTTCGGCGGA TTCTCCGATA TGGGCGGTGG CAGTCGCAGA 360 

CGTGTTCGCA GAGGGTCTGA CCTGCGAGTA CGAGTGAAGC TTTCTTTGGC CGATATAAGT 4 20 

AAAGGTGTGG AGAAGAAAGT GAAGGTAAAA AAGCAGGTAG TGTGCAGCAA ATGTCGTGGC 4 80 

55 GATGGCACGG AAGAAGCCAA TGGCAAGACT ACCTGCCAGA CCTGCCATGG AACCGGCGTG 54 0 

GTTACACGTG TGAGCAACAC TTTCCTTGGG GCCATGCAGA CCCAGAGCAC TTGTCCCACT 600 

TGCCACGGAG AGGGTGAGAT CATCACGAAG CCATGCTCCA AGTGTAAGGG CGAAGGTGTG 660 

GAGATCGGCG AAGAGGTGAT CTCATTCCAC ATCCCTGCCG GTGTAGCCGA AGGAATGCAA 7 20 

ATGTCCGTGA ACGGCAAGGG AAATGCCGCG CCCCGAGGAG GCGTGAATGG CGACTTGATA 7 80 

60 GTCGTGATCG CCGAGGAACC GGATCCGAAT CTGATCCGCA ATGGCAACGA TCTGATATAC 84 0 

AATCTGCTTA TATCCGTTCC GTTGGCTATA AAAGGAGGTA GTGTGGAAGT GCCGACGATA 900 

GACGGACGAG CCAAGATCCG CATCGAGGCG GGGACACAAC CCGGCAAGAT GCTGCGTTTG 9 60 

CGCAATAAGG GGTTGCCCAG CGTAAACGGC TATGGCATGG GAGACCAACT GGTGAATGTC 102 0 

AATGTCTATA TCCCCGAATC GATCGATGCC AAAGATGAGC AGGCTATCGC AGCGATGGAA 1080 

65 AACTCGGACA GCTTCAAACC TACCGATGCT GCTCGTAAGG ATATAGACAA GAAATACAGA 114 0 

GAGATGCTGG AT 1152 



(2) INFORMATION FOR SEQ ID NO: 29 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 927 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDl-IESS: double 

(D) TOPOLOGY: circular 
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10 



50 



55 



fii) nOLECULE TYPE: DIJA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AtlTI-SEIJSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHVROHONAS GINGIVALIS 

{!>:) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 . . . 927 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 9 

CGATTCGATT CAACAACTAA TGTCTCACAA ATTAATTTAA GAACAGAGAT GAAAAAACTG 60 

ATTTTAGCGA CTTTGGGACT TATGGCCATT GCCATGCTCT CATGTTCAAG CAACAACAAG 120 

GATTTGGAGA ACAAAGGGGA GGCTACTCTT TTGGTAACGT TTGGTAGCTC CTATAAAGCT 180 

20 CCACGCGAAA CCTATGCGAA GATTGAGAAG ACTTTTGCCG CAGCTTATCC CGATCAAAGG 240 

ATAAGCTGGA CATACACGTC TTCTATTATC CGAAAGAAAC TGGCTCAGCA GGGTATTTAT 300 

ATCGATGCTC CGGATGAGGC TTTGGAGAAA TTGGCTCGTC TGGGTTATAA GAAGATCAAT 360 

GTACAGAGTC TTCATGTGAT TCCCGGCCGA GAATATGATG AGATGATCGA CTTTGTC7U\T 4 20 

AAGTTTAAGG CAGCACATAG TGATATTACT GTGAAGGTAG GGGCTCCGCT TTTCGATACC ^80 

25 GATGAAGATA TGCGCGAGGT GGCAGAGATC TTGCACAAGC GTTTTCAGCA AACGATAGAG 5^0 

AAAGGTGAAG CTATTGTATT CATGGGACAC GGCACCGAGC ATGCTGCCAA TGACAGGTAT 600 

GCCCGTATCA ATAAGATCAT GAAGAACTAT AGCAAGTTCA TGATCGTCGG AACCGTCGAG 660 

TCCGATCCCT CTATCAATGA TGTTATTGCC GAACTGAA.2,G AAACCGGTGC CACGGCCGTA 7 20 

ACAATGATGC CGCTGATGAG TGTGGCAGGC GACCATGCTA CGAATGATAT GGCCGGAGAT 7 80 

JO GAGGACGATA GCTGGAAGAC GTTGCTGACC AATGCCGGCT ACACAGTTTC TATAGACAAG B40 

CTGGACAATG GCAATTTCTC AGCTCTTGGA GATATAGAAG AGATCCGGAA TATCTGGCTC 900 

AAGCATATGA AAGCCACCTC TGCTCGC 927 

35 (2) INFORMATION FOR SEQ ID NO: 30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1473 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) IIOLECULE TYPE: UNA (genomic) 
45 (iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROHONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAIIE/KEY: misc_feature 

( B ) LOCATION 1 . . .14 7 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 



CGGAGAGATG CGCGGAGGCA ACTGGTACGC CGTAGGAGGA AAGAGCTATC TGGCACAGCA 60 

^TACGCGAT GCCGGAGGAG AGTATTTCCT GAAAGACGAT CAGCGATCCG GTGGTGTTTC 12 0 

60 CCTCGACTTC GAGACGGTCT ACAGCCGATC CGATTCGGCT CGCTACTGGC GAATCCTCAA 18 0 

CAGCTATCCC GGGGAGTTCG GTTACGAAGC TTTGAAAGCC GAAGACAGCC GATATGCCGA 24 0 

TTTCAGGGCA TTCAAGGAAA AAGGCGTGAT ATACTGCAAC CTGAGGGAAA CAGCCTTCTA 300 

CGAACTCATG CCCATGCATC CCGATTGGGT GCTGGCCGAT CTTATCGCTA TCTTGCACCC 360 

CGGACTACTT CCCGACCACC AACCGCATTT CTATTATTTG CTCCAATGAC ATCCGTCAGC 42 0 

b5 CACTTACGTA CAATTTCTGT CGCAGGTATC CTGGCTGCGC TGGGAGGGGC TGTACTCATT 4 80 

CTCTTCGGGG TTAATCTCTT CCTCGGCTCG GTGGCTATTC CGATGAGCGA GATCTTCCGA S4 0 

CATCTTTTTT CAGATCGTCC CGAAGGAGGA GAAGCACTCG TGCACTACAA TATCCTATGG 600 

AAATCCCGCC TGCCCGAAGC CCTCACGGCT GCTTTTGCCG GCGCAGGTTT ATCCGTTAGT 660 

GGCTTGCAGA TGCAGACCGT CTTTCGCAAT CCTTTGGCCG GTCCGTCCGT TCTCGGCATC 72 0 

70 AGCTCCGGTG CCAGTTTGGG TGTTGCTTTG GTCGTTCTGC TGAGCGGCTC GCTGGGAGGA 780 

GTGGCATTGA GTAGCCTGGG TTATATGGGC GAGGTGGCCA TGAATATAGC CGCTGCCGTA 84 0 

GGCTCGCTGG CAGTAATGGG GCTGATCGTT TTTGTCAGCA CCAAGGTGCG CAGCCACGTT 900 

ACGCTGCTCA TTATCGGCGT TATGATCGGA TATGTAGCCA CTGCCGTCAT CGGGGTATTC 960 

AAGTTTTTCA GTATCGAAGA AGATATTCGG GCATACGTAA TTTGGGGGTT GGGCAGCTTT 102 0 

75 TCCCGTGCCA CGGATTCGCA ACTGAGTTTC TTTGCCATTC TGATGTTGAT CTTTATTCCG 108 0 
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GCCGGTATGC TCCTTGTCAA GCAGTTGAAT CTCTTATTGC TGGGAGAAAG CTACGCACGT 114 0 

AATCTGGGAC TGAATACTCG TCGGGCACGG CTGCTCGTGA TCTCTTCCGC CGGTTTGCTC 1200 

ATCGCTACCG TCACGGCCTA TTGCGGTCCC ATCGGCTTTT TGGGGATGGC TGTGCCACAC 1260 

TTGGCACGGG TTATCTTTCA CACATCGGAT CATCGGATCC TGATGCCTGC TACCTGTTTG 1320 

5 ATTGGAAGTG CTCTGGCTCT TTTCTGCAAT ATCATTGCTC GTATGCCGGG GTTTGAGGGG 138 0 

GCTTTGCCCG TCAATTCCGT AACGGCTTTG GTGGGAGCAC CTATTATCGT CACCGTTTTG l^^O 

TTCCGGCGCA GACGCTTCAA GGAAGAAACC GAC 147 3 

10 (2} IMFORMATIOjg FOR SEQ ID NO:31 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2289 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRAJIDEDHESS: double 

( D) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DIIA (genomic) 

20 (iii) HYPOTHETICAL: NO 

( iv) ANTI-SENSE : NO 

(Vi) ORIGINAL SOURCE: 
25 ( A ) ORGAN I SM : PORY PHYRaiONAS GIHGI VALI S 

(i:-:) FEATURE: 

(A) NAJIE/KEY: misc_feature 

(B) LOCATION 1 . . . 2289 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31 



CATTTTTTAG TAACGATTAT GCGGACAAAA ACTATCTTTT TTGCGATTAT CTCTTTTATT 60 

GCTCTATTGT CGTCTTCTCT GTCGGCTCAG AGCAAAGCCG TTTTAACCGG TAGTGTGTCG 120 

35 GATGCCGAAA CCGGAGAGCC TCTTGCCGGT GCTCGAATCG AAGTCAAACA CACCAACATA 180 

GTAGCCGGTG CCGATGCCGG CGGACATTTC GAGATCAAGA ACCTGCCGGC AGGGCAGCAT 24 0 

ACTATTATAT GTTCGTTGGG GGGGTATGGA CAGAAAGAGG AGGTGGTTGC CATCGAAGCC 300 

GGACAGACCA TWVCGATCTC TTTTGCATTG CGACTGCGAA CGAACAACTT GGAGGAAGTC 360 

GTCGTTACCG GTACCGGTAC ACGTTACCGC TTGGTCGATG CTCCTGTGGC AACGGAAGTC 4 20 

40 CTTACCGCTA AGGACATAGC CTCTTTCTCG GCTCCTACTT CCGAGGCCTT ATTGCAGGGG 4 80 

CTGAGTCCGT CTTTTGACTT CGGCCCCAAT CTGATGGGCT CTTTCATGCA GCTGAACGGC 54 0 

CTTAGCAGTA AGTATATCCT CATCCTTATC GATGGTAAGC GTGTGTACGG CGATGTAGGC 600 

GGTCAGGCCG ATTTGAGTCG TATTTCTCCT GATCAGATCG AACGGATCGA ACTGGTGAAA 660 

GGTGCTTCGA GTTCGCTCTA CGGATCCGAT GCCATCGCCG GGGTAATCAA TGTGATCACA 720 

45 7W\AAGAATA CGAATCGACT GAGTGCATAT ACGTCACATC GCATATCGAA GTACAACGAT 7 80 

CGGCAAACCA ATACTTCGCT CGATATAAAC ATCGGTAAGT TCAGTAGCAA TACCAACTAT 04 0 

TTCTTCTACC ATACGGATGG CTGGCAGAAT AGTCCGTTCG AAATAAAAAA GAAAAAAGGA 900 

TCCGGCGAAC CGGTCTTGGA GGAAACGTAT AAGAAAACTT TTCGTGCACA GGAAAATCAG 960 

GGTGTAAGCC AATCGCTTTC CTATTATGCA ACTAACAATC TTAGCTTCAG CGGAAATGTG 1020 

50 CAGTACAATA AACGTCAGAT CTTCACTCCG ACTTTTTCCG AAAAGAAGGC CTATGACATG 1080 

GATTATCGTG CTTTGACGGC TTCACTCGGT ACGAACTATC TTTTCCCCAA TGGTCTGCAT 1140 

ACGCTTTCTT TCGATGCCGT CTACGATCGC TTCCGTTTCG GATATTTGTA TCATGACAAG 1200 

GACAGCAGTG AGAGCCTGAT CAACAACCAA GGTCAGACCG AGCAACCCAC ATTCTTTCCG 1260 

GGTCAGCTAC GCAATAAAAA CGATCAGATC CGATACACGG CAGAGGCTCG CGGTGTATTT 1320 

55 ACACTGCCTT ATGCGCAGAA ACTGACCGGC GGTTTGGAGT ATTTCCGTGA GGAATTGATC 1380 

TCTCCCTATA ATTTGATTAC CGACAAGGCA GATGCTTCCA CGCTCTCTGC TTATGTACAA 14 40 

GATGAATGGA AACCGCTCGA TTGGTTCAAT ATGACAGCCG GTTTCCGTCT GGTACACCAT 1500 

CAGGAGTTCG GTACACGAAT GACGCCTAAG GTATCCATAC TCGCCAAGTA TGGGCCGCTG 1560 

/\ACTTCCG03 CTACGTATGC TAACGGCTAT AAGACTCCCA CGCTGAAAGA GCTTTTTGCA 1620 

60 CGGAACGAAC TCACCACTAT GGGTTCGCAC AATCTCTATC TCGGCAATGC GGATCTTAJ\G 1680 

CCACAGATGT CGGATTATTA TGCTTTGGGC TTGGAGTACA ATCAAGGCCC TATCTCGTTC 17 4 0 

AGTGCAACGG TTTATGACAA TGAACTTCGC AATCTGATCT CCTTTATGGA TATACCGACC 18 00 

TCACCCGAGC ACGAAGCTCA GGGAATCAAG AAAACCAAGC AGTATGCCAA CATAGGAAAA 18 60 

GCTCGCAGCC GCGGCCTTGA TGTCCTATGT GATGCCTCTA TCGGTTGGGG TATCAAGTTA 1920 

65 GGAGCCGGAT ACAGCCTCGT GGAAGCTAAG AATCTCCAGA CGGATGAGTG GCTGGAAGGA 1980 

GCTGCACGTC ATCGTGCCAA TGTGCACGCC GATTGGGTTC ACTACTGGGG TCAGTATAGA 204 0 

CTTGGCGTGA GCCTTTTCGG CCGTATTCAG AGCGAGCGTT ACTACAAAGA CGGCAATGCT 2100 

CCGGACTATA CCTTGTGGCG ACTCGCCACA TCGCATCGTT TCGCTCATTT CCGCCACATC 2160 

ATCCTGGATG GAACGCTCGG TATAGACAAC CTGTTTGACT ACGTGGATGA TCGTCCTATG 2220 

70 GGTGTCAATT ATGCTACCGT 7\ACGCCGGGA CGTACTTTCT TTGCTCAAAT AGCGATTCGA 22 80 

TTCAACAAC 22 8 9 



75 



(2} INFORMATION FOR SEQ ID 110:32 
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15 



(i) SEQUEKCE CHARACTERISTICS: 

(A) LEMGTH: 1095 base pairs 

(B) TYPE: nucleic acid 

(C) STRAtlDEDMESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: WO 
(iv) AIITI-SEHSE: HO 



{vi) ORIGINAL SOURCE: 

(A) ORGAIJISM: PORYPHYROHOHAS GIHGIVALIS 

(ix) FEATURE: 

(A) NAI IE/KEY: misc_f ea ture 

(B) LOCATION 1. . . 1095 

20 (>:i) SEQUENCE DESCRIPTIOM: SEQ ID ilO:32 

TGCTGCGCAA GGCTTCGGGA CACCTCTCCC CGGAAGAAGT CGTTTCGGAA TGATACCGTT 60 

TTTCCTTATT TCGTTATTCA TCTGATCAAA CATATTATCA TTATGACGGA C7U\CAAACAA 120 

CGTAATATCG TATTCCCGGC GTTTCTCCTC TTGCTGGGAG TCATCGCAGT GGTGACGATC 180 

25 GTTGGTTTTT TCATGCTCAG ACCGGCCGAG GAGATTATCC AAGGACAGAT AGAAGTGACC 24 0 

GAATACCGAG TGTCCAGCAA AGTGCCCGGG CGCATCAAGG AACTTAGGGT ATCCGAGGGA 3 00 

CAGCAGGTGC AGGCCGGCGA TACCCTCGCT GTCATCGAAG CCCCCGACGT AGCGGCTAAG 3 60 

ATGGAGCAGG CAAAGGCTGC CGAAGCAGCT GCACAGGCTC AGAACGCCAA GGCTCTCAAA 4 20 

GGAGCACGCA GCGAACAGAT ACAGGCAGCC TATGAGATGT GGCAGAAAGC TCAGGCCGGC 4 80 

30 GTAGCCATAG CGACCAAGAC ACACCAGCGC GTGCAGAACC TCTATGACCA GGGAGTGGTA 54 0 

CCGGCTCAGA AGTTGGACGA AGCCACTGCC CAGCGCGATG CGGCCATCGC TACGCAAAAA 600 

GCGGCCGAAG CCCAGTACAA TATGGCTCGC AACGGTGCCG AACGCGAAGA CAAGCTGGCA 660 

GCTTCTGCCC TCGTCGATAG AGCGAGAGGA GCCGTCGCCG AGGTGGAGTC GTACATCAAC 7 20 

GAAACCTACC TCATCGCCCC ACGGGCAGGC GAAGTGTCGG AGATATTCCC CA7\AGCCGGC 7 80 

35 GAACTCGTAG GTACCGGCGC ACCTATCATG AATATCGCCG AGATGGGCGA TATGTGGGCC 84 0 

AGCTTTGCCG TTCGTGAGGA TTTCCTCAGC AGCATGACCA TGGGAGCCGT TCTGGAGACT 900 

GTGGTGCCGG CTCTGAATGA AGAAAAAGTA CGCTTCAAGA TCACATTCAT CAAGAACATG 960 

GGTACCTATG CTGCCTGGAA AGCGACCAAG ACAACAGGGC AGTACGACCT GAAGACCTTC 102 0 

GAGGTAAAGG CCACCCTTGC GGATAAAGAC AAGGCACAAA AGCTACGCCC GGGTATGTCC 10 80 

40 GTGATCATAC GCAAG 10 95 



(2) INFORIIATION FOR SEQ ID NO: 33 



45 



50 



55 



{i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 960 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DfJA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) AUTI- SENSE: NO 



60 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROt40I IAS GINGIVAL IS 

(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION 1 . . . 960 



65 



70 



75 



{::i) SEQUENCE DESCRIPTION: SEQ ID MO: 33 



CCGCAATCCT 
AGCGTTTTAG 
GTACATCAAC 
TTGTTCGTCT 
GTTTACCTGC 
AAAATTGAGA 
ACGCCCTACA 
TCGAACGAAG 
ATCTATGTAG 
TCCAGTGGAT 



CTCCTGATCG 
AGGTCGGCAA 
AACCCCCTTT 
CTTTTTCGGT 
AAGATATCCA 
AGGACGATGT 
ACCAAGTGTT 
GCTTCCTGGT 
AGGGCCTTAC 
TTATCAAGGA 



AAGAAGCTTC 
AATTGGGATT 
TTTAAGCCAT 
TTTGCTTTTT 
AACTTTTAAT 
GCTGAACATC 
GACCACTCGT 
CGATTCGAAA 
TCGTACCGAA 
TCCTACGGTA 



CAAAACGTCA 
GTGATTATCT 
AAATCAATTA 
GCATCATGCC 
CGGGAGATTA 
CTTGTCAGCA 
GCACTGGCCC 
GGGTACATCA 
CTGGAGAAGG 
ACGGTGCAGC 



TGAATAAATA 
TTGCGCCCAT 
TGCGTATTGT 
GTTCCCAGCG 
TCGCTA7VACC 
GTAGAGACCC 
GCAACGGCTA 
ATTATCCTAT 
AGATACAGAA 
TTCAAAATTT 



TCATTCTCAA 
AGTACGGAAT 
CAGTAATTTT 
AGAAAAGGTC 
ATATGACGTA 
GGAGCTTTCA 
TGGAACGAAC 
TTTAGGCCAG 
GAGGATTATT 
CAAGGTGTCG 



60 
120 
180 
240 
3 00 
360 
420 
480 
540 
600 
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GTTTTGGGAG AGGTGAATCA TCCGGGTTCG ATGTCGGTAA AAGGAGAGCG AATAACTCTT 660 

TTGGAAGCGA TCGGAATGGC CGGAGACCTG ACAATCTATG GTCGCCGCGA TCGGGTTTTT 720 

GTGATTAGAG AAACCGATGG GCATCGCGAG GTTTTCCAGA CGGATCTCAG AAAGGCCGAC 7 80 

TTGCTCGCAA GCCCCGTGTA CTATCTGCAT CAGAACGACG TCATCTATGT GGAGCCGAAC 84 0 

5 GACAAGAAAA CACAGATGAG CGAGATCAAC CAGAATAATA ACGTAAACGT ATGGCTGAGT 900 

GTTACCTCCA CTTTGGTATC CATTTCCACG CTGACGATTA CGATAATAGA TAAGACCAAA 960 

(2) IMFORltATIOW FOR SEQ ID MO: 34 

10 

(i) SEQUEIiCE CHARACTERISTICS: 

(A) LENGTH: 1746 base pairs 

(B) TYPE: nucleic acid 

( C ) STRAMDEDNESS : double 
15 (D) TOPOLOGY: circular 

(ii) ItOLECULE TYPE: DHA (genomic) 

(iii) HYPOTHETICAL: MO 

(iv) AMTI-SENSE: UO 

(vi) ORIGINAL SOURCE: 

{ A ) ORGAMISM : PORY PHYROHOI IAS GI 1 IGI VALI S 



20 



25 

(ix) FEATURE: 

( A ) rJ AJ' IE / KE Y : mi s c_ f e a t u r e 

(B) LOCATION l..,1746 

30 (xil SEQUENCE DESCRIPTION: SEQ ID NO: 34 

TCAAAAATAG TACTACGAAA GTTTTGTACC TTAGCGCGCA TGAAAAAGAC CAATCTGTTT 60 

TTATCTCTGC TGGTGATCTT TATCACCGGT AGTTTTATGA CTGCCTGTGC ACAGAAGTCC 12 0 

AAGACGAACA AACTCACCGA AGAAGATCGG AGCCGCAATG AGTATGTACA GTCGATGGAT 180 

35 GTGCTTAGCA ATATTATCGG TAACGTCAGG CTGTATTTCG TCGATACCAT AAGTATCAAA 24 0 

CATATGACTC GGCGTGGTAT AGATGCGATG TTGGGCGGGC TTGACCCCTA TACCGAATAC 300 

ATTCCTTACG AGGAAATGGA TGAACTGAAA TTGATGACTA CGGGAGAGTA TGCCGGAGTC 360 

GGAGCTATCA TATCGCAGCG CCCGGATAGT GCTGTGATTA TCCAGAGACC TATGGAAGGT 4 20 

ATGCCCGCAG ACGAAGCAGG ATTGATAGCA GGCGACCGCA TCCTGACTAT CGATGGGAAA 4 80 

40 GACTTCCGCA AATCCACCAC ACCGAAAGTA AGCCAAGCAC TGAAAGGGAT AGCCGGTACT 54 0 
GTTGCAAAGG TGACAGTAAT GCGCTATGGC GAAACCAAAC CTCGTACTTT TTCCGTGAAA 600 
CGTCAAAAAG TGATTATGAA TTCCGTCACT TACAGCGGAA TGCTCGATGG CTCGATAGGA 660 

TATATCCGCT TGAACAACTT TACGGACAAA AGTGCAGAAG AGGTGCGCAC GGCCTTGTTG 72 0 

GATCTTCGTG ACAAACAAGG AGCGAAAGGT CTCATTTTGG ATTTAAGAGG CAATGGTGGC 7 80 

45 GGACTGATGC AGGCTGCTAT CGAGATAGTC AATCTGTTCG TCCCTAAGGG CAAAGAGGTG 84 0 

GTAACGACCA AAGGTCGCAT TGCAGAGTCG GCGTCCGTAT TTCGCACATT GACTGAACCG 900 
ATCGACACGA AACTCCCGAT AGTAGTCCTG ATCGATGGAC AATCGGCATC TTCCTCGGAG 960 

ATTGTAGCCG GAGCACTGCA GGATATGGAC AGGGCTGTAC TGATGGGACA AAAGAGCTAT 102 0 

GGCAAAGGGC TTGTACAAAC GACTCGTCAG CTACCATACA ACGGCGTGAT CAAATTGACT 108 0 

50 ACGGCCAAGT ACTACATCCC AAGCGGACGT TGTATTCAGC GTTTGGACTA CAGCCGCACC 114 0 

AATCGGACAG GTATGGCAAC GGCCATTCCT GACAGTCTGC ACAA/^TCTT TTACACTGCT 1200 

GCCGGAAGAC GTGTAGAAGA TGCAGGAGGA ATCCTGCCTG ACATCGAGGT CAAACAAGAT 12 60 

ACAGCTGCGA CATTACTTTA TTATATGGCC ATCAATAATG ACGTTTTCGA TTTCGTCACA 1320 

GGTTATGTGC TCAAGCATAA AACGATTGCC AAGCCGGAGG ATTTTTCCAT AACGAACGAG 138 0 

55 GACTATGCAG CTTTCTGCAA GATGATGGAA GAAAAGAAAT TTGACTATGA TCGCCAGAGT 14 4 0 

GGCAAGATGC TTGACAAACT GGAGGAACTG GCTAAGATAG AAGGCTACCT GCCGGAAGCC 1500 

AACTCGGAGC TTAAAGCACT ACGCGAAAAG CTAAAACCCA ACCTGTCGCG TGATCTGCTA 1560 

CGATTCAAAA AGGAGATAAC AAACTATCTC AACAATGAGA TTGTCACTCG CTATTATTAT 162 0 

GAGCGAGGCA GTATCCGCCA GAGTTTGCCG GAAGATAAGG TAGTCAAAGA AGCTATTAAG 168 0 

60 CTGCTGAAGG ACCATCCGGA ACAAATTCGA CAGATCCTTG CAGCTCCGAA AGCAGAGAAT 174 0 

AAAGGG 17 4 6 



65 



75 



{2) INFORIIATION FOR SEQ ID NO: 35 



(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 2955 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
70 (Dl TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: HO 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO _9929870A1 _L> 



wo 99/29870 



PCT/AU98/01023 



23 / 490 



(iv) AtlTl-SEHSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAJiISM: PORYPHYROMOHAS GIMGIVALIS 

5 

(i>:) FEATURE: 

(A) NAltE/KEY: misc_f eature 

(B) LOCATION 1 . . .2 955 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 

GTAACAGACA AAATGCAAAA CAAAGGATTT GTGATTGTTA TCACATCGGC TCTGGCCATC 60 

ATCTGTGCGT TTTACCTGTC ATTCTCTTTC GTTACGAACC GTTACGAAAA GAAGGCTAAG 120 

GCGATGGGCG ATGTTGCCGG AATGGCCTAT CTTGATTCCA TGTCGAATGA GAAGGTCTGG 180 

15 TTCGGCTACA CGCTGAAAGA AGCTCAAGCC CAGCAAATTG GTCTTGGCCT TGACTTAAAG 2^0 

GGGGGTATGA ACGTTATCTT GAAACTTAAC GCAAGCGATC TGCTTCGTAA CCTCTCTAAC 300 

AAAAGTTTGG ATCCCAACTT CAACAAAGCT CTGGAGAATG CTGCCAAGAG CACGGAGCAA 360 

TCCGACTTCA TCGATATTTT CGTGAAGGAA TATCGCAAGC TCGATCCCAA CGGTCGCTTG 4 20 

GCCGTTATCT TCGGTTCGGG TGACCTTCGC GACCAGATTA CCGCAAAGTC TACGGATGCA 4 80 

20 GACGTAGTGC GTCTGCTCAA AGAAAAATAT AATAGTGCTG TAGAAGCTTC GTTCAATGTG 54 0 

CTCCGTGCTC GTATCGATGC TTTCGGTGTG GTTGCACCTA ATTTGCAGCG ATTGGAAGGG 600 

CAAGGGCGTA TCCTTGTCGA ACTCCCCGGA GTGAAAGACC CTGAGCGTGT TCGTACCCTT 660 

TTGCAACGCA GTGCCAACCT ACAGTTCTGG CGTACATACA AATTCGAAGA GGTCAGCGGA 7 20 

GACTTGATCG CTGCCAATGA TCGTCTGAGC GAATTGGCTA TGAACAACAC GGATGCTACC 7 80 

25 CCGGAAACAG AGCCTGCAAC TACTGACTCT GTAGCTGCAA CAGCCGATTC TGCTGCTGTA 64 0 

CAAGCTGTAG CTGATTCTGC TACTGTAGCA CAAAAAGAGG CCAAGGATGC TACTCGTAAA. 900 

GACGCACTCT TCTCTCTGCT TACTCCCGTG AATCGTGGCG GTGCAGTAGT GGGTGTGGCT 960 

CGTCGTGCTA ATATGGCTCA GATATCTGAA ATGCTCCAGC AAGCTCACGA TCTG7VAGGTT 1020 

ACACGTGAAG ATGTGCTTTT CCTCTGGGGT GCTAAAGCAA TCGAAGACCC CGAAACCAAA 1080 

30 AAGGAGACCG ACCTCTACGA ACTCTATGCT ATTCGTACCA ATCGTACGGG AGATCCTGAT 114 0 

TTGGGAGGTG ATGTAGTGAC TTCCGCCAAG AGTGATATCC AAAATGACTT CGGTCGTTCC 1200 

GAACCGATCG TTTCGATGAC GATGAATGAA GAAGGTGCTC GTAAATGGGC GCGTATCACA 12 60 

AAGGATAACG TGGGACGGGC AATCGCTATC GTTTTGGATG GTGTGGTTTA TTCTGCTCCG 1320 

AACGTGAATG ATGAGATCAC GGGCGGTCGC TCTCAGATCT CCGGGCACTT CACCGTGGAG 1380 

35 GAGGCCGGTG ACCTTGCCAA CGTACTCAAC TCCGGTAAAA TGGATGCTAC GGTAAGCATC 14 40 

GAACAGGAAA ACGTGATTGG TCCTACGCTG GGTGCCGAGT CCATTAAAGC CGGATTCTTG 1500 

TCGTTCCTGC TCGCTTTGGT TATCCTGATG TGTTACATGT GTCTGGCTTA CGGTTTCTTG 1560 

CCGGGTCTTA TCGCAAACGG CGCATTGATT GTAAACAGCT TCTTCACATT GGGCGTATTG 1620 

GCTTCTTTCC ATGCCGTGCT GACCCTCTCG GGTATCGCAG GTTTGGTGCT GACGCTGGGT 1680 

40 ATGGCTGTGG ATGCCAACGT ACTTATCTTC GAGCGTATCA AAGAAGAGCT TCGTGCCGGT 17 4 0 

AAGACTCCGA TTCGTGCCGT TACGGATGGT TATGGCAACG CTTTCTCTGC CATCTTCGAC 18 00 

TCGAACGTTA CGACTATTAT TACCGGTATC ATCCTATTCC TCTACGGGAC GGGGCCGATT 18 60 

CGCGGTTTTG CCACTACGTT GATTATCGGT CTTATCGCTT CTTTCATTAC GGCTGTCTTC 1920 

TTGACTCGTA TCGTCTTCGA GAAACTGGCG AAAAAAGGTC GTTTGGATAA GATTACATTC 1980 

45 ACTACGAGCA TTACTCGCAA TCTCCTTGTC AATCCCTCAT ACAACATCTT GGGTAAGCGC 2040 

AAGACCGGCT TTATCATTCC GGTGATTATC ATCGTTTTGG GACTTATAGC TTCATTTACA 2100 

ATCGGTCTCA ATAGGGGTAT TGAATTCTCC GGAGGACGTA ACTACGTAGT TAAATTCGAC 2160 

CAGCCTGTAT CTTCCGAAGC CGTTCGTTCG GCCTTGTCTT CTCCCCTGCA GGAAAAGGTA 2220 

TTGGTTACCT CCATCGGTAC TGAAGGGACA GAGGTGCGTA TATCTACGAA CTATAAGATC 22 80 

50 CAGGAGGAAA GCGAAGAAAC TGAAGCAGAG ATTACTGACA AATTGTATCA GAGCCTGAAA 234 0 

GGTTTCTACA CCCAGCAGCC TACTGCTGAT CAGTTCTTGG ACAATATCAT TAGCTCTCAG 2 4 00 

AAAGTAAGTC CCAGTATGTC GAGTGACATC ACGAGAGGTG CTATTTGGGC TGTGCTGTTA 2 4 60 

TCGATGATCT TCATGGCCAT TTATATTCTG ATTCGCTTCC GTGACATTTC TTTCTCTGCC 252 0 

GGGGTATTCG TATCTGTGGC CGCTACTACA TTCTGCATTA TTGCTCTGTA TGCGTTGCTG 2 580 

55 TGGAAGATTC TGCCCTTCAC CATGGAGATC GATCAGAACT TCATCGCTGC TATTCTGGCT 2640 

ATCATCGGTT ACTCGCTCAA TGACACCGTG GTTGTATTTG ACCGTATCCG AGAGACGATG 27 00 

AAATTGTACC CCAACAGAGA TCGCTATCAG GTGATCAACG ATGCCCTTAA TTCAACATTG 27 60 

GGTCGAACAT TAAATACGTC TTTGACTACG TTTATCGTTA TGTTGGTAAT CTTCATCTTT 2820 

GGAGGTGCTA CGATGCGTAG TTTCACGTTC TCGATCCTGC TCGGTATCGT TATCGGTACA 2 8 80 

60 TACTCTACGC TCTTTGTTGC TACACCCCTT GCCTACGAGA TCCAAAAGCG CAAGCTCAAC 2 94 0 

AAAGCAGCTA AGAAA 2 955 



65 



75 



{2) INFORMATIOM FOR SEQ ID NO: 36 



(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 3138 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 
70 (D) TOPOLOGY: circular 



(li) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: HO 
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60 



65 



70 



(2) IIJFORMATIOM FOR SEQ ID 110:37 

(i) SEQUEIICE CHARACTERISTICS: 

(A) LEtJGTH: 2607 base pair^ 

(B) TYPE: nucleic acid 

(C) STRAIIDEDHESS: double 

(D) TOPOLOGY: circular 

75 (ii) MOLECULE TYPE: DMA (genomic) 



(iv) AriTI-SEMSE: NO 

<vi) ORIGINAL SOURCE: 

(A) ORGAIIISM: PORY PHYROMOHAS GIUGIVALIS 

5 

(ix) FEATURE: 

(A) WAI IE/KEY: miEC_feature 

(B) LOCATION 1 . . . 3138 

10 SEQUENCE DESCRIPTION: SEQ ID NO: 36 

GACTTCCCTT GGGTTAGGGT AAAACCGGAG AAGAAAAGAA AACAACATAA CAGTAATAAT 60 

TTTAAGTTTA ACGCAAAAGA AAAGTCTATG AAAAGAATGA CGCTATTCTT CCTTTGCTTG 120 

CTGACGAGCA TTGGGTGGGC TATGGCCCAG AATAGAACCG TGAAGGGTAC AGTTATCTCC 180 

15 TCCGAGGATA ATGAGCCCCT GATCGGCGCG AATGTCGTGG TTGTCGGAAA CACCACTATC 24 0 

GGTGCTGCAA CCGACTTGGA TGGCAACTTC ACGCTTAGCG TGCCTGCCAA TGCCAAAATG 30n 

TTGAGAGTGT CCTATTCCGG TATGACTACC AAAGAGGTCG CCATCGCTAA TGTGATGAAG 360 

ATCGTACTGG ATCCGGACTC TAAGGTTCTG GAGCAGGTAG TTGTATTGGG TTACGGTACG 4 20 

GGACAGAAAC TCAGCACTGT TTCCGGTTCT GTGGCCAAAG TGTCCAGCGA AAAGCTCGCG 4 00 

GAAAAGCCCG TTGCCAATAT CATGGATGCC CTCCAAGGTC AGGTAGCCGG TATGCAGGTT 5^0 

ATGACTACAT CCGGTGACCC TACTGCCGTC GCTTCTGTGG AGATCCATGG TACAGGGTCG 60^ 

TTGGGGGCAA GCTCTGCACC ATTGTATATC GTGGATGGTA TGCAAACTTC TTTGGATGTT 66^ 

GTGGCTACGA TGAATCCGAA TGATTTTGAA TCTATGTCCG TTTTGAAAGA TGCTTCTC^A 7-^0 

ACATCTATTT ATGGAGCTCG TGCTGCAAAC GGAGTCGTTT TCATTCAAAC GAAGAAAGGT 7 80 

Zt) AAAATGAGCG AGAGAGGTCG TATTACCTTT AATGCCAGTT ACGGGATTTC TCAAATCCTG 84 n 

AATACTAAGC CCCTTGATAA TATGATGACT GGAGATGAAT TGCTGGATTT TCAGGTGAAG 900 

GCAGGTTTTT GGGGGAACAA TCAAACCGTT CAGAAGGTTA AAGATATGAT CCTTGCCGGA ^60 

GCTGAAGATT TGTATGGCAA TTATGATTCT TTGAAAGATG AGTATGGTAA GACATTGTT^ 10-^0 

CCAGTGGATT TTAATCATGA TGCAGACTGG CTCAAGGCTT TGTTTAAAAC AGCACCCACC 108 0 

CJU AGTCAAGGTG ATATTTCTTT CTCCGGAGGG TCTCAGGG.AA CTTCATATTA TGCCTCTATA 114 0 

GGCTACTTCG ATCAGGAAGG TATGGCTCGT GAACCGGCAA ATTTTAAGCG CTATAGTGC^ 1200 

CGGCTCAACT TCGAAAGTCG TATCAATGAA TGGCTGAAAG TTGGTGCAAA TTTGTCTGGT 1260 

GCGATAGCGA ATAGACGATC TGCCGACTAT TTTGGAAAGT ATTATATGGG GTCAGGTACT 1320 

TTCGGTGTGT TAACGATGCC TCGTTATTAT AACCCTTTTG ATGTGAATGG GGATTTAGCA 13 80 

J5 GATGTCTATT ACATGTATGG AGCTACCAGA CCTTCTATGA CAGAACCGTA CTTCGCAAAA 14 40 

ATGAGACCGT TCAGTTCCGA ATCACATCAG GCCAATGTAA ATGGTTTCGC CCAGATTA^T 15 no 

CCGATCAAAG GCCTTACTTT AAAGGCACAG GCTGGTGTTG ATATTACTAA TACTCGCACT 1560 

TCTTCTAAGA GAATGCCCAA TAATCCGTAT GATTCTACTC CTCTTGGGGA AAGMGAGAA 162 0 

AGAGCTTATC GAGATGTTAG CAAGTCTTTT ACAAATACGG CTGAATATAA GTTTTCAATT 1680 

4U GATGAAAAAC ATGATCTTAC AGCATTGATG GGGCATGAAT ATATTGAATA TGAAGGGGAT 17 40 

GTTATTGGGG CATCTTCTAA AGGATTTGAA AGTGATAAGT TGATGTTACT GAGCCAGGGA 18 00 

AAAACCGGAA ATAGTTTGTC TTTGCCTGAA CACAGAGTCG CTGAATATGC CTATTTGTCT 18 60 

TTCTTTAGTC GTTTTAATTA CGGTTTTGAC AAATGGATGT ATATAGATTT CTCTGTTCGT 1^2 0 

AATGACCAAT CCTCTCGATT CGGATCCAAT TVATAGAAGCG CGTGGTTCTA TTCTGTCGGT 198 0 

4b GGAATGTTTG ACATATATAA TAAATTCATT CAAGAAAGTA ATTGGCTCAG TGATCTTCGA 2 04 0 

CTGAAAATGA GTTATGGTAC AACGGGTAAC TCGGAGATTG GTAATTACAA CCACCAAGCA 2100 

CTCGTTACTG TGAACAATTA TACTG/\AGAT GCTATGGGGC TTAGCATTTC TACAGCAGGC 2160 

AATCCCGACC TCTCGTGGGA AAAGCAGTCT CAGTTCAACT TCGGTTTGGC TGCAGGGG^T 22''n 

TTCAATAATC GCTTATCTGC AGAGGTAGAT TTCTATGTCC GCACTACGAA TGATATGTTG 228 0 

t>U ATTGATGTCC CGATGCCTTA TATCAGTGGT TTCTTCTCAC AGTATCAGAA TGTAGGCTCT 2 34 0 

ATGAAAAATA CGGGTGTAGA CCTTTCTCTT AAGGGGACGA TCTACCAAAA TAAGGACTGG 2 4 Oo 

AATGTATATG CTTCTGCGAA TTTCAACTAC AATAGACAGG AAATAACAAA GCTTTTCTTC 2 4 60 

GGTCTCAATA AGTACATGTT GCCTAATACC GGTACTATAT GGGAAATTGG GTACCCCAAT 252 0 

TCGTTCTATA TGGCTGAATA TGCTGGAATC GACAAAAAAA CCGGTAAGCA GTTGTGGTAT 258 0 

GTTCCTGGTC AAGTCGATGC GGATGGTAAT AT^AGTTACAA CAAGCCAGTA CTCAGCTGAC 2 64 0 

TTGGAGACAC GAATTGATAA GTCTGTTACT CCTCCTATTA CAGGTGGTTT CTCCTTAGGT 27 00 

GCTTCTTGGA AAGGACTTTC TTTAGATGCT GATTTTGCCT ACATCGTTGG TAAATGGATG 27 60 

ATCAATAATG ACCGTTACTT TACAGAGAAT GCAGGTGGAT TGATGCAATT AAATAAAGAT 2 82 0 

AAAATGCTAT TGAATGCCTG GACAGAGGAT AATAAAGAAA CAGATGTTCC AAAATTGGGA 2 88 0 

CAGTCTCCTC AGTTTGATAC GCATTTGTTG GAGAATGCTT CTTTCCTGCG TTTGAAGAAT 2 94 0 

CTCAAACTCA CCTATGTACT CCCCAATAGT CTTTTTGCTG GGCAGAATGT GATTGGTGGA 3000 

GCTCGTGTCT ATTTGATGGC GCGCAATCTG TTAACTGTTA CGAAGTATAA AGGCTTTGAC 3 060 

CCTGAAGCAG GGGGGAATGT GGGAAAAAAT CAATATCCTA ATTCTAAGCA GTACGTTG^G 3120 
GGTATTCAGT TGTCTTTC 
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(iil) HYPOTHETICAL: MO 
(iv) AMTI-SENSE: HO 

5 

(vi.) ORIGINAL SOURCE: 

(A) ORGAI'IISM: PORV PHYROHOMAS GIUGIVALIS 

(ix) FEATURE: 
10 (A) IWIE/KEY: misc_featur« 

(B} LOCATIOIJ 1...2 607 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 37 

15 TGGCATAGGA ATATTTTTAT CTTTGCGAGT ACATTTAGCC CGAAAAATAT GCTCCCACTG 60 

CCATACCGTT ATGCAAAAAC CGAGCACCTT TTTCTCGCAA AAGGATACTG CAAGAATCCA 120 

ATAACAAACA TAATTATCCT ATTTATGAAG AAAAAGAATT TTTTGCTTCT TGGCATTTTC 180 

GTTGCTTTGC TGACTTTCAT CGGCAGCATG CAGGCACAAC AGGCCAAAGA TTATTTCAAC 240 

TTTGACGAAC GGGGCGAGGC CTACTTCTCA TTCAAAGTGC CTGATAGGGC CGTTCTACAA 300 

20 GAGCTGGCTC TGATCATGTC CATCGACGAG TTTGACCCCG TAACCAATGA AGCCATTGCC 360 

TATGCCAGCG AAGAGGAGTT CGAGGCATTC CTGCGCTATG GGCTCAAGCC TACATTCTTG 4 20 

ACTCCTCCAT CCATGCAGCG CGCTGTCGAG ATGTTCGACT ACCGCTCAGG AGAAAAATAC 4 80 

GAATGGAATG CTTACCCCAC CTATGAAGCC TATATCAGCA TGATGGAAGA GTTCCAAACA 54 0 

AAGTATCCAT CA-^TTTGTAC TACTTCCGTC ATTGGCAAGT CCGTAAAGGA TCGTAAACTG 600 

Z5 ATGATTTGCA AGCTGACGTC CTCTGCCAAT ACAGGGAAAA AGCCTCGCGT GCTCTATACT 660 

TCTACGATGC ACGGAGACGA AACGACCGGA TATGTGGTAC TGCTCCGACT CAIAGACCAT 7 20 

CTGCTGTCGA ACTACGAATC CGATCCGAGG ATTAAGAACA TTCTGGATAA AACGGAAGTA 7 80 

TGGATCTGCC CTTTGACCAA TCCGGACGGA GCATACAGAG CCGGAAACCA CACCGTACAA 84 0 

GGAGCTACTC GCTACAATGC CAACAATGTC GATTTGA^\CC GTAACTTCAA GGATGATGTA 900 

30 GCCGGTGATC ACCCCGATGG AAAACCTTGG CAGCCGGAGG CAACTGCATT CATGGATTTG 960 

GAAGGAAACA CCTCTTTCGT GCTCGGTGCC AATATACATG GAGGAACAGA GGTGGTGAAC 1020 

TATCCATGGu ATAATAAAAA AGAAAGACAT GCAGACGATG AGTGGTACAA ACTGATCAGT 1080 

CGCAACTACG CAGCCGCTTG TCAGAGTATT TCCGCCAGCT ACATGACCTC CGAAACCAAT 114 0 

TCGGGAATCA TCAACGGTTC AGACTGGTAT GTAATTCGCG GAAGTCGTCA GGACAATGCA 1200 

35 AATTATTTCC ATCGTCTGCG AGAAATTACC CTTGAAATCA GCAACACGAA GTTGGTGCCG 1260 

GCCTCTCAAC TTCCAAAGTA TTGGAATCTG AACAAAGAAT CTCTGCTTGC TCTGATCGAA 1320 

GAATCCTTAT ACGGCATCCA TGGTACAGTG ACTTCCGCTG CGAACGGACA GCCTCTCAAA 1380 

TGCCAGATCT TGATAGAAAA CCATGACAAG CGCAACTCCG ATGTTTACTC CGATGCTACC 14 4 0 

ACAGGCTACT ACGTACGTCC TATCAAAGCC GGCACTTATA CGGTGAAATA CAAAGCCGAG 1500 

40 GGTTATCCTG AGGC/IlACTCG TACCATTACG ATCAAGGACA AAGAAACCGT CATCATGGAC 1560 

ATI'GCATTGG GCAACTCGGT TCCTCTGCCT GTACCCGATT TCACAGCTTC TCCTATGACC 1620 

ATCTCAGTAG GCGAAAGCGT CCAATTCCTVA GATCAAACGA CAAATAACCC CACGAATTGG 1680 

GAGTGGACGT TCGAAGGCGG ACAGCCTGCC ATGAGTACAG AGCAGAATCC GCTCGTATCC 17 4 0 

TATAGTCATC CCGGTCAGTA CGACGTTACG CTCAAAGTGT GGAATGCAAG TGGTTCCAAC 1800 

45 ACGATTACGA AAGAAAAATT CATCACTGTC AATGCCGTTA TGCCTGTAGC TGAATTCGTC 18 60 

GGTACCCCGA CGGAAATAGA AGAGGGCCAG ACGGTATCTT TCCAAAACCA ATCCACCAAT 1920 

GCCACCAACT ACGTATGGAT ATTCGATGGC GGCACTCCCG CTACCAGTGA AGACGAAAAC 198 0 

CCGACTGTGC TTTACAGCAA AGCCGGCCAA TACGATGTCA CGCTCAAGGC GATCAGTGCT 204 0 

TCCGGTGA.AA CGGTGAAGAC GAAAGAAAAA TACATCACTG TCAAGAAAGC TCCGGTCCCT 2100 

50 GCTCCGGTAG CCGACTTCGA AGGAACACCT CGAAAAGTAA AGAAAGGCGA GACAGTTACT 2160 

TTCAAAGACT TGTCTACGAA CAATCCGACT TCATGGCTTT GGGTGTTCGA AGGCGGCTCT 222 0 

CCTGCCACCA GCACGGAGCA AAACCCGGTG GTCACCTACA ATGAAACAGG CAAGTACGAT 2280 

GTCCAGCTGA CTGCCACCAA CGAGGGCGGA AGCAATGTGA AGAAAGCAGA AGACTACATT 234 0 

GAGGTTATCC TCGATGACAG TGTCGAGGAC ATAGTGGCAC AGACGGGTAT CGTCATTCGT 2400 

55 CCGCAAAACG GAACGAAGCA GATCCTCATA GAAGCCAACG CTGCTATCAA AGCGATCGTT 24 60 

CTCTATGACA TCAATGGACG GGTCGTACTC AAAACTACTC CGAATCAGCT CCGCTCGACC 2520 

GTAGATCTTT CCATCCTGCC CGAAGGAATC TACACCATCA ATATCAAA^VC GGAAAAATCC 2580 

GCTCGCACGG AAAAGATCCA TATCGGG 2607 

60 

(2) IHFORMATIOH FOR SEQ ID HO: 38 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LEHGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDHESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA (genomic) 
(ili) HYPOTHETICAL: HO 
(iv) ANTI-SEHSE: HO 
75 (vi ) ORIGINAL SOURCE: 



65 



70 
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(A) ORGANISM: PORY PHYROHONAS GIIIGIVALIS 



10 



15 



(ix) FEATURE: 

{A) NAI-IE/KEY: miE-c_f eature 
(B) LOCATION 1 ... 318 

(xi) SEQUENCE DESCRI PTIOJJ : SEQ ID HO: 38 

CTTTCGTACA GTGGAGAGAG CGATGC/^AAA GAGTCTGATC AGAATTGCCG GAAATGTACC 
TTCATCGGCT TTGAAAAACG AGTAAATACG ATGCGACTGA TCAAGGCTTT TCTCGTGCAA 
CTCTTACTGC TCCCCATTTT CTTCTACAAG CGGTTTATAT CGCCGCTTAC ACCGCCTTCA 
TGGCGGTTTA CCCCCTCATG TTCGTCCTAT GCCATCGAAG CCTTACGTAA ATATGGCCCG 
GGCAAAGGAC TATTGCTGAG CATCAAGCGT ATTCTCCGCT GTCACCCGTG GGGTGGAAGT 
GGCTATGACC CCGTTCCG 



60 
120 
180 
240 
300 
318 



( 2 ) I N FORMAT ION FOR SEQ ID NO : 3 9 

(i) SEQUENCE CHARACTERISTICS: 
20 {A) LENGTH: 2583 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS: double 
(D) TOPOLOGY: circular 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



( i V ) Al IT I -SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 

(ix) FEATURE: 

(A) NAl-lE/KEY: misc_f eature 

(B) LOCATION 1_.2583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 



TTAGAGATGG 
GAAGCAACCC 
CGCGACGGAG 
CTCAAACGGT 
TCGCCCATCT 
GACATGGAGG 
CAGGAGAGCT 
TCGGATTACT 
CTCGACGGGT 
GGG/\ATAGCG 
GATACTACCA 
GACATCACTG 
ATCGAAAGGG 
GAACCCGGTG 
AAGGTGAGCC 
GCCGGCACCA 
TVAGAAGAATC 
TCTGCAGCCG 
GTACAGTGCA 
GC ACT CGAAC 
ACCATCCTGC 
GAAGCGATCA 
GATAAGGCGA 
GCTCCGAAAG 
TCGGCCGT7\A 
ACTCAGCAGC 
GAGACGGTGG 
GAGCGGCTGA 
AAAGTAGTAG 
CTGGGACTTC 
GTAGGCAAGA 
ATGATCAGGG 
GCCCCTCCGG 
JKPJKCCCTATT 
CTGCTCTTAC 
TTCAAGAACA 



CATACGACTT 
GTCTCGGCGT 
ACAATGGCGC 
TGATCGAGTT 
TCACCCCTTC 
ACGAGGCCGT 
TAGTACAAAA 
TCGGACAGCG 
ACCAAGACAA 
GGACAGGCGG 
CCACGACACG 
CCATGGCGGC 
TGATACAGAT 
TAGGCAAGAG 
GTATTCTTTT 
AATATCGCGG 
CGCAGATCAT 
GATCGATGGA 
TCGGAGCCAC 
GCCGCTTCCA 
AAAACATCAA 
AAGCGGCAGT 
TAGATGCCAT 
AAATCGAGAT 
AGGCTCAGAA 
AGATAGCGGA 
ACGAGAATGT 
GCACGGGCGA 
GTCAGGACAC 
GCAATGAAAA 
CCTATTTGGC 
TGGATATGAG 
GATATGTGGG 
CCGTGGTTCT 
AGGTGATGGA 
CCGTGATCAT 



TACACAAACA 
CGTAGCCGTG 
GATCGACATC 
GGAAGCCATC 
GGCTCGGGAG 
CAGCCCGGTC 
GATATTTATG 
CAACCCCTCC 
CGACTTCGAC 
AGGCTCCGGC 
GAGTGGAGGC 
AGCAGGCAAG 
ACTCAGCCGG 
TGCCATCGTG 
CGACAAGCGG 
ACAGTTCGAA 
CCTCTTCATC 
TACGGCCAAT 
TACGCTGGAT 
GAAGGTGCCG 
AGAGAAATAC 
GGAACTGACC 
GGACGAGGCC 
ACTGGAGGCC 
CTACGAACTG 
AGAGAAGAAA 
AGTGGCGCAT 
AGGCGAACGT 
AGCCATCGAA 
GAAACCGATC 
CAAGAAGCTC 
CGAGTATATG 
CTATGAAGAA 
CTTGGATGAG 
CGAAGGTCAG 
CATCACCTCC 



TTCCGCAACA 
ACGCAAGATA 
ATGCGGCACT 
GCCGAGAGTT 
GCTATCGATG 
CATCTGTTGC 
AAACAAGGTA 
GAAGGGAAGT 
GACGAAGAGG 
GACGCCCCCG 
GACACGCCTG 
CTCGACCCGG 
CGCAAAAAGA 
GAAGGACTGG 
ATCATCAGCC 
GAGCGGTTGA 
GACGAGATAC 
ATGCTCAAAC 
GAGTATCGTA 
ATAGCCCCCT 
GAGGACTATC 
GATCGCTATG 
GGCGCGAGCG 
GAATTGGCAT 
GCTGCCTCCT 
AAATGGGAAG 
GTAGTGGCGT 
CTGCGCACGA 
AAGATGGTGC 
GGTTCTTTCC 
GCCGAATACC 
GAGAAGTTCT 
GGCGGCCAAC 
ATCGAAAAGG 
CTGACCGACA 
AACGTGGGTA 



GCCTGGAGTA 
TGCTCGTACT 
ATGGGATCAA 
TGCCTGCTTC 
ATGCCACAGA 
TGAGTATCCT 
TAATVATACGA 
CTCCCTCCGA 
ACGAATCCTC 
AACAGAATAC 
CACTGGACAC 
TAGTGGGTCG 
ACAATCCGGT 
CCGAACGCAT 
TCGATTTGGC 
AAGCCGTGCT 
ATACCATCGT 
CCGCTCTTGC 
AGAACATAGA 
CGACTGCAGA 
ACGGTGTACG 
TATCCGATCG 
TCCATATCAC 
CGGTGCGAGA 
TCCGCGATCA 
AGCAGATGTC 
TGATGACAGG 
TGGCAGATGA 
ATGCCATCCA 
TTTTCCTCGG 
TGTTCGAGGA 
CCGTTTCGCG 
TGACGGAGCG 
CGCATGCCGA 
GTCTGGGACG 
CACGCCAGCT 



CAGCTATCAG 
CGGTATCATT 
CTTGTACGAA 
GCCTGAGGGA 
CATCTGTGCC 
CAACTCGACA 
CACCATCCTG 
AATGGAGATC 
TCCGCCTTCC 
CGGCGGAGGC 
CTTCGGCACC 
GGAGCAGGAG 
GCTCATCGGC 
CGTGAACAGG 
TCAGATGGTA 
CGATGAGCTG 
GGGAGCAGGC 
CCGTGGACAG 
AAAGGACGGA 
AGAAACGCTG 
CTATACGGAC 
TTTCTTCCCA 
CAATGTGGTG 
GAACAAGCTC 
GGAGCGGCGC 
CAAGCACCGC 
CGTTCCGGCT 
TCTCAAGACC 
GCGCAATCGT 
CCCCACGGGG 
TGAGAATGCC 
TCTCGTGGGT 
CGTAAGACGC 
TGTCTTCAAT 
GCGCGTGAAT 
CAAAGACTTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
132 0 
1380 
144 0 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
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10 



20 



GGGCAGGGTA TCGGGTTCCG TTCGGAAAAA GACGAGGA.^\G CGAACAAGGA GCATAGCCGT 2 22 0 

TCCGTGATCC AAAAAGCTCT GAACAAGACG TTCAGCCCCG AATTTCTCAA CCGTTTGGAC 2 2 80 

GATATCATCC TCTTCGACCA ACTGGGCAAG ACGGAGATTC GCCGGATGGT GGACATAGAG 23*J0 

CTTAAAGCCG TCTTGGCGCG CATCCATCGT GCCGGATACG ACCTCGTCCT TACCGATGAA 2 4 00 

GCCAAGGATG TGATAGCGAC GAAGGGATAC GACCTCCaAT ACGGAGCACG ACCGCTCAAG 2^60 

CGCACACTCC AGAACGAAGT GGAGGATCGC CTCACGGATC TTATCCTCTC CGGACAGATC 2520 

GAGAAAGGGC AGACGCTTAC GCTCTCTGCT CGCGATGGCG AGATCATCGT ACAAGAACAA 2580 
GCA 



{2) liJFORIIATION FOR SEQ ID 110:4 0 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH: 1368 base pair^ 
15 (B) TYPE: nucleic acid 

(C) STRAHDEDMESS : double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: MO 
(iv) AIITI -SENSE: NO 



2583 



25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) HAI>]E/?:EY: misc^feature 
30 (B) LOCATION 1,..1368 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 40 

AGAGGAGGAC AAATCCGCCG ACACCATACC GATTCGTCAA GGGGATCGGA CAGCAAAGCC 60 

Sb TGCCGCTTCT CCTTTCATGC CGAGACCATC GGTTTCTCCA ACCACCAAAG AGCAAAAATG 120 

AATTACCTGT ACATACTGAT TACACTTTTA CTCTCCGGCT TTTTTTCCGG TGCTGAGATT 180 

GCTTTCCTTT CTTCGGACAA ACTGCGTCTT GAGTTGGACA GGAATAGAGG CGATCTCACA 24 0 

GGCAGAGCGT TAAACTTGCT GTATCGACAT CCGGACCAA.C TGGTGACTAC TCTCCTTGTG 300 

GGTAATAATA TCGTTTTGGT AGTCTATGGT CTGCTGATGG CGGGATTGCT GGCCGCACCT 360 

4U TTGGCGCAAT GGATTGATAA CGATGCTATG ATCGTCGTTC TCCAATCTGT CTTATCCACT 4 20 

ATCATCATAC TGTTTACCGG GG7U\TTTCTA CCCAAAGCCA TTTTCAAGAC CAATGCCAAT 4 80 

ATGATGATGA GGGTATTCGC CCTCCCTATC GTAGCGATCT ATTATCTGCT TTATCCTCTG 54 0 

TCTAAACTCT TCACCGGTTT ATCTCGCTCT TTTATTCGTC TGGTGGACAA GAATTATGTG 600 

CCTACAACAG TAGGGTTGGG GCGCGTAGAT CTCGATCATT ATTTGGCAGA AAATATGTCC 660 

45 GGAGAAAACG AACAGAACGA CTTGACTACC GAAGTGAAAA TCATCCAGAA TGCGCTGGAT 7 20 

TTTTCCGGTA TTCAGGTGCG AGACTGCATG ATCCCACGCA ATGAGATGAT AGCATGTGAG 7 80 

TTGCAAACGG ATATTGAAGT ACTCAAAACG ACTTTTATCG ATACCGGTTT GTCCAAGATC 84 0 

ATTATCTACA GACAG7U\CAT AGATGACGTA GTAGGATATA TCCATTCGAG CGAAATGTTT 900 

CGTGGGCAAG ACTGGCAAAA ACGTATCAAT ACTACTGTAT TCGTACCCGA AAGCATGTAT 960 

5U GCC/VATAAAC TGATGCGACT ACTCATGCAG CGCAAGAAAA GCATTGCGAT CGTCATCGAT 102 0 

GAACTTGGAG GTACGGCCGG AATGGTCACA TTAGAGGATT TGGTAGAAGA GATTTTCGGT 1080 

GACATTGAGG ACGAACACGA CACTCGCAAG ATCATAGCCA AACAGCTCGG CCCTCATACC 114 0 

TATCTGGTCA GTGGTCGTAT GGAAATAGAT GATGTGAACG AACGTTTTGG GTTGTCCTTG 12 00 

CCTGAGTCTG ACGACTACCT TACCGTGGCC GGATTTATCC TGAATAGCCA TCAAAATATC 12 60 

0 0 CCACAGGCCA ATGAGGTCGT GGAGATTGCT CCTTATACTT TTACCATTCT CAGATCTTCT 132 0 

TCCACCAAGA TCGAACTGGT GAAAATGTCC ATCGACGACC AATCGAAC 1368 



60 



(2) INFORMATION FOR SEQ ID NO: 41 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 
65 (D) TOPOLOGY: circular 



70 



75 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: IIO 

( iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 
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(ix) FEATURE: 

(A) NW1E/KEY: mi sc_f eatur e 
(BJ LOCATION 1 ... 897 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 

TTAAAAAAGG AGATAACTAT GAAACAGAAC TACTTCAAAA GAGTCTGCTC ACTGCTTTGG 60 

CTGGTTTTAC CCATGCTTAT TATGCCATTG GAAGTAGCAG CTCAAGAGAT TATTCCGAAC 120 

GAAGAGGTGT TGGAATCATT GACTTTCGTT GCACCGGTTG AGGAGACAGA CGCAATAGAG 180 

U GCAGAGGTAG AAGCTCTGCA GGAGATAGTC GCTACTGAGG AGATTGCGGA GCAGGCTGTT 240 

CGTTCTTATA CCTACACGGT CTATCGTGAT GGCGTGAAGA TTGCTTCAGG ATTGACTGAG 300 

CCCACTTTTC TCGATGAAGA TGTTCCTGCC GGCGAACATA CCTACTGCGT AGAAGTACAG 360 

TATCAGGGAG GCGTATCCGA CAAAGTATGC GTGGACGTAG AGGTGAAGGA CTTCAAACCG 4 20 

GTTACCAATC TCACCGGAAC TGCTTCCAAT GACGAAGTTT CTTTGGACTG GGACGGTGTG 4 80 

5 GAAGAGAAAG CTG AAGAGCC GGCAAGTGAT AAAGCAGTCA GCTACAACGT CTACAAGAAT 54 0 

GGAACCTTGA TCGGTAATAC AGCTGAAACT CATTATGTGG AGACCGGTGT AGCCAATGGT 600 

ACATACATCT ACGAAGTGGA AGTAAAGTAT CCTGACGGTG TATCTCCGAA GGTGGCTGTA 660 

ACCGTGACCG TGACCAACAG CTCATTGAGC AATGTAGATG GACAGGCTCC TTACACATTG 7 20 

CGAGTAGAAG GCAAGAAGAT TATTGCGGAA GCCCATGGTA TGATCACGCT CTACGACATC 7 80 

0 AACGGACGTA CCGTGGCCGT AGCCCCGAAT CGATTGGAAT ACATGGCGCA AACCGGTTTC 84 0 

TATGCAGTGC GCTTCGATGT GGGGAATAAA CACCATGTAT CGAAAATACA AGTAAGA 8 97 



(2) INFORMATION FOR SEQ ID NO: 4 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1131 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS: double 
(D) TOPOLOGY: circular 

Ui ) MOLECULE TYPE: DllA (genomic) 

(iii) HYPOTHETICAL: HO 

(iv) AI'iTI -SENSE: NO 



40 



45 



(vi) ORIGINAL SOURCE: 

{A) ORGAl'IISM: PORYPHYROMONAS GIl-IGIVALIS 

(ix) FEATURE: 

(A) NTMIE/KEY: misc_feature 

(B) LOCATIOtJ 1 . . .1131 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 



50 



55 



60 



65 



TTGTTTAATC 
TCTGGCTTTA 
CCTGTGTCGG 
CCGTATGACA 
TGGAAGACCA 
GGACAGAGTG 
CCGGACAATT 
ATCTATTGGG 
TCTTCTACAG 
GCCAAACGGA 
CAAAGAAAGG 
ACGGATAATT 
AGAGCTCCGT 
ATAGCGAGTG 
GACTATTGTG 
GTGGTGGCAA 
GGCAAGACCA 
CGGCTGATAG 
ATTAAAATAC 



ATAAAAAATC 
TTCTTTTGTG 
ATTTACGTGC 
ATCCGATGAT 
TTGACGCAGA 
GTCTCTGTGT 
ATCTGATAAC 
TATGTACTCA 
GCAATAATGC 
TACAATCCCC 
TGGTACTCCC 
TCTGGCTCAA 
GTCCGCATCC 
GATTGTCGGC 
TCCAAGTCAA 
ATTCTGCAAA 
TTGTAGCGAG 
CTTCCGGCTG 
AGGTAAACGG 



ATGGTATG7\A 
TGAAAATACC 
AGAAGCGTAC 
TCCTCTAAGC 
TGGCGATGGC 
CTCTTCGGCT 
ACCCGAATTA 
AGATCTCACT 
TGCTGACTTT 
CGAGTTGATC 
TAACGATACT 
TTTGGATGAA 
GGGTGGTTAC 
ATTGGCATAT 
TTATCTGCAA 
CATCTATGGG 
TGCTTTCAAA 
CGATACGCTT 
AACTGTCTAT 



ACATTTCAAT 
CTTGCAC7VAC 
GGCTCTACCG 
GAGAGTTTTG 
TATAATTGGA 
TCATACATAG 
AAACTACCCA 
GCTCCATCGG 
GTTAATCTCT 
CGCGG7W\TC 
AAATATGTTG 
GTATCTATCC 
ACTTATTCTG 
ATCGATACGG 
GGAGACTCGT 
GCGGATAAGC 
GGAGAGATCA 
AGGTACAAAG 
ACTGAGAAAA 



TTTATCTCGT 
AAAAAACAGA 
TTTTCCTCCA 
AATCAGGTAT 
TGCATTTGAC 
GCGGCGTCGG 
CAGACGCGTT 
AGCACTATGC 
TATATGAAGA 
GGACACAAGG 
CTTTCCGCCA 
TGTATACCCC 
TATTCCGTGA 
ATGTACCGTA 
ATAAAGTCTG 
CTTTTGCGTT 
CTCTTTATGA 
CGGAA7\ATGG 
TCCAAATCCA 



TGTTTTCCGC 
GGAGTTTGCA 
CTGGACTCCG 
TCCAGCTATA 
CAATTTCACG 
AGCTTTGACT 
GGTGGAAATA 
CGTTTATTCC 
GACTTTGACT 
TGTTTGGTAT 
TTTTAATTCC 
TCTTCCCCGA 
TGGACAAAAG 
TGGGACTCAA 
CAAAAATATA 
GACCGTGGTT 
CATTCGTGGC 
TTTTTACCTC 
A 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1131 



(2) INFORflATION FOR SEQ ID NO: 4 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2547 base pairs 
(D) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 

(D) TOPOLOGY: circular 
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(ii) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: NO 
5 (iv) AMTI -SENSE: KO 

(vi) ORIGIUAL SOURCE: 

(A) ORGAJIISM: PORY PH i ROMOMAS GIMGIVALIS 

10 (ix) FEATURE: 

{A) IIAIIE/VXY: misc_feature 

(B) LOCATION 1 . . .254 7 



PCT/AU98/01023 



15 



60 



70 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 43 



TTCGGAATAT CACCTTCAAT GAAAAAAAGT TTTCTTTTAG CCATAGTAAT GCTCTTTGGC 60 

ATTGCCATGC AGGGACATTC TGCTCCGGTT ACGAAAGAGC GAGCTTTGAG TCTGGCTCGG 120 
CTGGCTTTGC GACAGGTATC CTTGCGAATG GGACTWVCAG CAGTATCTGA CAAGATTTCC 180 

ATCGATTACG TTTATCGGCA AGGAGATGCT GAGAGGGGTA TCACATCACA AGAGGAAGGC 24 0 

20 TCTCCTGCAT ATTTTTATGT AGCTAATCGT GG7W\TAATG AGGGCTATGC TCTTGTAGCA 300 

GCAGATGACA GAATACCGAC AATTTTAGCC TATTCACCCA TTGGCCGTTT CGACATGGAC 360 
AGTATGCCGG ACAATCTTCG CATGTGGCTA CAAATTTACG ATCAGGAAAT AGGCCTGATA 4 20 

CTTTCCGGAA AAGCTCAGCT CAATGAAGAG ATATTACGTA CCGAGGGCGT ACCGGCTGAA 4 80 

GTACATGCTC TGATGGATAA CGGTCATTTT GCCAACGATC CCATGCGATG GAATCAAGGT 54 0 

25 TACCCATGGA ACAATAAGGA ACCACTGCTT CCTAATGGCA ATCATGCCTA TACCGGCTGT 600 
GTTGCTACTG CTGCAGCACA AATCATGCGC TACCATAGCT GGCCGCTTCA AGGTGAAGGC 660 

TCTTTCGATT ATCATGCAGG TTCATTAGTT GGCAACTGGT CCGGCACATT TGGTGAAATG "7 2 0 

TACGACTGGA TCAATATGCC CGGAAATCCC GACCTTGATA ATCTGACTCA ATCTC7\AGTG 7 80 

GATGCCTACG CCACACTGAT GCGTGATGTG AGTGCATCTG TTTCGATGAG TTTTTATGAA 84 0 

30 AATGGAAGTG GTACGTACAG CGTTTATGTA GTAGGAGCCT TGCGAAACAA CTTTCGCTAC 900 
AAGCGTTCAC TGCAGCTACA TGTACGCGCC TTATATACCT CACAGGAGTG GCACGATATG 960 

ATCCGCGGGG AACTTGCCTC CGGAAGGCCG GTCTATTATG CAGGGAATAA CCAGAGCATA 1020 

GGACATGCTT TCGTTTGCGA TGGTTATGCT TCGGATGGTA CTTTCCATTT CAACTGGGGT 1080 

TGGGGAGGTG TTTCCAACGG CTTCTACAAA CTAACACTCC TCTCGCCGAC -TTCGTTGGGT 114 0 

35 ATCGGAGGTG AGGGAATAGG TTTTACCATT TATCAAGAGA TCATCACCGG TATCGAACCG 1200 

GCTAAGACTC CCGCTGAAGC CGGTACAGAT GCCTTGCCGA TCTTGGCACT GAAAGACATA 1260 

GAAGCCGAGT ATAAAAGTGA ATCCGGATTG AACGTAGGGT ATTCGATATA TAATACAGGT 1320 

GAAGAGCAAT CAAATCTTGA CCTCGGATAC AGATTGAACA AGGCTGACGG AGAAGTCATA 1380 

GAGGTGAAAA CTTCATCTAT CAATATCTCT TGGTACGGAT ACGGAGAGCA TCCCGAGAGT 14 40 

40 TTCTCATTGG CACCTAATCA GTTGTCACAA GGAATCAACA CCATCACCCT ACTTTATCGT 1500 

CGCACAGGCA CCGAACAGTG GGAGCCGGTA CGGCATGCAC AGGGAGGATA TGTCAATAGC 1560 

ATTAAAGTAA ATACGACAGA CCCGAACAAT GTCGTAGTCA CGGTAGATAA TAACGAAGGC 1620 

7\AGCTCAGTA TCGTCCCCAA CAGCTTTGTC GCAGATCTGA ATTCTTATGA ACATAGTACG 1680 

ATTACAGTAC AGTTCAATAG CGACAGCCCT GATGAGATCC GTACACCCGT AGCCTTTGCT 174 0 

45 CTATCTACAG GAGCTACTGC GGACGATGTA ATATCTTTGG GCTGGGTAAT GGCTGAAGTT 1800 

CCGGGCGGTA GCAGCAACTA TCCGGTGGTT TGGTCTAAAG ACGTTCTCAC TCTCTCGGAA 18 60 

GGCGACTATA CATTGTGGTA TAGATTTTCC ATCAACAACC AAAAGGATGA ATGGAAAAAG 1920 

ATCGGAAGCG TGTCAGTAAA AACACCGACA GAGTATACGC ACCCCTTATT CGAAGTGGGC 198 0 

CATAATCAAA CTTCTACCTA TACGCTGGAT ATGGCACACA ACAGAGTATT GCCCGACTTT 204 0 

50 ACACTCAAAA ATCTCGGATT GCCTTTCAAT GGTGAGTTGG TTGTTGTTTT CCGCCAAACA 2100 

CAATCCTCAT CGGGGTCTTT ATGGGCAGCT CAAGAAACAG TACATATCAA GCAAGGAGAA 2160 

ACTTTCGTAT ATAAACCTGT TGTCGAAGGC CCTATACCTG ATGGATCCTA TCGTGCGACC 2220 

CTCCATGCAT TCGTAAACGG ACAACAACAG TTGTACCTCA AGGGGAAAAG GAACTACACG 2280 

GTGAAGATCG TCAATGGTAC AGCGGTAGAA GCAATAGAAT CGTCAGAAGA GATCAGAGTA 2340 

55 TTCCCTAATC CGGCACGCGA TTATGTGGAA ATATCGGCAC CTTGCATTCC CCAAGAAACA 2 4 00 

TCTATCATTC TTTTCGATCT GTCAGGCAAG ATTGTCATGA AGAATAGTTT ATCAGCGGGG 24 60 

CATGGCAGAA TGGATGTCAG CCGACTTCCT AATGGGGCCT ACATCCTTAA GGTGGATGGA 2520 

TATACGACGA AAATAAATAT AGTGCAC 2 54 7 



(2) ItlEORMATIOM FOR SEQ ID HO: 4 4 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 885 base pairs 
65 (D) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : ci rcu 1 a r 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: HO 

(iv) ANT I -SENSE: HO 

75 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: PORY PHYROMOIIAS GIMGIVALIS 

(ix) FEATURE: 

(A) NAIIE/KEY: misc^feature 

(B) LOCATION 1. . .885 

(xi) SEQUENCE DESCRIPTION: SEg ID HO: 44 

ACGAAGAAAA CACTGATGAA AAAGCTATTT CTCTCGCTCA CGAGTCTTGT AATGGTCTTC 
GCTGTTGCAA GTTGCGATAT AATCGACAAG GATCAAACCC TCTTGCCGGC TCCGACC7\AT 
GTGACACCCG ATAATCCGGA TGACAATCCT TCGGAGATCG ACATTACGCA GACGCACACA 
GAAAAATATG TTTTGGCTGA AGAATTTACC GGCCAAAAAT GTCTCAACTG TCCGAAAGGT 
CATCGCAAAC TGGCGGCTCT CAAGGAGCAA TACGGTAAGA GATTGACTGT TGTCGGTATA 
CATGCCGGCC CTGGATCTCT CGTGCCACCT CTTTTCCGTA CAGAAGCCGG AGACGCATAT 
TATAGCAAGT TCGCCAATAA TACCCCTCTC CCTGCGCTGA TGGTTTCGCG CAAAAAGTTC 
GGCTCTTCCT ACGTTTATGA TAAGAGCTAC AAAACGTGGG ACGTGCCTAT TGCCGAGCAG 
ATGGAGCAAA AGGCGAAGAT CAATATCTTT GCCGTGGCCG AATACACCGA TACCCAAAAG 
ATCAAGGTGA CTGTAAAGGG TAAAATACTG GAGGGGAATA CACTCCCGAA GTCCATGGTT 
CAGGTGTATC TGTTGGAGGA TAAGCTGATC GCTCCGCAGG TGGATGGCAA TACGACAGTC 
GAGAATTACG AGCACAATCA CGTGTTGCGT GGAGCCGTTA ATGGTATTTG GGGCGAAGAA 
TTTGTGAATC TCAAAGATTA TTTGTATACT TACGCCGTTG AACCGCTCTC GGGTATGTCC 
TTCGTAGCCG AGAATTATTC GATTGTGGCT TTTGTATACG ATGTGCAGAC GTTCGAAGTG 
TATGACGTTG TGCATGTAAA GATCAATCCG CAATCCGATG GCAAA 



(2) INFORMATION FOR SEQ ID NO: 4 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH: 690 base pairs 

(B) TYPE: nucleic acid 

(C) STRArJDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: MO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAI'iE/KEY: misc_feature 
{B) LOCATION 1 ... 690 

(Xi) SEQUENCE DESCRIPTION: SEQ ID l\0:4 5 

ACGAATAAAA AAGAAGAGAC AATGAAGAAA TCAAGTGTAG TAGCCTCAGT TTTGGCCGTG 
GCTCTCGTGT TCGCCGGTTG CGGACTGAAC AATATGGCAA AAGGCGGCCT TATCGGCGCC 
GGAGTAGGAG GTGCCATTGG TGCCGGAGTA GGTAACGTAG CCGGAAATAC GGCTGTCGGT 
GCCATCGTCG GTACTGCAGT CGGTGGAGCA GCCGGTGCTC TCATCGGAAA GAAGATGGAC 
AAGCAGAAAA AAGAACTGGA GGCCGCAGTA CCCGATGCTA CGATTCAGAC AGTAAATGAC 
GGAGAGGCTA TTCTGGTTAC TTTCGATAGC GGTATCCTCT TTGCGACGAA CTCCAGCACT 
CTGAGTCCCA ACTCACGCAC TGCGCTGACG AAGTTTGCTG CAAACATGAA CAAAAACCCC 
GACACGGATA TTCGTATCGT AGGCCATACG GACAATACCG GCTCCGACAA GATCAACGAT 
CCTCTGTCTG AGAGACGTGC AGCCAGCGTA TATTCTTTCC TGAATTCTCA GGGTGTGAGT 
ATGTCGCGCA TGGCAGCCGA AGGGCGTGGG AGCCATGAAC CGGTTGCAGA CAATAGCACA 
GTTGCCGGAC GTTCGGCCAA CCGCCGTGTG GAGGTTTATA TCTTGCCGAA TGCCAAGATG 
ATCGAACAAG CACAGCAAGG TACGCTGAAG 



(2) INFORMATION FOR SEQ ID NO: 46 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1026 base pair^ 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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10 



30 



40 



65 



(vi) ORIGIMAL SOURCE: 

(A) ORGANISM: PORY PHYROMOMAS GIIIGIVALIS 

(ix) FEATURE: 

(A) NAI'IE/KEY: misc_f eature 

(B) LOCATIOM 1 . . .1026 

(xi) SEQUENCE DESCRIPTION: SEQ ID WO: 4 6 



7\ACAGGAACA GAAATATGTC GAAAAAATCG ATCCTTCTGC TTTGCTGTTC GCTGTGCTTC 60 

ATTTCTGCTA CGAAGGCTGT GACCCCCGTC AGAAATGTGC GCAATAGCCA AGTGAACAGC 120 

AAAGCAAAGA CCGAACGTAC AAAGCCCTCG GACTCTGTAC GGTACATTAG CAACATGATT 180 

GCAGATCGGC TGGAGTTCCG CAACAAGATT TCTTCCGAAA AAGAGGTAAG AAAAGCCGAA 24 0 

15 TATGAAAATC GGCTGGCGAT GGAAGCACTC AATTACCCTG CCATAGATTT ATATGGTGAA 300 

GATTCTTGGA GCGAGTATGT AAACCCTTTC GTGGGTGCAG GAACCGATGT CGAAATTCCG 360 

AACTCCTATG ACATTGATTG CTCTTCGTTC GTGATGCCCG TCGAAGATAA GCAGGTCACC A 20 

TCTC7\ATTTG GCTACCGTCG GCGTTTCGGA CGGATGCACT ATGGTATTGA TCTTTCAGTG 4 80 

AATCGTGGCG ATACGATACG AGCAGCCTTT GACGGGAAAG TTCGTGTACG CAGCTATGAA 54 0 

20 GCGCGTGGCT ATGGCTACTA CATAGTCTTG CGCCATCCGA ACGGACTGGA GACTGTGTAC 600 

GGACACATGA GTCGCCAATT GGTAGACGAG AATCAGATCG TTCGAGCAGG ACAACCGATC 660 

GGATTAGGAG GCAGCACGGG TCGAAGCACC GGTCCTCATC TTCACTTCGA GACCCGCTTC 7 20 

ATGGGTATTC CCATCAATCC GAGTACCATT ATAGACTTCG ATAACGGAGT GCCGCTCCGA 7 80 

GACATTTACA CATTCAAACG AGGGAGCAAT TCTCGCTATG CAAAAGCCTC TAAGACTTCT 84 0 

25 TCTCGCTATG CAAAAAAiAGG GAAGAAAGGC AGACAAGCTT CTTCTCCTAT GACCTATAGA 900 

ATCAAAAAAG GCGATACTTT GGAAACAATA GCCAAAAGGC ACGGCACTTC TGTTCAGAAA 960 

CTCTGTGCTA CCAATGGCAT TGGCAAGAGT AAAATTTTGA CTCCGGGCAA AGCCTTGAGG 102 0 

ATCAAA 102 6 



(2) INFORMATION FOR SEQ ID NO: 47 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) AMTI-SEIISE: NO 



45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GIIIGIVALIS 

(ix) FEATURE: 

(A) NAI'lE/KEY: mi sc_f eat ure 
50 (B) LOCATION 1...4 77 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 

CCGAGCAAAA CGATAATTAA GACAATGGCA AAAATCAATT TCTATGCTGA AGGCGTCAGC 6 0 

55 CTTCCTCGGA TCAGAAGACG GATCGTCGGT AAGTGGATAG CCGAAGTATG CAGCCGATAT 12 0 

GGGAAAGCGG TGGGAGAAAT CTCCTATCTT TTCTGTGATG ACGAATATAT CCTGAAAGCC 18 0 

AATCAGGAAT TTCTCGATCA TGACTACTAC ACCGACATCA TCACCTTCGA TTCCTGCGAA 24 0 

GCGGATACGG TGAATGGCGA CCTGCTTATC AGTCTCGATA CCGTACGCTC GAATGCCCGT 300 

GCTCTTGATC TTCGATACGA AGACGAACTG CATCGTGTCA TTATCCACGG CATACTGCAT 360 

60 CTTTGCGGAT TGAAAGACAA GAGCAAAAAG GATGAAGCCC AAATGCGTGC AGCCGAAGAG ^20 

A7VAGCCCTTG TCATGCTGCG AGAAACCATC GGATCGGAGC TTTCCCTATT GCATACA 47 7 



(2) INFORMATION FOR SEQ ID NO : 4 8 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
70 (D) TOPOLOGY: circular 



75 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) AllTI-SEMSE: MO 



{vil ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHVROl^lONAS GIMGIVALIS 

5 

(ix) FEATURE: 

(A) NAI'IE/KEY: mi£c_feature 

(B) LOCATION I ... 1185 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 8 



AAAAGTAAAA CTATGAAGGT AAAGTACTTA 
AACGCAAGTG CACAGGAGAA TACTGTACCG 
GCTTTTGCTC GCAATAAAGC AGGCAGCAAT 

15 GCGCAGTTCC TCAATGACAA CAACAACAAA 

TCTCTTTCTG TCGGAAAGTA TCACAGCCCT 
GGTCAAGCCC ACACTTTCCT CGGAAAAAAT 
GCAGCTCACT TCGACTTTAT GTTTGATGTG 
CGTTTCTTCC ATTTAATTCC ATGGGTAGGT 

20 GAATGGAGCA AAGACAATGT GGAATCACTG 
AGATTAGGAA AGCGAGTAGA CTTTGTGATC 
CTAAGTCGCG CATACAATGC CAiAGAAAACT 
TACAATGGAT TCCAGGGGAT GGCTACAGCA 
TTCAATGCCA TTGAIICCAAT GGACTACGCA 

25 CGTTTGCGCA GCGAGGTCGA AGTVACTCTCA 

GAAGTAACTC CTGTTACTAA GACAGAAAAT 
TTCGACAGCC ACGTTGTGGA CAAAGATCAA 
GTAAAAGAAA CTAACGAGCC GATTACCGTT 
CAATACAACG AGAAATTGTC TGAGCGTCGG 

30 A7\ATATGGTG TGCCTTCCGA ATTAATCTCT 
TTCAGCAAGA AAGCTTGGAA TCGTGTTGTA 



ATGCTCACAT TGGTTGGAGC AATTGCACTG 60 

GC7U\CGGGTC AGTTACCCGC TAAGAATGTT 120 

TGGTTTGTAA CACTGCAAGG CGGTGTTGCA 180 

GACCTCATGG ACCGCTTAGG AGCCATAGGT 240 

TTCTTTGCAA CTCGTTTGCA AATTAACGGA 300 

GGCGAACAAG AAATCAACAC CAATTTTGGT 360 

GTTT^CTACT TTGCACCATA TCGCGAAAAT ^20 

GTTGGCTACC AACACAAATT CATCGGTAGC ^80 

ACGGCGAATG TAGGAGTTAT GATGGCTTTC 54 0 

GAAGCACAAG CAGCTCACTC CAATCTCAAT 600 

CCCGTATTCG AAGATCCCGC AGGACGTTAT 660 

GGTCTTAATT TCCGCCTGGG AGCCGTAGGC 72 0 

CTTATCAATG ATCTGAATGG TCAGATTAAC 7 80 

AAACGTCCTG TATCATGCCC CGAATGTCCT 84 0 

ATACTGACGG AAAAAGCTGT ACTGTTCCGT 900 

TTGATCAACC TGTATGACGT AGCTCAGTTT 960 

GTTGGTTATG CTGATCCTAC GGGTAATACT 1020 

GCTAAAGCCG TTGTTGATGT TCTGACAGGT 1080 

GTAGAATGGA AGGGCGACTC TACGCAACCG 114 0 

ATCGTTCGCT CCAAG 118 5 



35 



{2) INFORMATION FOR SEQ ID NO: 4 9 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1161 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : double 
40 (D) TOPOLOGY: circular 



45 



50 



65 



70 



75 



(ii) IIOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: MO 

(vi) ORIGINAL SOURCE: 

{A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



(ix) FEATURE: 

(A) NAI'IE/KEY: misc_feature 

(B) LOCATION 1 . . .1161 

55 (xi) SEQUEIJCE DESCRIPTION: SEQ ID NO: 4 9 



60 



TATTAGCACT 


TGCGGGTCTC 


60 


AGAACAAAGC 


AGGGATGCAC 


120 


TTGACATTGC 


AGGTGGAGCA 


180 


TAGATCGTCT 


AAGTATCGTT 


240 


GTACTCGTCT 


CC7VATTCACA 


3 00 


GTAACCACAA 


TTACTTTGGA 


360 


TCGGTGTATA 


CCGTCCCAAT 


420 


GTTATAAATT 


CCATAGCGAA 


4 80 


CCGGAACAGT 


TAATGTCGGT 


540 


ATATTGAAGG 


ACAAGCTTTT 


600 


CAGACTTCCC 


TGTAATGGCT 


660 


CAGAAATTGT 


TCCTATGGAC 


720 


TTCGCGGTCA 


AGTGGAAGAG 


780 


CTACACAGCC 


TACAGTTACT 


840 


GTGCAAAGAT 


TGATCGTAAT 


900 


CCAACAACGC 


ACCGATCAAG 


960 


ATAACATGAA 


GCTTTCAGAG 


1020 


GTGTTTCTGC 


GGATCGCATT 


1080 


AGAACGCTTG 


GAATCGTATT 


1140 
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GTAGT/KATGA CTGCAGCGGA A 1161 
(2) XIJFORIIATION FOR SEQ ID MO: 50 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 585 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIfDEDHESS : double 
10 (D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DUA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: HO 



15 



20 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .585 

25 {>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 50 

GTAATTGGCA TCATTATGGA ATTTTTCATG TTATTCATAG CGGCGGTTTT CGTTAATAAC 60 

GTCGTGCTGT CGCAGTTCCT CGGTATATGC CCATTCTTAG GCGTATCGAA GAAGGTAGAC 120 

ACCTCAATCG GTATGGGTGC AGCCGTGACA TTCGTATTGG CACTGGCTAC CTTGGTTACC 180 

30 TTCCTGATTC AGAAGTTCGT TTTGGATCGT TTCGGATTGG GCTTTATGCA GACCATTGCA 2-10 

TTTATTTTGG TCATTGCCGC CTTGGTGCAG ATGGTGGAGA TCATACTCAA GAAAGTATCT 300 

CCTCCCCTCT ATCAGGCACT GGGTGTATTC TTGCCCTTGA TTACGACGAA CTGCTGTGTG 360 

CTCGGTGTGG CTATTTTGGT TATCCAGAAG GATTATACCC TGCTCCAGAG CTTCGTCTAT 4 20 

GCAATATCCA CGGCTATCGG TTTCACCTTG GCAATGGTTA CTTTCGCAGG TATTCGAGAG 4 80 

o5 CAACTCGATA TGACCAATCT CCCCAAAGCT ATGAAGGGAA TACCTTCGGC ACTCTTGGCT 540 
GCCGGTATAT TGGCTATGGC TTTCATGGGC TTCAGCGGTA TCGCC 585 



40 



50 



55 



60 



(2) INFORTIATION FOR SEQ ID NO: 51 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2628 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS: double 

45 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTl -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY: misc_f ea ture 

(B) LOCATION 1...2 628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 



TACCGATCTT ATCGTGGGAT AGGGAGTGGG ACACACTCTC CTAACCTCAA AAACCGACTA 60 

AAAAGGATCG GAATAAGGAT ACCGAACAGA CACTATATCC ATATCAAGCC AATCAAACCA 120 

AA.AAATAAA.A TGAAACAACT AAACATTATC AGCTTCATCA TTGCTTTCCT ATTCTTAGGA 180 

bo ACGAGCGCAT CGGCTCAGCA ATCGGGCGGA TCCGTTACAG GTACCGTAGT GGACAAAAGC 2^0 

TCAAAAGAAC CTATCGCATA CGTACAAGTA TTCGTCAAAG GAACCACTCT CGGAACTTCC 300 

ACGGATGCAA ACGGAAACTA CTCGATCAAG GGAATCCCTT CGGGTAATCA AACTATCGTA 360 

GCCCGACTCA TGGGTTACTC CACTTGCGAA GAAAAAGTAC ATATAGAAAA GGGTGGTTCC 420 

CGCCACGTAG ACCTCTATCT GACCGAAGAG ATTCTCTCTC TCGATGGGGT AGTGGTATCT 4 80 

70 GCCAATAGAA ACGAGACTTT CCGCCGTCAA GCACCCTCGT TGGTAACGGT ACTGTCGCCG 54 0 

GAACTTTTCC TCAAAACCAA CTCTACCAAC CTGAGTCAGG GACTTAAGTT CChGCCCGGT 600 

CTGCGCGTGG AGGACAACTG TCAGAACTGC GGTTTCAACC TVAGTTCGTAT CAATGGACTC 660 

GAAGGAGCCT ATTCGCAAAT TCTTATCGAC AGCCATCCCA TCTTCAGTTC GCTTGCCGGT 720 

GTCTATGGCT TGGAGCAGAT GCCTGCCAAT ATGATCGAAC GTGTAGAAGT AATTCGCGGT 780 

/O GGAGGTTCGG CTCTGTTCGG CTCTAATGCT GTGGGAGGCG TTATCAACGT AATTACGAAA 84 0 
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GAACCGCTTC GCAATTCGGC CGAGATCAGC 
GGGTGGGGGA GCTTCCAAAA TACGACCCAG 
AAAGCCGGTG TCATGGTATT CGGCCAACAC 
GACAACTTTA CCGAACTACC CAATCTGCGC 
5 AAGACCGGTC TCTACAGCAA AGCAACCCTC 
GGTGGCGACA GACTGGACAA TCCTCCTTTC 
TATATCAATG GCGGAAGTTT CAAATTCGAT 
AGTCTGTATG CTTCAGCACA AGACGTTCAG 
ACCGAAAATC TGCTGAACGG AGCAGTTCAG 

10 GATGCTTTCA CGGCTCTTAC TTCCTACGGG 
GGTATGTACC GTCATACCTT CGGAGAAAAC 
TACGGCCAAC TCGATGACAG AAGCGGCTAC 
ACTTTTAGTC AGTACGACCA GCTCGAATAT 
GCACGTATCG ACTATGTTCT CCTCAATCAG 

15 ATTTTCAGTC CTAGAGCCAA CGTACGATAC 

TCATACAGCG AAGGATTCCG CGCTCCTCAG 
GCCGGTGGTA CTCCTATCAG CCGTGTCCTT 
AGCATCAGTG CTTCTTTCGA TTATTACCAC 
GGAGAAGCCT TCTCCACCTT TATCAGCAAT 

20 ACGAGCGATG GCAAAGAATG GATCATTCGT 
AAGGTATATG GTGTGAATCT GGAGGGAAGA 
CTCGGCGGTA CATGGCAGAG AAGCCGCTAC 
AAAACAACGG GACAAGCCGA GATCTCTGTG 
GGCTATTTCG TTGCTACGGT ACGTCCTACC 

25 TTCACGGGCA AAATGGATGT AGTACACGAA 

ATAGCTCCGG ACGGATCGTT CGACTTTGAA 
GAAGGTCATG CCAAGCTCGT CAAGACTCCG 
CACGACTTCC ACCTTGCTTC CACTATGACC 
TTCAACAGCT ATCAGAAAGA CACGGAC7\AG 

30 GGTCCTATGC AGCCCAGAAG GATTTTCGTC 



CATTCTACGA TGACCTTCGA CCACGCGAAA 900 
TTCAACGGTT CTATGCTGAC GGAAGACCGC 960 

AACTACCGTC CCGGACAGGA TATAGACGGC 1020 

AACCGCTCGC TCGGTTTCCG CTCATACTAT 1080 

GAATATCACA GCATGCAGGA GTACCGTCGT ll*iO 

GAAGCCCAGA TAGCGGAATA TCTCCAGCAC 1200 

CAGGGCTTCA GCGGTGGCAA GGATTTCTTC 12 60 

CGTCGTAGCT ACTACGGGGG TGGCGACTAT 132 0 

AGTGGAAGCA CCGAATCGGA CGAATACAAC 1380 

ACTACCAAGG GATTCGATTT GCAAGGAGGA 1 4 'J 0 

TGGGACTTTA CCGGCGGACT CGAATATATC 1500 

AGACCGAGCA AAATAGATCA GAATACCTCT 15 60 

AAGACGGAGA AGTTAAGTGC CCTTATCGGA 162 0 

GATGGCAAAC GCTATATCGA TCCGCTCTTC 1680 

AATCCCAATA AGAATCTCAG CTTCCGACTC 17 40 

TATTTCGATG AAGATCTGCA CGTAGAGTTG 18 00 

TCCCCCAATC TGAAAGAAGA ACGTTCACGA 18 60 

AGAGCCGACG AATGGCAATT CAATATCATG 1920 

CAGTTCAAAC CATCCGATAA GGTCGAAACC 1980 

ACCATCTACA ACGACAAGGA TGGAGTATCG 2 04 0 

ATCGCCTACA ACAAATCGTT CGACCTCCAG 2100 

GGAAGCATCT ATACCGCTGT GGAAGCGGAC 2160 

AAAGACTATG TACGCACTCC GAATCTGTAC 2220 

GAGCACTTCG CCATCAATCT CTCCGGTACA 22 80 

GCCTATGAAG GCGATATTCC CGCAGAACAC 2 34 0 

ATGAATGGTC AGCAATTCAA AGGTTTGGCC 2 4 00 

GCCTTCGCCG ATATAGACCT CAAGCTGAGC 2 4 60 

TTGGAATTGA ATGCCGGAAT ACAGAACATA 2520 

GGACCGGGTA GAGCTTCTAC TTACGTATAC 2 58 0 

GGTACAAAGA TCAATTTC 2 628 



(2) INFORMATION FOR SEQ ID MO: 52 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2697 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: circular 

40 

(ii) I'lOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

45 (iv) AIITI -SENSE: NO 

( V i ) OKI G 1 N AL SOURCE : 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

50 (Ix) FEATURE: 

(A) NAllE/KEY: mi sc_f eat ure 

(B) LOCATION l..,2697 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52 



ATATTGAATC ATTTGAGAAA GACTATGTAC AAAAAGATTA TTGCCGTAGC AGCTCTCTTC 60 
TGCGCCAGCA TAGGGATCCT GAAAGGACAG TCCTCGGATC TGACCCCTCA GGATACTATA 120 
TATAGCCCTG AAATATCCTA TGCCT^AGCCT ATTCATAAGA CCATAGCATC TATTGAGATC 180 
GAGGGAATGA GGTCTTTCGA TGACTTTGTC TTGCGCAATC TTTCAGGCTT GGCTGTAGGT 24 0 

bU GATGAAGTCC TGATTCCTGG AGATGCCATG TCTGCTGCCG TGAATAGAAT TATGCGTCAG 300 
GGCTACTTCT CAAATGTGCG AATCATCGCG GATAAATATG TCGGCAATAA AGTCTATCTG 360 
AAAATCATTG TCACTGAACG TCCTCGCATC AGTAAGGTTA CTTTTAGCGG GGTAAAGAAG 4 20 

TCTGAGAGAG AAGATCTTGA AATGAAAATC GGTCTTCGCG AGGGGATTCA GATGACCAGA 4 80 

AATAATGAAG ACAAGGTCAG GCAAATCGTA CAGAAGTATT TTAGTGAGAA AGGTTATCGC 54 0 

b5 GATGCCAGCA TACGGATAAC GCAGGAACCG GATCTTTCCA AAGATGGCTT TGTCAATGTG 600 
CTTATCTCGA TTGAGAAGAA AAGCAAAACC AAGGTGAATG 7VAATTTATTT TTCCGGCAAC 660 
AAGGCCCTTA GCAATCATAA GCTAAGAATG GCGATGAAGA ACACCAATGC CAAATTCAGT 72 0 

CTTAG/^GC ATATTCGCTC ATCTTTCTTG AAACTTTTTA GTACTCATAA GTTTGTGGAA 7 80 

GAGAGCTACC GTGAAGATTT GGTCCGATTG ATAGAGAAGT ATCAGGAATA TGGATATCGT 84 0 

/U GATGCTGAAA TACTGACCGA CAGTGTCGTG AAGGCTCCTG ACGGCAAAAG AGTGGATATT 900 
TATCTCAACA TCGAAGAGGG GCAGAAGTAT TATATTAAGG ATGTCAACTT TGTGGGCAAT 960 

TCACAATATC CATCGGAGTA TTTGGAACGA GTGCTCGGAA TAAAATCCGG AGATGTGTAC 1020 

AATCAGAGAC GATTGGCTAA GCGTCTCAAT GAAGATGAAG ATGCTGTGGG GAACCTGTAC 108 0 

TATAACAATG GCTATATTTT TGCGTGGGTC GATCCCGTGG AAACAAATGT AGTGGGGGAT 114 0 

/O TCTGTTTCGC TTGATATTCG TATAGCGGAG GGGAAGCAGG CCAATATCA^\ TAAGGTGATC 12^0 
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ATCAAAGGAA ATACTGTCGT GTACGAAGAC GTAGTACGCC GAGAGCTTTA CACAAAGCCC 12 60 

GGCCAGCTCT TTAGTCGCGA GGATATCATT AACTCTATTC GTCTCATCAA TCAGCTTGGG 1320 

CATTTCGATG CCGAAAAATC TATTCCCCGT CCGATTCCCA ATCCCGAAAC AGGAACAGTG 1380 

GATATAGAGT ATGATTTGGT GCCGCGTAGC AGTGACCAAT TGGAGCTTTC TGTCGGTTGG 1A40 

5 AGTCAGTCCG GACTTCTGTT CCGAGGAGCC ATTAAGTTCA CGAACTTCTC TGTCGGCAAC 1500 

TTGCTCCATC CCTCGATGTA TAAGAAAGGG ATCATTCCGC AAGGGGATGG GCAAACACTA 1560 

TCACTGAGTG CTCAGACCAA TGGAAAGTAC TATCAGCAGT ATAGTGTCAC ATTTATGGAT 162 0 

CCATGGTTTG GGGGCAAGCG GCCGGATATG TTCAGCTTCA GTGCATTCTA TTCCAAGACT 1680 

ACGGCGATTG ACTCCAAGTT CTACAATAGC AATGCCGGCA ACT ACT AT AA TGCCTACTAT 17^10 

10 AATAGCTACT ACAACAACTA TAATAGTTAT TACAACGGTA TGTCGAACTA TACCGGCGAC 18 00 

CTCTATACTC AGGCCAGCGA TCCGGATCGT TCGCTTCAGA TGTTAGGTAC TTCGATCGGT 18 60 

TACGGTAAGC GTTTGACTTG GCCGGACAAT TGGTTCCAGA TTTATACTTC TCTGAACTAC 1920 

ACCTACTATA GACTGCGAAA TTGGAGCTAC AATACCTTCC AAAATTTCCA TCATGGCTCG 1980 

GCTAATGATC TCAACTTGGA GCTGCGTCTC TCTCGTACTT CCATCGATAA TCCTATTTAT 204 0 

15 ACCAGAAGCG GATCGGATTT CATGGTTTCT GTTGCTGCTA CTCTTCCTTA TTCTTTGTGG 2100 

GACAATCATG ACTATGCCAG CCAGAACCTC AGCGTAAGCG ATCGTTACAG ATTTATCGAG 2160 

TATCACAAGT GGAAGTTTAG AGGACGAGTT TTTACTCCAT TGCTCAATCC TGCTACGCAT 2220 

AAATATACAC CGGTGCTCAT GAGTCGAGTG GAAGGAGCAG TTCTTGGTTC GTATAATTCC 22 80 

AATAAGAAAT CTCCTTTCGG TACTTTCTAT ATGGGAGGTG ATGGTATGTC CAGCTATTAT 23^0 

20 GGTGGCTACA TGAATGAGAC TATAGGTTTG CGTGGTTATA AGAACGGATC TATTGCCGGT 2*100 

AATAACTACG ACTATGCATA TGCTTATATG CGGCTTACGA TGGAACTACG TTTCCCGATT 24 60 

CTGTTTGAAA ACTCATTCAA TGCGTGGCTC TTAGCTTTTG CCGAAGCAGG CAATGCGTGG 2520 

CGCAGTATCG ACAATTATAA TCCCTTTAAC CTGA.i\GCGAT CGGCCGGTGT AGGATTGCGT 2580 

GTAACGTTAC CGATGGTCGG AATGCTCGGT ATCGATTGGG GATATGGCTT TGACCGTCCG 26^0 

25 GACAATTCTC TACAGCGAGG AGGAAGCAAT GTCCACTTTG TGCTCGGACA GGAGTTC 2 6 97 

(2) 114 FORMAT I OM FOR SEQ ID NO: 53 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 

35 

(ii) MOLECULE TYPE: DIJA (genomic) 

(iii) HYPOTHETICAL: NO 

40 (iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAl'IISH: PORYPHYROHOHAS GINGIVALIS 

45 (ix) FEATURE: 

(A) NAtvIE/KEY: misc^feature 

(B) LOCATION 1 . . .531 



50 



65 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 



CAAATAAACA TGAATGGCGA TATGAAACGG TTTTTGATTT TGATCGGCTT TGCACTGGCG 60 

GTAGCTTTCT CCGGTTTTTC CCAAAAGTTC GCTTTGGTAG ATATGGAATA TATCCTCAGG 120 

AATATTCCTG ACTATGAGAT GATGAACGAA CAGCTGGAAC AGGTGTCCAA GAAATGGCAA 180 

AATGAAATCG AAGCTCTCGA AAATGAAGCC CAATCTATGT ATAAGAAGTA TCAGAGCGAT 24 0 

55 CTCGTATTCT TGTCTGCTGC ACAGAAGAAA ACCCAAGAAG AGGCTATCGT AAAGAAAGAG 300 

CAGCAAGCAT CCGAGCTCAA GCGGAAGTAT TTCGGCCCGG AGGGGGAGCT GTATAAGAAA 360 

CGCTCCGATC TGATGAAGCC TATTCAGGAT GAGATTTGGA ATGCTATCAA AGAGATTGCC 4 20 

AAGCGTAACA ACTATCAGAT GGTGCTTGAT AGAGGTACGT CCGGAATTAT CTTTGCCAGT 4 80 

CCGTCTATTG ACATTAGCGA CCTTGTACTG AGCAAGATGG GCTTTAGCAA G 531 

60 

(2) INFORMATION FOR SEQ ID NO: 54 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 510 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 

70 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

75 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1 J_> 





wo 99/29870 



PCT/AU98/01023 



36 / 490 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



(vi) ORIGINAL SOURCE: 

(A) ORGAMISM: PORYPHYROMONAS GIHGIVALIS 

( ix ) FEATURE : 

(A) NAIIE/KEY: mi sc_f eat ure 

(B) LOCATION 1 ... 510 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 

CGAATAAATA AACAACACGA AATGAAGAAA TTTTTTCTCA TGCTTCTGAT GGCTCTTCCT 60 

TTGAGCCTCT TGGCACAAAA GGTGGCAGTG GTAAACACTG AGGAGATCAT TTCCAAAATG 12 0 

CCGGAACAAG TAGCTGCTAC CAAACAGCTC AACGAATTGG CCGAAAAGTA TCGCCTTGAT 180 

CTCAAGAGTA TGGACGATGA GTTTGCCAAA AAGACAGAAG AATTTGTAAA GGAAAAAGAC 24 0 

TCTCTACTGG AGAACATCCG CAATCGTCGT CAGCAGGAAC TTCAGGATAT TCAAACTCGT 300 

TATCAGCAGT CATACCAAAC GATGCAGGAG GATTTGCAAA AGCGCCAACA ACAGCTTTTT 360 

GCTCCTATCC AACAAAAGGT GGCTGATGCC ATCAAGAAAG TGGGTGACGA AGAAAACTGT 4 20 

GCCTACATCA TGGAGGCCGG TATGATGCTT TACACCGGAG CTACTGCTAT TGACTTGACC 4 80 

GCAAAGGTAA AAGCGAAACT CGGAATCAAG 510 



(2} IHFORIIATION FOR SEQ ID HO: 55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAllISM: PORYPHYROMONAS GIHGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 . . .2484 

(xi) SEQUENCE DESCI^I PTION : SEQ ID HO: 55 

ATCATGAAGG AAGCTATTCC CCGAAAGAAC AAGTATATAA AGCTCAACGG TATATACAGA 60 

TTGTCATTCA TTCTGCTATG CTGCCTGCTA TGCTCTCAGG CAGCTATGGC ACAAGGCGTC 120 

AGGGTATCGG GCTATGTGCT CGACCGTGGG GAAAAGCCGA TCCCGTTCGC CGGAGTCAAA 180 

GTGCGTGGTA CGGGGACAGG CGCAACGACG AATCTGAAAG GATACTACGA GTTTCGGATG 24 0 

AAGGCCACGA CGGACAGCAT CACGATCGAG TTCAGCTCCA TGGGGTACCA AGGGGTAAGT 300 

CGCAGCTTTC CGTCTCTGAC CAAGGACACT CGGCTGAATG TTCGTTTGGC AGAGGCCGAG 360 

ATGGAGCTTT CGAGCGTGAC GGTACAGGCC ACAAAACGCA GACTCAACAC GATGGAGCGC 4 20 

GTCAATACCC GAGACCTTCG TGTCAATGCA GGGCCTACGG GAGGGGTGGA ATCGCTCATC 4 80 

AGTACCTACG CAGGAGTAAC GCAGAACAAT GAACTAAGCT CGCAATACTC GGTTCGCGGA 54 0 

GGAAGCTACG ATGAGAATAT GGTCTATGTA AACGGAGTGG AGGTTTATCG CCCGCTGCTG 600 

GTTCGCTCTG CACAGCAGGA AGGTCTGAGC TTCGTCAATC CGGATCTGAC ACAATCCGTA 660 

CAGTTCTCCG CCGGAGGGTT CACGGCCGAC TATGGCGACA AGATGTCCTC CGTACTGGAT 720 

ATTCGCTACA AGCAACCGCA GGAGAAGGAA GGAGCGGTAC TCCTCGGGAT GCTACAATCG 7 80 

AGTGCCTACT ATGGCAGCAG TGCCGGAGCC TTCAGCCAAA TCACGGGTGT ACGCTACAAG 84 0 

AGTGCCAAAT CGCTCTTGGG CACTACGGAC ACGAAAGCCG AATACGATCC GATCTATGCG 900 

GACGGACAGA CATTCATGAC GTACCGTTTC AGCCCCAAGC TGTCGGTTAG TTTCCTCGGC 960 

AATATTTCGC AAACTCGCTA CAAGTTTGTC CCTCAGACCC GTGAGACGAG CTTCGGTACA 1020 

CTGAGCGATG CCAAAAAGTT GAAGATCTTT TTCGACGGTC AGGAACAAGA TCGTTTCCTG 1080 

ACCTACTTCG GTGCCTTCAG CATGAACTTC GTGCCGGACG ACAAACAGCG GCATACGGTT 114 0 

ACGCTTTCGG CCTTCAACAG TAACGAACGG GAGACCTACG ATATTCAGGG AGAATACTTT 1200 

CTGAACGATG TGCAGCTGGG GGCGGACGGA ACTGCTTCGA TGGCTTCGGG CTCAGAGAAC 12 60 

TCCAACGGCT TGGGCATCGG GCGCAATCAC GAGCATGCGC GC7U\CAGGCT GAGCTACCGC 1320 

GTGCTGAACA TGGGTTACAG AGGGGAGATG AAGCTGAACG AGAAGCATCG CCTGCAAGCC 1380 

GGCGTATCGG CACAGATGGA GAAAATAGCC GACCATATCA GCGAATGGGA ACGGAGGGAT 14 40 

TCGGTAGGAT ACAACCTACC TCACTCGGAG ACCGTATTGC TGATGTACAA TAACCTATAT 1500 

GCCGATACGC AGATGAGGGG AACGCGCTTG TCGGCATTCG TACAGGATCG ATTCAACTTC 1560 

AGCATGGGAG GAGGTACATT TTCTCTCATT CCGGGTATCA GAGCTTCGTG GTGGAGCTTC 162 0 

AACAAGGAGT TGCTCGTCAG CCCACGTATC AGCGTGGGTT ATTCTCCCGA AAGCAACCCG 1680 

GCTTTGGTAC TGCGTGCAGC CGCCGGACTT TATTATCAGG CACCGTTTTA * CA7VAGAGCTA 17 4 0 

AGGCAGACGC ATAAGGATGC CGAAGGCAAT AACGTGGTTG TCCTCAACGA GAAGATCCGC 1800 

TCTCAGGGAG CTTTTCACAT TCTCGCAGGA GCAGACTATA CCTTCGAAAT GGGGGGGCGA 18 60 

AAATACAAGT TTACGGCAGA GGCTTACTAC AAGAGCCTGT TCAACATCAA CCCGTATATA 1920 
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ATAGAGAACG TGAAGATCCG CTATCTGGGC GAAAACATCG GTTCGGGTTA TGCTGCGGGT 1980 

ATCGATCTCA AGCTCTTCGG CGAACTGGTA CCCGGAGTGG ATTCGTGGCT GACGGCTTCC 204 0 

ATTATAAAAG CCCGTCAGAA ACTGGATGGC TACGGTTCTT TACCACTGAT GAACGCACCC 2100 

ACTTACAATT TCTCCTTCTT CCTTCAGGAG TACGTGCCGG GCAATAAACG CATCACAGCC 21 GO 

5 ACCCTGCGGG CTGCACTAAG CGGAGGATTG CCCCAGCTCA ATCCGAGCAA AGGGCTTAGC 22 2 0 

TCGCCGGCCT TTACCGCACC GGCCTATAAG CGTGTCGATC TGGGGGTAAT GTACAAATGG 22 80 

CTCGACCCGG ATGACTCCTT TGCCGGCCGA AGCAAATGGC TAATGGGAGT AAAAGGGGCC 234 0 

TACATAGGGG CTGACCTCTT CAATCTGTTC GACATGACCA ACGTCAATTC TTACTACTGG 2 4 00 

GTGTCGGATG CCTACCAACA GCAATACGCC GTACCGAACT ACCTGACACG CCGCCAATTC 2 4 60 

10 AACCTGCGTC TCCTCGTCGA ATTC 2 4 84 

(2) INFORMATION FOR SEQ ID NO: 56 

15 (i) SEQUEMCK CHARACTERISTICS: 

(A) LENGTH: 2037 base pai us 

(B) TYPE: nucleic acid 

(C) STRAtlDEDNESS : double 

( D) TOPOLOGY : circular 

20 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
25 (iv) ANTI-SENSE: NO 

( V i ) OR I GI N AL S OURCE : 

(A) ORGANISM: PORYPHYROHOI JAS GINGIVALIS 

30 (ixl FEATURE: 

{A) NAI"IE/KEY: misc_feature 

(B) LOCATION 1.,.2037 



35 



70 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 



CCTATCCCTT TTCTTCTTAA TATGTACAGC GGACATCATA AAATCCATTA TCCTTTTCTT 60 

ATCCTGTTGG TATGCCTTGC TTTTGCTGCC TGCAAGAGCG TGAAGTTGAA AGATGCGGAG 120 

AAGGCACATG ATCGCCAAGA GTATACCAAG GCTGCCGATA TGTACAATAC ATTATACAGG 180 

CGTACCCGAC GAAAGCAGGT GGAGATGAAA GCTTATACGG CTTTCCGATC CGGTGAAAAC 24 0 

40 TATCGTGCCG CCGGCAGACA AGCCAAAGCT TTGCGTGGCT ATCTGAATGC CAGACGCTAC 300 

GGGTATCCGG ATTCTGTGGT ACTGCTCCGT TTGGCACAGA CTTATCAGCA AGGAGGTAAC 360 

TATAAGGAAG CCGAGGTACT CTTCCGTGGA TATCTGGAAG CTTATCCGAA AAGTTATTTT 4 20 

GCAGCTATCG GTTTGGAGGG GTGTCTCTTT GCCCGCCAGC AAAAGGAATA TCCTACACGT 4 80 

TACCGGATAC GGCGAGCTGC CGAGTGGAAT TCGGCACGGG GCGACTTCGG CCCGGCCTAT 54 0 

45 GCACCCGATG CTTCGGCTCT CTATTTCACA TCGAGCAGAA GCAAAGACGA CGGTTTGGAT 600 

AATAGCAGCA TAACGGGACT GAAACCCAAC G AC ATT TATA TCATCAAACG AGATGCACAA 660 

GGACGATGGG GACGTCCCGA TAGCGTGTCC GGAGGAATCA ACACTCCATG GGATGAAGGC 7 20 

GTGCCAACGA TCACGCCCGA TGGTAGTACC ATATATTATA CGTTGGCGCA GCAAGGAGCC 7 80 

GATTACGACC GTACGGTACA GATCTATTCC GCCGCTCGGA GCGGAGAAGG CGGTTGGAGC 84 0 

50 AACGGTTCGC TCGTGGACAT TATGCGCGAT TCGCTCCGTA TGGCTGCTCA TCCCTCTATG 900 

TCGGCATCCG GCGATTACCT GTATTTCGTC AGCAATATAG GCGGTAGCTA TGGCGGCAAG 960 

GATATTTATC GTGTCAAGGT GTCGGATCGT TCTTATGGTT CACCGGAGAA TTTGGGGCCT 1020 

GATATCAATA CGCCGGGGGA CGAAATGTTT CCCTTCATAG ATGGGGATAG TACCCTTTTC 1080 

TTCGCTTCGG ACGGACACGC CGGTCTGGGA GGACTGGATA TTTTCAAAGC CACGCTGGAC 114 0 

55 TCTACCGGCC AATGGCATGT AGTCAATATG GGACAACCGG TCAATTCCTC TGCCGATGAT 12 00 

TTCGGCTTGG CTGTGGAGCC TAAAGGCAAA AACAAAGAAG AAGCTTTGCC GGACAACGGA 12 60 

GTCAAAGGTG TATTTTGTTC CAACCGAGGC GATGCACGCG GATGGCCGCA CCTCTTCCAT 1320 

TTCGAACTGC CGGCTATCTA CACCGAGATT CAAGGTTATG TGATGGACAG AGAAGAAAAT 1380 

CCCATAGCCG GAGCCACTGT CAGGATCGTA GGCGAACGCG GCCCCGTAGG ACAGGGATTC 14 4 0 

60 GTGACTACTC GTGACGATGG CTCCTATAAG ATGAGCGTGC AGGGCGATAC TCGCTATGTA 1500 

ATGCTTGCCG GAGCATCGGG TTATTTGAAT CAGTACGTAG AACTCAAGAC CGATACCGCC 1560 

AAGCAGAGTG AGACCTACTA TGTGGACTTT TTCCTTGCAT CGCGTGAGAA AGCCGAGGGC 1620 

TTGCAAAATA TTTTCTATGA TTTCGATAAA GCTACTCTTC GCCCCGAAAG CATGAAGAGC 1680 

TTGGACGAAC TGATTCGTAT CCTCACGGAC AATCCGGATA TTCGGATCGA ATTGGGTTCG 17 4 0 

65 CATGCCGACA GGAAAGGCCC CGATGCTTAC AACCTCGGAC TATCTGACCG CAGAGCCAAA 18 00 

TCCGTGGTGG ATTACCTCAC GAGTCGTGGC ATAGCGGCCG ACAGGCTTAC GTGGAAAGGC 1860 

TACGGTAAGT CTGTCCCCAA GACGGTGACA GCCAAAATTG CCGAACGGCA CGATTTCCTG 192 0 

AAGGAAGGCG ATGTGCTCAC CGAGGAATTC GTAGCACCTT TGACCGAGGA GCAGCAGTCA 198 0 

GTCTGCGACC AACTGAACCG TCGTACCGAG TTCCGTGTGA TCGAAGAAGA GTTGCGT 2 037 



(2) INFOR14ATIOH FOR SEQ ID NO: 57 



(i) SEQUENCE CHARACTERISTICS: 
75 (A) LENGTH: 2316 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDHESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: MO 

(iv) AHTI-SEMSE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGAI>IISM: PORYPHYROMOMAS GINGIVAL IS 

( ix ) FEATURE : 

(A) IJAIIE/KEY: misc feature 

(B) LOCATIOM 1 . . . 2316 

(xi) SEQUEtJCE DESCRIPTION: SEQ ID l(0:57 



CCTGCACAGC 
GAAACAGAAA 
TCTGCCTCCA 
ATATCAATAA 
GCTTTTGCCA 
GTCACGGTCT 
CTCATCTCAT 
ACGCAAAGTT 
GGTTTCAAGC 
GACAATATCT 
TTCTCTTCCA 
TCCAAGGACA 
GGATCATTCA 
TTGGACAAGC 
CTGGAAGAAG 
TATACTGTAG 
AATCTGTATC 
GTTTACGGAG 
AAACATGGCT 
TATAACAATG 
GCCCTACTCC 
CGAAACATGA 
AACCCCGGAT 
AACGATGCTC 
GCGAACGAGT 
GTCGGAATCA 
TTCAGTGCTC 
ATAGGCAATC 
AGCAATGCAC 
AAAGATCTGA 
GACAAGGCTC 
GCCAACAAGT 
AAGAAAAGCC 
ACCTTCGGTA 
ATGGGACGCA 
CAGCAACTGC 
CAAGCAATGG 
GTGAACTTGA 
GGACGTAACA 



CGAAAACATA 
GGACATTGTA 
TACTCTTGCA 
TTATGAAAAA 
AGACGGACAA 
ATTCTACACG 
CGCGCAuACAT 
CGCTCGATGT 
CCTCCGGCAA 
CTACGCTCAA 
TCTACGGCAC 
AGATCCATGG 
ACTTGGGTGG 
AGAACAAGGA 
CTATAGTAGA 
CAACGGGACG 
AAAACGTATT 
AATCCATWVA 
GCCACACGCT 
CCGATCCCAC 
AGGACAAGAT 
CGATGGAGTC 
ATGCCACGAA 
TATCGATATC 
ATCTCAACAA 
TVATATGAGTT 
CCGATGCTTT 
CTGACCTGAA 
GCTGCGGGAT 
TCTTGTCCAG 
GTATGAGCGG 
TCTCTCTCCG 
AGACCGATGC 
TCGAATATCG 
GGATCGAGCA 
TTGCAGCAGA 
TGTTCAATGC 
ACAACATCTT 
TCATGGGTAA 



TTGCATACGA 
TGCGGACAAG 
AGAATACTCC 
GTTTTTCTTC 
CGTCCCGACA 
CACGGCCGTA 
CAAGCAGTCC 
CATACAATAC 
GTATGTAACC 
CACGAGCAAC 
CAATGCCATG 
CAACGTTTCT 
CCGCTTCGAG 
CTATAAGACC 
TGTAAATGCT 
TCTGCGTTTC 
CCTCGGAGAT 
AAATCTGAAT 
TCAATTCTCC 
CGGTTTCATC 
TTCCTTTGGA 
AGAAAGATTC 
CAATATCGGT 
TGCCGGTGCA 
TGAAGCCAAA 
TGTGAAAGGC 
CCAAAAAGCA 
ACCCGAAAAG 
CCAAGCCGAC 
CCCTGACTAT 
TATCGAGGCC 
CGCATTTGCG 
CCCTTGGAGC 
TGGCAAAGAA 
AAACTGGTAT 
AGAGCCTGAA 
CTCTGCTTAC 
GGATGAGCTT 
GGTTATGGTC 



TACTTTCGGA 
ATGCGCCGGC 
ACAACAGAAT 
GCGCTACTAT 
GATTCGCTAC 
CCTCTGAAAA 
GGCTTTAACA 
CCGGGCTTTA 
GTATTGGTAA 
ATCGAACAAA 
GGCGGTGTGG 
CTCTTCGGCG 
GATATTTTCT 
GGATCAAACA 
ACCAAAAACA 
GGTATCGACT 
GCGATCCCCG 
CGTTCTTCGA 
CCCTACTTCA 
AACTACAAAA 
GGACAAAATA 
GAGCAGGCAG 
TTGTTCGGAC 
CGTGCCGACT 
CAGGAAACTC 
CTTACAGCTC 
GGCCAATACG 
TCCATGACCT 
GTAACCTTAA 
GCTAATAATA 
CTTTTGTCTT 
7\ATGCCACGA 
GAAATGTACT 
GGACTTGAAG 
GCTTACTACC 
TTGGCTGCTC 
TACCACATGA 
TATACGGAGA 
AACTTC 



GAGAGGTATC 
ATATCGTCAA 
TATTCATTCA 
CGATTGGTAT 
GAGTACACAA 
AGATACCGGC 
ACATGACCGA 
GTTCGAACAT 
ACGGCATCCC 
TCGAGATCCT 
TGAACATCAT 
GTAGCTACCA 
CATTCGATCT 
ATTTCCTATC 
AGAAAATGAA 
TCACGCCCGA 
TAGGAGGATC 
CCTCTTTCGA 
ACATAGAGAA 
GCGACTACTA 
TCGTACTCGG 
GAGTGAATAC 
AGGCCAATTT 
TCATGTTCTT 
ATAACGTAAT 
ATGGTACATT 
TAGGCCCGTT 
GGGACTTCGG 
CCTATTTCCA 
TCACCACATA 
ATGACTTCGG 
TCATGCTCAA 
ACGTTCGCAA 
TGATGCTCAA 
CCGAAGTTCG 
AGGGACTGCT 
ACAAGTATCT 
AAGACGGCTA 



TCGGAGGAGG 
GTGCTGTCCC 
TCACTTTAAC 
TTCAGCGCAG 
TCTTCAGACC 
CAAGATGGAA 
CATCCTCAAG 
CGGTATCCGC 
TGCGGGAACG 
CAAAGGCCCG 
CACCCACA7y\ 
GACCATGGCC 
TAGTCTGGGC 
CCTGAGCAAA 
GGGGAGCGAC 
ATGGTCGCTG 
TATATGGGGC 
GCTGCTCGGC 
ATCGGAGAAC 
CACCTATGGT 
TGTCGACAGC 
AAAGCCATAC 
CTACCTGCTG 
TGACCTGAAA 
CAATCCGAAT 
CGGTAGTGCA 
CGGCACGACC 
TATCGGATAC 
CACCGACCAC 
CATCAATGCC 
CAGCCTCTTT 
TTCCGAGATG 
GCAGAACATC 
CGGTCGCTTC 
CCCCGAACTC 
CCGTCATCCG 
CACCTTCGGT 
CCACATGCCC 



(2) iriFORIlATIOM FOR SEQ ID HO: 58 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1^52 base pairs 

(B) TYPE: nucleic acid 

(C) STRAt IDE DUES S: double 

(D) TOPOLOGY: circular- 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: HO 

(iv) AHTI-SEMSE: MO 

(vi) ORIGIHAL SOURCE: 



60 
120 
180 
240 
300 
360 
^20 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2316 
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(A) ORGANISM: PORY PHYROMO! IAS GIIIGIVALIS 

(i;-;) FEATURE: 

(A) MAI'lE/KEY: misc_f eature 
(B> LOCATION 1 . . .14 52 

(xi) SEQUENCE DESCRIPTION: SEQ ID WO: 58 

GGCCGGGCCT CTTCCCCTTA CAGGCAAATG GATGGCATAC TAAACGATGA ATATAGACAA 60 

GCATCTATGA ACAGGTTTTC AAATCATTGG CCCTGCATCC TCGTGGGGTT TGTACTCTGG 120 

TTTGTATCGG CGAGTCGGAC TGTGGCACAA AACGCCTCCG AAACGACGGT ATCGTACGAT 180 

ACGGATACCG CCGTACTCTC CGAAGCCGAT GTGCTTCGGA TCGCTCTTAG TGAGAATGCC 24 0 

ACAGTGAAAG TGGCCGATAT GGATGTGCGC AAACAGGAAT ATGCACGTAG GGCAGCACGT 300 

GCCGATCTCT TCCCGAAAGT AGACCTCAAT GGCGTTTACA GCCATACGCT AAAGAAGCAG 360 

GTCTTATATA TAGATATGCC CGGTTTCAGC AGTAGCGAAG GTATCGAAAT GGGGCGTACA 4 20 

CACAATACGC AAGGAGGGGT GAACGTCTCC ATGCCATTGG TGTCGGCACA GCTTTGGAAA ^80 

AGCATTGCCA TGACCGGAGA ACAGCTCGAT CTGGCTCTGG AGAAAGCTCG CAGCTCCCGA 54 0 

ATCGATTTGG TGGCAGAGGT GAAGAAGGCT TACCTCAGTG TATTGTTGGC CGAGGACTCT 600 

TATGGCGTAT TCAAGCGCAG CTATGACAAT GCTCTGGCCA ATTATAAGAA CATATCCGAC 660 

AAGTTCGATC GTGGACTTGT GGCCGAGTAT GATAAGATTC GAGCCAATGT ACAGGTACGC 72 0 

AACATCGAGC CTAACCTCTT GCAAGCGCAG AACTCCGTAG CCCTTGCTCT CTGGCAGCTC 7 80 

AAGGTCCTGA TGAGCATGGA AGTGGAAACT CCGATCAGAC TCTCCGGTTC ATTGTCCGAC 84 0 

TATAAAGAAC AAGTCTATAC CGGCTATTTT GCCGCCGATA CGCTTATTTC CAACAACTCC 900 

TCCCTGCGTC AGCTCGATAT ACAGCGTCGT CTGGCTGTCA GTGCAGACAA GCTGAACAAG 960 

TACAGCTTCC TGCCTACACT CAATCTGGGA GGGCAGTACA CCTATTCGCT CAACAGCAAC 102 0 

GACATCAAAT TCTGGGGCGA GGGACAACGC TGGACGCCTT TCTCCACCAT ATCGCTCAGC 108 0 

CTGTACATTC CTATATTCAA TGGAGGCAAA CGTCTGTACA ACGTGAAGCA 7\AGTGCTTTA 114 0 

TCGATCCGTC AGATCGATCT GCAACGACGC CACATAGAGC AATCCATCCG AATGGGAATC 12 00 

AAGAACCAAA ATGACCGTCT GCGTACCTGT ATGCAGAGAT TTGTGGCCTC GGAAGAGGCT 1260 

GTCCGAAGTG CAGAAAAGGG CTATCAGATA GCAGAGAAAC GCTATCAGAC AGGCGAAGGC 1320 

ACTCTCGTCG AGCTCAACGA TGCCGATGTG GCTCTTTTGC AGGCTCGACT CAATTATAAT 13 8 0 

CAGGCCATAT TCGACTTTAT GACCGCAAAG GCCGAATTGG ACAAGATGAA CGGCATGGGG 14 4 0 

ATTCCCGAAC AA 14 52 



(2) INFORMATION FOR SEQ ID NO: 59 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



45 



( i i ) MOLECULE TY PE : OH A ( genomi c ) 
(iii) HYPOTHETICAL: NO 



50 



(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHY ROHONAS GINGIVALIS 



55 



( ix ) FEATURE : 

(A) NAJIE/KEY: misc_feature 

(B) LOCATION 1...162 0 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 

TTTCATAACT TTGACTTCCT AAACGGTATA AAATTGTTTT CGATGGCAAA TAATACTCTT 60 

60 TTGGCGAAGA CTCGACGTTA TGTCTGCCTT GTCGGTTTCT GTTGGCTCAT GGCGATGATG 120 

CACCTCTCTG GGCAGGAAGT CACTATGTGG GGGGACAGCC ATGGAGTGGC GCCGAACCAA 180 

GTGCGCCGAA CGCTGGTGAA GGTAGCCTTA AGTGAATCCC TTCCTCCGGG TGCAAAACAG 24 0 

ATTCGTATCG GATTCTCTCT TCCGAAAGAA ACGGAGGAAA AAGTCACCGC CCTATATCTC 300 

CTTGTGAGTG ATTCTTTAGC GGTGCGCGAC TTGCCGGACT ACAAAGGGCG AGTCTCTTAC 360 

65 GATAGCTTCC CGATCTCAAA GGAAGATCGT ACCACAGCCC TTTCTGCGGA TTCGGTAGCC 4 20 

GGACGCCGCT TCTTTTATTT GGCTGCGGAT ATAGGGCCTG TTGCTTCTTT TTCCCGATCC 4 80 

GATACGCTGA CTGCCCGTGT GGAAGAGGTG GCTGTCGATG GCCGCCCTTT GCCGTTGAAA 54 0 

GAGCTGTCGC CTGCCTCCCG TCGTCTGTAT AGGGGGTATG AGGCCCTCTT TGTACCCGGT 600 

GATGGCGGAT CGCGGAACTA TCGTATCCCG GCCATTTTGA AAACGGCTAA TGGAACACTC 660 

70 ATAGCGATGG CCGACAGACG AAAATATAAT CAGACGGATC TGCCGGAGGA TATAGATATA 7 20 

GTCATGCGGC GCAGTACGGA CGGAGGGAAA TCGTGGAGCG ATCCCAGGAT TATCGTACAG 780 

GGAGAGGGGC GCAATCATGG CTTTGGCGAT GTAGCCCTGG TGCAAACCCA AGCAGGAAAG 840 

CTCCTGATGA TCTTTGTCGG TGGAGTAGGC CTGTGGCAGT CTACCCCCGA TCGTCCTCAG 900 

CGCACTTATA TATCGGAAAG TCGGGACGAA GGACTGACTT GGTCGCCTCC TCGGGATATA 960 

75 ACCCATTTCA TCTTCGGCAA GGATTGTGCC GATCCGGGAC GCAGTCGCTG GTTGGCCTCC 1020 
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10 



TTTTGTGCTT 
ATCCGCGAAT 
GGCGATACAT 
TTGATGCCCG 
CAGCGTTTCT 
GAGGGCATCC 
GATCAAGTGC 
CTCTTCGATC 
GCCTACTCGG 
GATGAGATCT 



CGGGACAAGG 
CAGGGCAGGA 
GGCAGCTTTC 
ATGGCAGGGT 
TCGCTCTCTC 
ATGACCCCGG 
TGCACTCCCT 
ATGTCTCCGG 
ATATGACTCT 
CATTGGTTTT 



GCTTGTGCTG 
GTACGTCCTG 
CGACTGTGCA 
ACTGATGAGC 
CTCCGACGAT 
CTGTAATGGA 
GCCTCTCGGC 
CCGCTGGTCC 
GCTGGCGGAT 
CATTCGGTTC 



CCATCCGGTC 
AACAACTATG 
TACCGCCGTG 
ATACGCAATC 
GGCCTTACTT 
GCTATGCTTC 
CCGGATGGGC 
GCTCCCGTTG 
GGAACGATCG 
GTCCTTGACG 



GTATCACGTT 
TCCTCTATAG 
GCGATGAGGC 
AGGGACGGCA 
GGGAGAGAGC 
AAGTGAAAAG 
GTCGCGATGG 
TTGTCAATTC 
GTTATTTCGT 
ATCTCTTCGA 



TGTGGCTGCC 
CGACGATGAG 
AAAGCTTTCA 
GGAGAGCCGA 
CAAGCAGTTC 
GAACGGAAGG 
AGCTGTCTAT 
AGGATCGAGT 
CGAAGAGGGC 
TGTCCGGCAA 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



(2) IMFORMATIOlvi FOR SEQ ID HO: 60 

15 (i) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 87 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: circular 

20 

{ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

25 (iv) AI IT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMOI JAS GINGIVALIS 

30 (ix) FEATURE; 

(A) NAl-IE/KEY: mi£c_feature 

(B) LOCATION 1 ... 879 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID 110:60 



AAGTCTCCGA GCGATTCGGC TTTGCCGTCG GTATGGAGCG CGAATACAAT ATCTGGACTC 60 

GTCGGTGGGA AACGCATTAC TTTGCTTATC CTGTATTCTA TGGCGATAAG AAGTAGTAAT 120 

ATAGAATCAA TACAGTGCTT TGTGATGAAA AAAGAAAAAC TTTGGATTGC GATCGTCGCC 180 

GGTTTGGCTT TCGTATTGGG CCTTTATGCT CTTGGCCGCA GTGTCGCTCA GCTACGCCGC 24 0 

40 TCTCAGCCTT CGGTGACTGT GACCGGTATG GCCGAGCGTA ATTTCAAATC CGATCTGATC 300 

GTTTGGACTG CTTCGTACCA GCTCCAGATG ATGGATCTCG AATCGGCCTA CAAGGCTTTG 360 

AAGGAAAAAC AGATATTGGT AGCAGACTAT TTGAAAAACA AGCAGCTGCC CGATTCGTCT 4 20 

TATATCTTCT CAAGCGTAGC CATCTCTAAA GAATACAACT ACTATTACGA TCCTCGGCAG 4 80 

GAACAAAACG TCAGGACCTT TGCCGGGTAT CTGCTCAGCC AGACAGTTAC GGTGACCTCA 54 0 

45 CAGGACATCG AACATGTGGA GAAAATATCT CGCGATATAA CGGAGCTGAT CAATCAGGGG 600 

GTAGAGATTA CCTCCGACCG TCCGGCCTAT TACTACACCA AGCTCAATGA TCTGAAGGTG 66 0 

GAGATGCTGC GCAATGCCTC CGAAGACGCT TTCAATCGTG CTTCGGTCAT TGCGGAGGGG 720 

AGCGGTTCCT CCGTGGGTAA GATGCTATCT TCTTCGATGG GCGTGTTCCA GATAGTGGGG 780 

CTCAACTCGA ACGAAGATTA TAGCTGGGGA GGTTCGTTCA ATACGTCTTC CAAGATGAAG 84 0 

50 ACGGCAAGGA TAACGGTTAA GGCTTCTTTC GCTTTGAAG 87 9 



(2) INFORMATION FOR SEQ ID NO: 61 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

60 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

65 (iv) ANT I -SENSE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGAJ^ISN: PORY PHYROMONAS GINGIVALIS 

70 (ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 ... 840 



75 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 
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GGGAAACTCC AAATGAAAAA AACAATTGCA 
GCCGTCGGCT GTAAGAAAAA TGCTGACACT 
TTGTCCATGG GTATTTTGTA CGGACAGGAT 
CAAGGCCAGC CGATTGATTC GGTAGCTTTC 
5 ACGCGCTTCT CGTACAATCT GGGAGCCATC 

CATGATTCCA TCGATATCGA CAAGTTCTAT 
ACCGTATCTA TCGCCATGAA GCCTGCAGAT 
AAAAAGCAGC GAGAAAACAA TATGAAGCAG 
TACATCGATA CCTTTAAAAA AGAAGATGGT 
10 AAGACTCTTC AGGAAGGTAC GGGAGCTACT 
TATGTGGGTA CTCTGGTCGA TGGTAAAGAG 
GCCGTTACCG GTGTGATTAA AGGCTGGACG 
AAAGTTCGCG TGGTAATCCC ACAGGAGCTG 
GAACCGTTCT CTACCCTGAC GTTCGAGATG 

15 



ATTATCGCCT CAGCCCTCTT GGCTTTAGGA 60 

ACCGCTGTCA GTGAAAAGGA TAGCATAGCC 120 

TTTGCCAATC AGTTCGAAAT GTCCCGCTTG 180 

TTGGACGGTT TCAAATATGG TATCGATACG 24 0 

TATGCTTCCA ATATAGCTCG TCAGCTGGCT 300 

GCAGCCATGC GTGCGGCTCT TCTTAA.AGAC 360 

GCACAGGCTT TCATGCAACG AATCCAAGCC A20 

TTTGGCCAGA ACATCGAAAA GGGTAATGAA 4 80 

GTAACTGTTA CGACAACTGG TCTGGCATAC 5 AO 

CCCTCTTTGG CCGATACTGT ACGTGTCAAG 600 

TT CG ACAAAA ACGAAGAAGG AATCGAATTT 660 

GAGATGCTCC AACTCATGAA GGTCGGTCAG 720 

GCTTATGGGG AGACCGGCAA CTATACCATC 7 80 

GAACTTATCG GGATCAAGCC CGGGAAAAAG 84 0 



(2) IMFORilATIOH FOR SEQ ID UO:62 



(i) SEQLfEI^CE CHARACTERISTICS: 
20 (A) LENGTH: 2-109 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIIDEDNESS: double 

(D) TOPOLOGY: circular 

25 (ii) IIOLECULE TYPE: DMA (genomic) 



(iii) HYPOTHETICAL: IIO 



(iv) AlITI-SEMSE: 110 

30 

(vi) ORIGINAL SOURCE: 

(A) ORGANISt'I: PORYPHYROMOI IAS GIMGIVALIS 

(ix) FEATURE: 
35 (A) NAIIE/KEY: mi sc_f ea t ure 

(B) LOCATION 1 ... 2409 



(xi) SEQUENCE DESCRIPTION: SI 

40 TGTCGAAAGC AGAAGCTCTC AAACCCGAAG 
TTCCGACAAA GCGATAAGAA TCCGATGAAA 
TTTGTCCTAC TGACAGGTGC CTGCTCCACC 
TATATCGGAA TGGGCAAGAC ACAGATACTC 
GCTCTGACCG AAGTGGAGAG TACACTGAAA 

45 GCAAGTGCCT CCTTACCCAA GATACCATTC 
GATTCCACTG TCATTTCGAA ATGGATATTC 
AGTCAGGTCA AATCCGATAG CCGGGCTAAG 
TACTTCGATG CTAAAGTAAA AAGCAGTGTG 
AAAATCTCCT ATACGGTGGA TATGGCCTCT 

50 CCGATCAGCA CTTTCCCCGA CAGCATTCTG 
AAAGGAGACC AGTTCAATTT GGCAAAGCTG 
CTGAGAGACA ATGGTTACTA CTACTTCCGC 
CTCCTCGTAA GAGGTGCCGT ATGCCTGCGA 
GCCATGCGCC CGTGGAGGAT AGGGAAACGG 

55 AGCCCGACAG ACTCGCTCGA AGTGGAGGAT 
GTTCGCCCCA AGATTTTGGC CAAACGCTTT 
AAAGACGATG AGACGACACG CAAATCCTTG 
CTCAATTTTT TGCAACGCGA TTCCATTTCC 
CTCGACAAAC CTTGGGATGC ATCATTAGAG 

60 ATCGGTCCCG GACTGAATTT TGCTCTTGCT 
CTTTCTTGGA ATATCGGTGG ATCGTATGAG 
AGCAATCGGC TGATCGATAT AAATTCGTAC 
CCCTCGATTG TATTTCCCGG TCTGCTGGAT 
TTTCAGGCTT CTGCCACCGC GCTGAACAGG 

65 TTTTCGACCA CCTACGAATT TCAGCCCTCC 
AAGCTCAACT ACAACCTCCT GGGGCATCAG 
AATCCGCCCC TGCTGCTCAG CCTTCAGAGT 
ACGTTCAACA AATCCGTTTC AGAGAAAAGT 
TCCGAGGCAG GCAATCTCCT GAATCTGATC 

70 ACCAAGAATT TCGTCGGCGT CCCCTTCTCT 
TATTCCTATA CCATAGACCG CAATCAGTCA 
TATAGCTATG GCAATATGCG AGTGGCACCC 
AATAGTATCA GAGCTTTCAC CGTCCGTAGC 
GACAATCAGT ATTCCTATTT GGATCAGGTG 

75 TATAGAGGCA AGCTTTTCGG GGATCTCCAC 



ID NO: 62 ' 

AGGAGCCGGT ATCCGTGCAG ACGGATATCA 60 

GTATTACGGC AAGTATTCCT CCCCATCCTT 120 

ACAAAGAATC TGCCGGAAGG CGAACAGCTG 180 

CGGCAGGACA AGAGCCACGC CGGCCAACAG 24 0 

GTTACACCCA ATGGAGCTAT TTTCGGCAGT 300 

GGGCTATGGC TATACAACAG CTTCGTGGGG 360 

GACAAGTTTG CAGCCAAGCC GGTTTTCATC 4 20 

GTGGCGACGA ACATCCTCCG CGAACACGGG 4 80 

ACCACTCTGA AAAAGGACTC GCTCAAAGCC 54 0 

CCTTATCATT ACGACAGCAT CATTCCCTTA 600 

GCTTACAGGC AGACTCCGTC TTTGATCAGG 660 

CACGAAGAGC GTCAGACCAT CAGTGCCCTG 7 20 

CCACAGGATA TTATCTACGA AGCCGATACC 7 80 

GCCAAGCTCT CGGAAGATAC TCCACCCCAA 8^0 

ACAGCAGTCC TGCTCGGAAT GAACGGAGAA 900 

ATGAAAGTCC TTTACTATCG TTWVATGCCG 960 

CGTTTCTTCT CCGGCAATCT GTATCGGCAG 1020 

GCTCGTTTGG GAGCCTTCTC CGTTATCGAT 108 0 

GGCCTTTTGG ATGTGCGACT GCTAACCACC 11 "3 0 

ACCTTGTTCA CGAGCAAAAG CAATGACTTC 12 00 

CGGCGCAATG TATTCGGCGG AGGAGAAAAT 1260 

TGGGAGACCG GCAATCGTCC CGAAAATAGC 132 0 

AACATGAATA CGGCCGTGAA CCTCTCGTTT 138 0 

AAATACTATT ACTACCCCAC GACTACGACT 14 4 0 

GCACACTACT TTAGCATGTA CTCTTTCGGC 1500 

AAGGAACACC GGCATGCTAT TTTCCCGCTC 1560 

ACAGAAACTT TCCAGGCCAT TACGGCGAAC 162 0 

CAGTTCCTTG CTCAAATGGG GTATATCTAT 168 0 

CCTCATCATC TTTGGATGCA ATTCGGACTA 17 4 0 

TATCTGGCAG CCGGCAAGAA GTACAGCGAC 1800 

CAGTTCATCA AAGCCACGGG AGAACTGCGC 1860 

CTGGCAACCC GTTTCGGGAC AGGCGTGATA 192 0 

TATAGCGAGC AGTTCTATGT AGGCGGTGCC 198 0 

ATCGGCCCCG GACGGTTCAA TCCGGATTCC 204 0 

GGCGAATTCA AACTCGAAGC CAACGTGGAA 2100 

GCAGCCGTTT TCCTCGATGC GGGCAACGTT 2160 
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TGGCTCTTGA GGGAGGATTC TTCCCGTCCG GGCGGTGCTC TGTCCGAAGT GGGATCGGTG 2220 

AGCAATTTCC TGAATAGCAT CGCTCTCGGC ACCGGTGTCG GCCTTCGCTA CGATCTGGCA 22 80 

TTTCTCGTGG TTCGTGTCGA TGTCGGCTTC GGTCTCCACC TTCCTTACAA TACGGGTAAG 234 0 

AAAGGTTACT ACAATATCCC ACGCTTTAAG GATGCCATCG GTTTCCATTT GGCTGTCGGC 2 4 Qo 

5 TATCCCTTC PjOQ 



(2) IHFORI-IATIOH FOR SEQ ID MO: 63 

10 (i) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 234 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRAJIDEDMESS : double 

(D) TOPOLOGY: circular 

15 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: HO 

20 (iv) AIJTI-SEMSE: MO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROIIONAS GIMGIVALIS 



25 



30 



50 



fix) FEATURE: 

(A) NAME/ KEY: mi -cofeature 

(B) LOCATION 1 . . . 2 34 9 

(:<i) SEQUENCE DESCRIPTION: SEQ ID HO: 63 



TCTCTCTCCC CGTATATCCG TTTTCCTATG TCCTCGCATT CCGTTCGGTA TCTAATCGGC 60 

ATTGCCGGCT GCTTGCTCCT CATGCTTGCT TCCTCCTGCT CGGTCACCCG TTATGTGCCG 120 

GACGGTAGCA GACTATTAGA CAGGGTAACG ATCGCAAGCG AAACGGGCAG TATCGCTCTG 180 

CCGGAAGATA TTCGGGACTA TACCCTCCAG CAACCCAATT ACAGACTGTT CGGGATGACT 2 4 <"> 

35 CGCTGGCTAC TGCGCGTCTA TAGCAGCTCG AATCCGAACA GCAACAGCTG GTGGAACCGT 300 

TCGCTCCGGA AAATGGGCGA ACCGCCTGTC CTCATCGATT CTGTCCTCAC CGATCGTACT 360 

GCCAACCGTC TGGCAAAGGC GATGGCCGGC GATGGCTTTC TCGATGCTAC TGCTCGTGCC 4 20 

GTGGTAGACA CCGGCTTGTA CAAGAAAGCT CGCATTACTT ATCTGATTCA GCCCGGAAGC 4 80 

CGTTATTATA TACGCAATAT GGCTTTGGAT GTGAAGAATC CACTCCTTCC TCCCGTTGCG 5 4rt 

4U CTTGGCAATT CGCTTCCTTC GGCATACAAG GTCGGGATCA GCGAGGGTTC TCCCTTGTCG 600 

CCCATCGTAC TCGATGAAGA GAGAAAGGCG ATAGCTCGTC ATATGCGCAA CAACGGCTTC 660 

TGGAAGTTCT CCGCCGAGGA TGTTTATTAT GAAGCAGATA CTACCGTTTC AGGAGGATCG 720 

GGTACGAAAT CTGCCGATCT GAAATTAGTG GTCAATGGCA TCGGGCGTTA TCCATATCGG 7 80 

ATCGGCAGGG TATTCTTTCA TGCCGATTAT GATCCTCTCG AATCGGACTT CAGAGTTCAG 84 0 

45 GAGCTGCCAC GTATCGATTC GATTTCGCGT GGCGATTACA CTGTTTACTA TGGGAGTAGG 900 

GGACGTTATA TCCGGGCATC GGCTCTCACG CGGTCGGTGT CCGTTACACC GGGAGCTTTT 960 

TTCTGCGAGG ATGATGTGGA ACGCTCTTAT ATCAAGCTGA ATGCGCTCCC TATCGTTCGG 102 0 

AACGTGAATA TCCGATTTGT GGAGCACAAT GGTAAGGATG AGATTGCTCT GGCGGATAGC 108^ 

TCTCGCCTTG TGGACTGCTA TATTCTTACC GTTCCGGCCA AGAGCAAATC GTTCGAAGCC 114 0 

GAAGTCCTCG GCACCAATTC CGCTGGAGAC TTCGGGGCGG CTTTGTCTCT CGGTTTCACC 12 00 

GATCGCAATT TGTTTCGTGG GGCGGAGATG TTCAATATCA AACTCAAGGG TGCTTACGAA 12 6^ 

GCCATTCGCA AGGGTTCGCA CAGCTTCATG GAATATGGGG TGGAAAGCTC GCTCCGTTTC 13^0 

CCTCGTCTCC TCTTCCCATT CATTTCTGAC GAAACGCGCC GGCGGCTACG GGCATCCACG 1380 

G/iATGGAAGA TCGGGTATAA TTACCAGACA CGTCCGGAGT TTGATCGGGT GATTCTCTCC 1440 

55 GCTCAACTCA ATTATTCATG GCAGACCTAC CTGCACAATC GTCTGCGTCA TACGATCCGC 1500 

CTGCTGGATG TCGATTATCT CCATCTCCCG TACATCGATC CCGACTTCGC CCAATCCCTT 1560 

CCGCCTACGA CTGCACTGTA TAACTACACG GAGCAGTTTA TCCTCGGCTC GGCATATATA 162 0 

CTGAACTATA CCACGGCTTC GTCCATGGAG CGTACCGTAT CCAATCCTTT TACGGCACGG 168 0 

TTCAGTATCC AGACAGCCGG CAACCTGCTG CAAGCCATTT CTTATCTGAC CGATTCTCCG 17 4 0 

bU AAAGACGAAC ACGGGTTGTA TAAAATGTTC GGTCTGCACT ATGCTCAGTT CGTCAAGCTC 18 0n 

GATCTCGATC TGGCTAAAAC CGTTCTTCTC GAAAAGGACA ATACTTTGGC ACTGCATCTG I8 60 

GGTTTCGGAC TGGCTTTCCC TTATGGCAAT GCTCGCCATA TACCCTTTGA GTTACGTTAC 192 0 

TTTGCCGGAG GATCGAACAG CGTTCGCGGC TGGAGTGTCC GTACCCTCGG CCCGGGGAGT 1980 

ATGAAGATGA CTCCGGACAA GACCTTCTTC GATCAGATGG GTGATATTCG TCTGGATCTG 2 0^0 

D5 AATGTCGAAT ACAGGACAAA GCTGTTCTGG AAGTTTCGCG CAGCAGCTTT TGTCGATGCC 2100 

GGCAATGTCT GGACGATAAA GGAGTATGAG AATCAGGAGG ACGGTCTCTT TCGTTTCGAT 2160 

CGCTTCTACA AGGAAATAGC TTTGGCCTAC GGTCTGGGGC TTCGTCTCGA CTTCGATTAT 22 2 0 

TTCCTTGTGC GGCTGGATGC CGGACTGAAA GCCTACGATC CTCAGCAGAC AGGGCGTTAC 2 280 

AAATGGGCTA TCACACGCCC AAACCTTTCT TCCAATTTCG CTTGGCACAT TGCAGTAGGC ''34 r. 

7U TATCCGTTC 2 34 9 



75 



(2) INFORMATION FOR SEQ ID WO: 64 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 2625 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDWESS: double 

(D) TOPOLOGY: circular 

5 

(ii) MOLECULE TYPE: DUA (genomic) 

(iii) HYPOTHETICAL: NO 

10 (iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROHONAS GINGIVALIS 

15 (ix) FEATURE: 

(A) NAI-IE/KEY: misc_f ea ture 

(B) LOCATION 1...2625 



20 



70 



SEQUENCE DESCRIPTION: SEQ ID NO: 64 



GTCGAATCTA AATTGTTATG TCTTATGAGA AAAAGAATTC TACAACTTTT CCTGACCGCA 60 

TTGCTGCTGG CATTAGGCTC CTCTCTCGCC ATAGCGCAAA CAGTGGTGAC CGGTAAGGTG 120 

ATCGATTCAG AAACGTCCGA ACCGCTCATC GGTGTATCCG TAAGCACCGG TCAGGGAGCA 180 

TCCCTCCGCG GTGTAACCAC CGATATGGAT GGTGGCTTCC GATTCGAAGT ACCGGCCAAA 24 0 

25 TCTGTCTTGA CTTTCCGTTG CGTAGGTTAT GCTACCGTAA CTCGCTCTAT AGGCAGAGGT 3 00 

TCTCAAGAAG ACCTCGGTAC GATTCTCCTC GATCCCCAGG CCATCGGCTT GGATGAGATT 3 60 

CAGGTAATAG CCTCTGTGGT GCCCAAAGAC CGTATGACGC CGGTACCCGT TTCCAATATC A 20 

CGTGTGGCTG ATATTCAGGC AGCATCGTTG AATGTCGAAT TTCCCGAACT GGTTAAA.TCC 4 80 

ACTCCCTCTA CCTATACGAC AAAAGGAAGC GGAGGTTTCG GTGATGGTCG TACCAATGTG 54 0 

30 CGTGGATTCG ACACTTAC7\A CTTCGGTGTA CTCATCAACG GAGTTCCTGT CAATGGTATG 600 

GAAGACGGGA AAGTATATTG GAGCAATTGG AGTGGTCTGA TGAATCAAGC CAGTACCATT 660 

CAGATTCAGC GCGGACTCGG AGCCTCCAAG CTCGGTATCA GCTCGGTAGG TGGTACGATG 72 0 

AACATTATCA CGAAGACTAC GGACGCCAAC ACCGGAGGTT CGGCTTATGT CGGTATGGGT 7 80 

AATGATGGAT TGCACAAAGA ATCGTTCTCC ATTTCTACGG GTATGAACGA CGGTTGGGCT 84 0 

35 ATCACCATTG CAGGCTCCCA TATGACGGGT CTGGGTTATG TGAAGGGGCT GAAGGGACGT 900 

GCATTCTCTT ACTTCTTCAA CGTTTCGAAG AAGTTCAATG AACGTCATAC CCTCTCTCTT 960 

ACCGGATTCG GTGCACCACA ATGGCACAAC CAACGTTCTT CCAAATATTC TGTAGCCGAC 102 0 

TATGACAAAT ACGGCATCCG TCACAATCAA TCCTTCGGCT ATCTGCGAGG CGAACTGACT 108 0 

CCTACGGCTT ATGCTTACAA TACGTACCAC AAGCCCCAGT TCTCGCTGAA CCACTTCTGG 114 0 

40 AAGATGGATG AAAATACCTC TCTTTATACc gCAHCCTACG CATCTTTGGC TACCGGTGGA 12 00 

GGTCGTCGCG CTTATGGAAA GAACAGTAAG TGGGTATTGA TCAACTACAA CACCGGACAA 1260 

CCCTATGAAC AAACAAAGGT GACTCCCGAT GGACTTATCG ACTACGATGC CGTACTGGCT 132 0 

GCCAATGCTG CGGCGAGCAA TGGCTCGGAA GCAATTTTTG CCCTTGGCTC CAACTCTCAC 138 0 

AAGTGGTTCG GTCTACTCTC TTCATTCAAG AAGAAACTTA ATAGTTCGCT GACTTTGACA 14 4 0 

45 GCCGGATACG ATGGGCGTTA CTACCGTGGC GACCACTATG ACAAGATCAC CGATCTGCTC 1500 

GGCGGTAGCT ACTACATAGA GGATCCCAAG ACAAAGCTCG CATACCATGC GGAAGGTCAG 1560 

CAACTGAAAG TGGGTGACAT TGTAAATCGG GACTACACAG GCGAAATCAT GTGGCACGGC 162 0 

CTCTTCGCAC AGATGGAGCA TTCGTCCGAA TGGATCGATG CATTCGTATC AGGATCTATC 168 0 

AACTACGAAC TATACCGCAA TCACAACTAT GGCGGTAGCA AGTCCACCGG CTACCTGCCC 174 0 

50 GGCGTATCGC CGTGGAAAAG CTTCCTTCCG TGGAGTGGCA AGGCAGGTCT GAGCTACAAG 1800 

TTCGCACAGG GACACAATGT ATTCGCCAAT GGCGGTTTCT TCACACGTGC ACCACTCTTT 1860 

GGCAATATCT ATGCTGCGGG GGCTATCATT CCCAATGACA AAGCCAATAT GGAAAAGGTG 192 0 

CTTACAGGAG AGGTCGGCTA TGGATTCACG AATCACAAAA ACTTCGAGTT CAATATCAAC 198 0 

GGATACTATA CGAAGTGGAT GGATCGCGTG ACCTCGAAGA GAATCGGAAA CGAGTATGTT 204 0 

55 TATCTCAATG GCGTTGATGC TGTTCACTGT GGGGTAGAGG CTGAGGTCAG CTATCGTCCT 2100 

ATTCGTCAGA TCGACCTTCG CGGTATGTTC TCTCTCGGTG ACTGGACTTG GCAAAACAAT 216 0 

GTAAGTTACA CTTCTTACGA CGAAGCCGGC AATGAGACAG GGCAGGATAT AACCTATATC 222 0 

AAGGGTCTTC ACGTCGGAGA TGCAGCACAG ATGACGGCTG CTGTATCGGC AGACATAGAG 22 8 0 

CTGTTCAAGG GTTTCCATGT CATAGGTAAG TACAACTTCC TTGGCAAGAA CTATGCAGGA 234 0 

60 TTCAACCCCG CAACGCGTAA TGCACAGCAG TACGAAGCGG ATGGCAAAGA AATCGTGGAA 24 00 

TCATGGAAGT TGCCCGATGT AGGTCTGTTC GATCTGTCTG CATCCTACAA TTTCAAGCTT 24 60 

GGTTCACTCA GCACCACATT CTATTTCAAC ATGGACAACG TAGCCGACAA GCGATATGTG 2 52 0 

AGCGATGCCG ACGACAATAT CATCGGTAAG AAACACGATG AGGCTTCGGC TCTCGTATGG 2 580 

TACGGTTTCG GCCGCACTTG GTCTACCGGT ATTCGTGTAA ACTTC 2 62 5 

65 



(2) INFORMATION FOR SEQ ID NO: 65 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1380 base pairs 

(B) TYPE: nucleic acid 

(C) STRAI^IDEDHESS: double 

(D) TOPOLOGY: circular 

75 {ii) MOLECULE TYPE: DNA (genomic) 
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25 



30 



45 



(ii) MOLECULE TYPE: DUA (genomic) 
50 (iii) HYPOTHETICAL: IJO 

(iv) AHTI-SEMSE: NO 



55 



60 



(vi) ORIGIMAL SOURCE: 

(A) ORGANISM: PORYPHYROHOMAS GIHGIVALIS 

(ix) FEATURE: 

(A) MAI IE/KEY: mi sc_f ea t ur e 

(B) LOCATIOM 1 ... 1026 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 



60 



{iii> HYPOTHETICAL: NO 
(iv) ANT I -SENSE: NO 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHONAS GINGIVALIS 

(ix) FEATURE: 
10 (A) NAI^IE/KEY: misc_feature 

(B) LOCATION 1 . . . 1380 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 

15 AACAGATGTA ATCCCTCGCG TCAATACTTC CACCTATCGC AGAATGACGT TGGAAGATCT ou- 

CTCCGAGCTT TTCTAATCCA TCTATCTATG 7\AGTTTTCAA TCCGCCTTTT CCTCTGCAT^ 120 

ATCTTTCTCC TCTCTGCATT TATCCTGCCT GCTCTCGGAC A7WU\TCCAA GCAGGTACAG 180 

CGACTTGAGA AGCAACGTAA GGAGGCCCTC AAAGCCATCG AAAAAACCGA TCGCGAACTA 24 n 

CGAAATACCA AGAAAGACAA GCAAGACAAA CAAAAGCATC TCAACCTCCT GAACAAGCAG 300 

ZU GTTGCTCAAC GCAAGCAGAT GGTACAACTC TTGGACAATG AGGTCAAAGA GTTGCAATCC 36 n 

GACATTGATT CCATGACGGG TGTATGTCAT CAGCTCTCTG TAGAAGAGAA AGCCCGATCC 4 20 

GATGAATATG CCCAAGCTCT ACAGTCTATG CAAAAGCGGA AACGCTCGTT GGATCGCATC 4 BO 

CTTTTCATTT CATCGGCCAA GAGCTTTGAC GAAGGCATGC GACGGATGCG TTTCTTGGAA 540 

CAATACGCTT CTGCATACAA GCTGGCATCT GTCCGGCTGC GCGATACACG TAGCAAGTTG 600 

GAGACTGAAC GTGCGACTGT AGAAGACGCC AAAAAGGAGA AAGGACATCT CTTAGTCATC 660 

AGAGAAGAGG AAAAAAAGAA ACTCG.AAGGA CAGCAAGCCG AGCAACGTCG GCAGGTGCAG 72 n 

GCTTTGGGAG CCAAACAAAA AGACTTGGAA GCGCAGCTGC GAAAGCAGAA AAAGCAAGCC 7 80 

G7V\GCTCTGA ACAGAAAGAT CGAGAAACAG ATTGCCAAGG AAATAGAAGC TGCCGAACGT 8^0 

CGTGCTCGAG AAGAACGTGA ACGGTTGGCA CGCGAAGCCA AAGCCAAGGG TAAGCCGGTT 900 

CCTGCCGAAC CGGAACGGAA GGCGGAGACC AAAGGCGGCT ATGCTATGGA TGCCTCTGAG 960 

CGTGCTCTCT CGGGCAGCTT TGCACAGAAC AAAGGTCGCC TGCCCGGCCC CGTTCGCGGC 1020 

AGATACCGAA TCGTAAGCGA CTTTGGCGTG CATCAGCACA GTGAGCTGAA AAAAGTACAA 1080 

GTTAATAATG GAGGTATCGA CATCGCTGTA GCAACAGGAT CCGATGCTAC CAGCGTATTC 1140 

GATGGTGTAG TGTCCAGTGT ATTCGTGATA CCCGGTTATA ATTCGGCCGT AATGGTTCGT 1200 

OO CACGGTAACT ATATCACGGT TTATGCGAAT CTGAGCAAAG TGTATGTAAA TTCCGGCACT 1260 

CGTGTTAAAA CGGGTCAGGC TCTTGGTCGT GCCTATACGG ATCCTTCCAA CAACCAGACC 132 0 

ATTATTCACT TCGAAATCTG GAAAGAACGC AGCAAACAAA ACCCAAGACT ATGGTTACGA 1380 

40 (2) INFORMATION FOR SEQ ID NO: 66 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



AGTTTTTATC AAGAAATAGA CAGACTTATG AAAAAGTATT TGTTATATGC CTCGTTGCTA 6n 

ACGAGTGTTT TGCTCTTTTC CTGTTCAAAG AACAATCCTA ACGAGCCGGT GGAAGACAGA 120 

t>0 TCCATCGAAA TTTCTATAAG GGTAGATGAT TTCACCAAAA CGGGTGAGGC AGTACGCTAT 18n 

GAAAGGAATC AAGGAAGTGC TGCCGAAAGG CTCATTACCA ATCTTTACCT CTTGTTGTTC 24 0 

GATCAGTCAG GGGCGAATCC GGCGAA.ATAC TATATTACCG GTAACACTTT CACCGGAGGG 300 

ACCTGGCTTC CTGACGATAT GAAGGTGAAG TTGGATATGA CACAATCCGA GGCCGGAGAG 360 

CGCAAAGTAT ATGTCGTAGC CAATGTTGAT AATGCGGTTA AAACGGCTCT TGATGCTGTC 420 

/U GCTAACGAAA GCGATTTGCA GACTGTAAAG AGGACGACTG CAATGCCGTG GTCGACCGAT 4 80 

ATAGCCTCTC CTTTCCTGAT GTCCGGAAAC AAGACACACG ACTTCTTGGC CAATCGTCTT 5^0 

TTGGACAATG TGCCCCTTGT GCGTGCCATT GCCAAGGTGG AGCTGAATAT CTCGCTGAGT 600 

GAGAAATTTC AGATTGTGCC GATAATTGTC AATGGTAGTT TGAGTGAGTT CAAGTTCAGA 66n 

TACGTAAACT TCGACAAGGA GACCTACGTA GTGAAGCCAA CGACCAA.GCC GGACAATCTC 7 20 

/O ATTAGTTCTG CTAATGGTGT TTGGCCTCAG ATTACAGATT GGACTGTATG GGGTG^TT^^ 7 8n 
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TTAAATACTT CTCCTGCTCC GGATGCGGGC ACAGGTTATA CATTGGATGC AAATGGCAAG Q40 

GTAACGGCAC TACGGATTGT TACCTATCTG AATGAGCGCG ATAGCAAAGG GGCTACGGTA 900 

GAGGTCGCAT TGCCTCGTGT GGATGATGGC ACCCTTCCTC CTCCGGAATT CGGTCCGGAG 960 

CTTTATCGTT TGCCTTTGCC GGACAAGATC CTGCGCAATC ATTGGTACAA GTATGAAGTC 102 0 

5 GAGATT 1026 

(2) IIJFORIIATIOII FOR SEQ ID MO: 67 

10 (i) SEQUEllCE CHARACTERISTICS: 

(A) LENGTH: 987 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIIDEDMESS: double 

(D) TOPOLOGY: circular 

15 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: liO 

20 (iv) AliTI-SEIISE: HO 

(vi) ORIGIHAL SOURCE: 

(A) ORGANISM: PORYPHYROt-lONAS GINGIVALIS 

25 (i::) FEATURE: 

(A) HAIIE/KEY: misc_feature 

(B) LOCATION 1 . . . 987 



30 



70 



SEQUENCE DESCRIPTION: SEQ ID HO: 67 



AACCTTAGGA CACAGCCTTT CTTTTTGGTA GATTTGCAAT CTATGATCAG AACGATACTT 60 

TCACGATATG TATCCTCGAA CTTTTGGAGT CGGGGAGCTA CCTTTTTTTT CACGATTTTC 120 

CCGGCCTTCA TCCTCGCCGC TACTGCTTTG CCGGCTTGTG GAGGGGGTAC TGCTTCAGGC 180 

TCCGATCGTA CGCTGGCTGT GACCATCGAG CCACAGAAAT ACTTCATCGA GTCCATTGCG 24 0 

35 GATAAGTCGG TGCAGGTGGT GGCATTGGTA CCGGCCGGCA GCAATCCGGA GGAATACGAC 300 

CCTTCGCCTA CCGTGATGAA GCGTTTGTCC GAAGCAGATG CCTACTTCTA TATAGGAGGA 360 

CTGGGGTTCG AGCAAAGAAA TCTCGCTGCC ATTCGGGACA ATAACCCTAA GCTCCCTCTT 420 

TTCGAAATGG GCAAAGCCTT GGCGGATGCC GGAAGTGCAG ATCTCCACGG CTCCTGCACA 4 80 

GATCATTCTC ATACAGACCT GCATGCCCAT GATCCGCACT ATTGGAGCAG TGTGGTAGGG 54 0 

40 GCAAAGGCAC TCAGTCGTGC TGCATACGAC GCGCTTGTGG AGCTTTATCC GAACGAGAAA 600 

GACAAATGGG ACAAAGGGCA CGACCGTCTC AACGGACGTA TCGACAGCGT GAAGAGACTC 660 

GTCGATACCA TGTTTGCCAA TGGCAAAGCA GACAAAGCCT TCGTCATATA TCACCCATCG 7 20 

CTCAGCTTTT TCGCCCAAGA GTTCGGCCTG CGGCAGATCG TCATAGAGGA AGATGGGAAA 7 80 

GAGCCTACGG CTGCCCACCT TCGTCGTGTG ATCGATCAGG CACGTGCCGA TGGTGTCAGA 84 0 

45 ATCGTATTTA TCCAACCCGA ATTTGAAACG CGTCAGGCGG AGGACATCGC ACGCGAGATC 900 

GGTGCTCGTC CGGTAAGGAT CAATCCTCTG CGCAGCTCGT GGGAGGAGGA AATTTTACAT 960 

ATTGCTCGCG CTTTGGCTCA TGAACGG 987 

50 (2) INFORMATION FOR SEQ ID r>JO:68 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2634 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ill MOLECULE TYPE: DNA (genomic) 

60 (iii) HYPOTHETICAL: MO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 
65 (A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAI4E/KEY: nd sc_f eat ure 

(B) LOCATION 1 ... 2634 



(y.i) SEQUENCE DESCRIPTION: SEQ ID 1/0:68 



GCAGATTCTA TTCGATATCC TCTTTACTTT TTTGGGCGGA ATCGGAAGAA ATGCTTTAGG 60 
GAACCTATTC CCACCTTATA CAATAAAAAC ATGATCGGAA AAAAAATCTT TTTTATCCTG 120 
75 CTGGCGCTCA TTGCGTTCAG TGGGCTGAAC GCAGCGACAG ACACTGAGTT CAAGTACCCG 180 
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ACCGATGCCA ATATCATCGG TCACGTCAAA GACAGCAAGA CGGGTGAACA CCTTGTCGGT 2 40 

ATCACTATTG CTATCAAAGG CACTACCTTT GGTACATCTA CAGATGCAAC CGGGCACTAC 300 

TATCTTCGTA ACTTGCGTCC GGGTGAGATC ACTTTGATTA TGCGTGGCAT GGGCTATAAG 360 

AGCCAGGAGC GCGTAGTCCG CGTAGAAAAG GACAAGACTA TCGAGGTGAA TTTCGAAGCA 4 20 

5 GAAGAGGATG CCATCAATCT GGACGAAGTC GTGATTTCGG CCAACCGCGA ACTGACGCTT 4 80 

CGCCGTCTTG CTCCTACTCT GGTAAATGTA TTGAACGAAA AAGTCTTCTG GCAAGTCAAT 54 0 

GCTTCTAACC TGGCTCAAGG CTTGTCATTC CAGCCGGGAG TTCGTGTAGA- GAACAACTGT 600 

CAGAACTGTG GTTTCAATCA AGTTCGTATC AATGGACTGG ATGGTCGTTA TGCACAGATC 660 

CTCATCGACA GCCGTCCCAT CATGAGTGCC CTTGCCGGTG TTTACGGTCT GGAGCAGATC 7 20 

10 CCTGCCAATA TGATCGAACG TGTGGAGGTA GTACGTGGTG GAGGATCGGC CTTGTACGGT 7 80 

TCTTCTGCTA TTGCCGGAGT GGTGAATATC ATCACCAAGG AACCTTCTCA CAATTCTTTC 84 0 

ACATTCAATG AATCTCTGAG CTTTACCGGT TTCAGCAAGC TGGATAACAA CACGAACTTC 900 

AATGCCTCCA TCGTCAGCGA TGACAACCGT GCCGGTGCCA TGGTATTCGG GCAGGCTCGT 960 

TACCGCAACC ATTGGGATGC TAACAATGAC GGTTATTCCG AATTGGGT7\A AATAGATGCC 102 0 

15 CGCTCGCTGG GAGCGCATTC TTATTTGCGC TTGAGCGACT ACAGC7W\TT GACGGGAGAG 1080 

TTTCACACGA TCAGTGAATT CCGCCGTGGT GGCGATCGTA TCGATTTGCC TCCTCACGTA 114 0 

GTGGGTGTAG CTGAACAAAC TGACCATAGC GTATTTAGCG GAAACTTGAA ATACGATCTC 1200 

TTCTCTTCCA ACTATAAACA CCACTTCCAG GCTTATACTT CCGGACAGAT CGTAAATCGC 1260 

AAGAGCTATT ACGGAGGTAT CGGAGAGATT GACGTCAATG GCCACCCCGG TGGTACGGAA 1320 

20 GGCTACCCTA TCCCTCAAGA TCAATACGGC AATAATTATG GCGTGACCAA AGGCAAGACA 1380 

TATATGGGCG GTATCCAGTA CAGCTACGAC TTGGACAAAT TCCTCCTCAT GCCTTCGCAA 14 4 0 

CTTTTGTTCG GAGCCGAATA TACGCGTGAT GAACTCAATG A-::'GTGATGCC CATCCTTTCA 15 00 

TGGCAGACCG GCGAGGATGC CAATGGGAAT ACCATTCCCC TCTATCCCGA ATTGGATCAG 1560 

AATATCAACA ACTACAGCCT ATTCGGTCAG AACGAATGGA AAAATGACAG ATGGAGCATC 162 0 

25 CTTGTTGGCG CTCGCTTGGA CAAGCATAGC GAAGTCAAGG ATATGATTCT GAGTCCTCGT 168 0 

ACCACACTGC GTTTCAACGT GAATCCGGAC ATCAACCTGC GCGCTACATA TGCAA7\AGGG 174 0 

TTCCGCGCAC CGCAGGTATT CGATGAAGAC TTGCACGTAG GGGTTGTAGG GGGTGAGGCA 1800 

CAGAAAGTAT TCAACGATCC GAACCTCAAG CCTGAAATTT CTCATGCATT CAGTTTGAGT 1860 

GCCGATATGT ATCATCGTTT CGGTAACGTC CAGACCAACT TCCTTGTGGA AGGCTTCTAT 192 0 

30 ACTCGTTTGC TGGATGTATT CACCAACGAG GAGCAGCCTG ATCAGCACGA TGGCATCAAA 198 0 

CGCTACACGC GTATCAACGG TAGCGGAGCC AAAGTATTCG GTCTCAATCT GGAAGGTAAG 204 0 

GTCGCATACA AGTCCTTCCA GCTCCAAGCC GGTCTTACCC TGGCCAGCAA CAAATACGAC 2100 

GAAGCACAGG AGTGGGGTCT GAATACGGTG AAAGACACCA ACGGAGCTTT TGTTACCGAG 2160 

GCCAATGCAA ATGGACAACA GGAATACAAG AACGAATGGA TGACGGATAC GCAGATCACC 2220 

35 CGTACCCCCA GCGTATACGG TTATTTTACT TTGGCCTACA ATCCTGCTCA CTCATGGAAC 2280 

ATAGCCCTTA CGGGAGCATA TACCGGTCAG ATGTATGTAC CCCACGCTAT CGAATATGGT 234 0 

GTGAAGTCTG CCGAACTGGA TATTATGCAG AACAATCCTG AGATTACCGA CGAAACCGGA 2 4 00 

AAGGCTCCCC GTATTGATGA GCTGAAGAAG ACACCTGCAT TCTTCGATTT GGGCTTGAAA 24 60 

GTGGGTTATG ACTTCCACGT ATTCCAGGCT ACTGAGGTTC AACTCTATGT AGGTATGAAC 2 52 0 

40 AATATCTTCA ACTCTTTCCA GAAGGACTTC GATCGTGGAG CTGCACGTGA CAGCGGATAT 2580 

ATCTATGGTC CTACGCAGCC GCGTACAGGC TACATGGGCT TGGTAGTGAA GTTC 2634 



45 



55 



60 



(2) ItlFORl-lATION FOR SEQ ID NO: 69 



(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 618 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIJDEDHESS: double 
50 ( D ) TOPOLOGY : circular 



(ii) MOLECLfLE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: HO 

(Iv) AUTI -SENSE: I JO 

(vi) ORIGINAL SOURCE: 

(A) ORGAIIISH: PORYPHYROMOIIAS GINGIVALIS 



(ix) FEATURE: 

(A) NAl'IE/KEY: misc_feature 

(B) LOCATION 1 ... 618 

65 SEQUENCE DESCRIPTION: SEQ ID NO: 69 

AAACAGATAG TTATGACAGT AAAGCGCGCA GTGCGAATAG CACTTCTCAC GCTGATAGGC 60 

ATTCTTTTTT CCTCACCTTC TCTTGTTCGG GCGCAAAGTC TTTTCAGCAC CGAACATGTC 120 

TTGCAACTAT ACAACAAGAT ACTCTATGGA GAGTCGGCGG CGGATACCGT CGCAGAGAAA 180 

70 ACGGCAGGTG AGTCGGCATT TCCTTTTATA GACAAACTCA TCAATCTCGG CCGCACTTTC 24 0 

CTCGGCAAAC CATATCGCTA TCGCGGTCCT TCCCCATGGC CGATGGACTG CTCGGGCTAT 300 

GTGTCTTACC TCTACTCCAA ATTCGACATC AAACTCCCAC GTGGTGCGGC AGCACAGAGC 360 

CAATATACGA ATCCTATCGA GCGCGAGGAT GTTCGTCCGG GCGACCTCCT TTTTTTCAAA 4 20 

GGCCGCAATG CACGCAGCAA CCGTATCGGG CATGTAGCTT TGGTCGTATC TGTCGATGAA 4 80 

75 GATGATATTA CCATGATGCA CAGCCGCAAT TCGCGAGGGA TCGTGATCGA AA7\ACTC7V\T 54 0 
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CGCAGTGCAT ACTTCTCCCG TCGCTTGGTG AGCTATGGCA GGGTACCCGG AGCCAAGAGA 
GTGATCCCAC GAAAAAGT 



{2) INFORIIATIOII FOR SEQ ID HO: 70 



(i) SEQUEMCE CHARACTERISTICS: 

(A) LEiJGTH: 1 'J 01 base pairs 

(B) TYPE: nucleic acid 

(C) STRAtlDEDMESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: HO 



(iv) AilTI -SENSE: HO 



(vi) ORIGINAL SOURCE: 

(A) ORGAiaSM: PORYPHYROMOHAS GIHGIVALIS 



(ix) FEATLfRE: 

(A) NAIIE/KEY: mi sc_f eat u re 

(B) LOCATIOH 1 ... 1401 



(xi) SEQUENCE DESCRIPTIOM: SEQ ID HO: 7 0 



AAAGGTACGT 
GCACTGACCG 
ACCAATCCGA 
CTGAATCTGC 
CGACACAGTT 
TCGAAAGACA 
GGAGCTTCGG 
TACAACGTGG 
ATCGCGGCTC 
CAGTTGGCAC 
TGGGCAGAGG 
GTACAATATC 
GAGCACCCCG 
TTGGGATCTC 
CTGCATTCGA 
GCATACTTCC 
GGCAAGGAGT 
TACAGTATCG 
GTTCGGAGCA 
GCCCTCTATA 
GCAGCAGCCG 
TTCGAGGCAG 
AGCCAAGT GG 
TTCTATCAGG 



GGAATAGAAA 
TCCTATCTTC 
TGTCAGGCCT 
GCAGACAGGA 
TCCTGCCCTC 
AAACGGGAGT 
TGGAAGTATT 
AGGATGGTAT 
TCTATATCAA 
TGATTCGCGA 
GTAAGCTCCT 
GTTCGGAGGA 
AAAGCATTGC 
TCCTTGCTCC 
GCGAGCTGCA 
CGACGCTCAG 
ATGCCGCCAT 
GACTCTCTTT 
GTCGCCTGCA 
AAGAGATCAG 
AAAACAGCAA 
GGCGCTTGTC 
AAGAACTTCG 
GCAAAGACTT 



AAACCGAAGA 
GCTCTCCTTG 
CTCCCTGGAA 
GAT CG AACAA 
GGTCAATGCA 
AACCGTAGAT 
CAGCGGCACA 
AGCCCGACTT 
TTTGCTCTTC 
GCAACGCAAT 
CGACATAAAT 
GGAGCTGGCT 
AGTCAAGGCT 
CGAAGAGATC 
AATAGCTTCG 
CCTCTCTGCC 
CAACCCCTCC 
GAATATCCCC 
AATACGCTCA 
GCAAGCATAC 
GGCCGCTACG 
TGCCTACGAA 
TGCCAAGTAT 
C 



GAAGAAATGA 
CTTCGTGCAC 
GACTGTATCC 
GAAAuACCGAA 
GGCATCGGAC 
CGCTCCTCGA 
CGTCGTCTGC 
CAAAAAGCGC 
CGTCAGGAAA 
CGCACGGCCG 
GCCCAGATGG 
CGTCTGGACT 
CCCGACACAG 
TATCGCACGG 
GCACGCGAAG 
GGATACAGCA 
TTCTCCGAAC 
ATCTTCTCTG 
AGCGAGCTTC 
AGCAATGCCG 
CTCAAGGCAT 
TATGCCGAGG 
GACTTCATAT 



AACGGACAAT 
AAAATGAATC 
GGATAGCCAA 
TCATTAGTCT 
ACAACTATAG 
TGAATACCAA 
ACGACCTCAA 
GTGAAGACCT 
TGACTCGTAC 
AAATGGTTCG 
CCAAGGACGA 
TGGGGCAAGC 
ACGTTCTCGT 
CTCTCGGCTT 
GTCTGGCCTC 
ACGGTTACTT 
AGTGGAAGAA 
CCATGCAAAC 
GACTCGTCGA 
TGGCAGCCGA 
ACGAATACGC 
CAAAAACAAA 
ACAAAGCCAA 



CCTCCTGACG 
CGAJ\GCATCA 
GGAGCGCAAC 
CGATGCAGCA 
CTTCGGACGT 
TCTCAGCATC 
GCAGCAAAAG 
CAGCCTGCAA 
GGCAGAAACA 
CGTAGGTAAA 
ACAACTTCTC 
CCTCGAACTG 
AGCAGAAAGG 
GAAACCGGCA 
GGCTCGTGCG 
CCGCGACCTC 
CAACGGCAGC 
GCAAGATCGC 
AGAGAAAAAA 
TAAGGCCATC 
TCGCGACAGC 
ATACGCCCTC 
AGTTTTGGAT 



(2) IHFORI-IATION FOR SEQ ID HO: 71 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 1353 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDHESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: HO 

(iv) AHTl-SEHSE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGAMISII: PORYPHYROMOHAS GIHGIVALIS 

(ix) FEATURE: 

(A) HAME/KEY: misc_feature 

(B) LOCATIOH 1. . .1353 

{>:1] SEQUENCE DESCRIPTION: SEQ ID 110:71 
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10 



15 



20 



25 



AAAACATCAT 
GCGGCTTTGG 
GACTATGCAC 
AAGGCCGATT 
GCTTCGTGGA 
AGCTTCAACA 
GTCTATCGAC 
CAGCAGGAGC 
CAAATGCAAG 
GCTCGAATGG 
ATGGCCGGTG 
CGGCTCAAAG 
GCTGACAGTC 
GCTGCCCATC 
CGTTTGCGAG 
ACGGGATTCT 
AACCGTCGGG 
CTTGTGAGCC 
GAAGCGGAGC 
CTGGCTTCCT 
GTCTTGCAGC 
CGGCTCCTGG 
AAACTCATAG 



ATCGGAATAA 
CTGCGAATCC 
GCCGGCACAG 
ACCTTCAGGC 
ATTTCGGACG 
ATTCGTACAG 
TGCGGATGGC 
TGGCAGCTCT 
AGCTGGCCAT 
AAGAGCTGGG 
ACCGTTTGGC 
AAAAAATGAA 
TCTCCGCCGA 
ATCATCCCGT 
CCGCGCGAGG 
CACGCTTTTT 
GGGAATACGT 
ATCTGCGTCA 
GCAGGCTCTA 
ACCGCCAGGC 
GTTATGAGGA 
ATGCCCGTGT 
CCTATTACGG 



TATGCGTTTC 
CCTTACGGGC 
TTCGGCCGTG 
CGTCGGCAAT 
CGGATTGGAT 
CATACATGCC 
GCATGCACGC 
CGGCACCACG 
GCAGAAGTAC 
GATGAAGAGT 
CCTGACTCAA 
CTTCCCCATC 
CATGGCCGAA 
CCTCCTCCGT 
TGCATTCCTG 
GAATGGATCG 
CAGTCTGAAT 
GGCGCGTGCC 
CAGCGAGATC 
GAAGGAGCAT 
GGGGCTGAAT 
GCAGCGACTG 
CTGCCTTTCG 



CAACATTATC 
CAATCGAATA 
GCGCTGTCCG 
TTTCTGCCCC 
GCCGAGACGA 
ACGATGACCC 
CGGGAGGCTT 
GAGGCCTACT 
GAGGAGAGCA 
CGTCCCGATG 
GCGGACAATC 
GATGACGAAC 
TCGGACAGCT 
GCCAAACTCG 
CCGAGTGTGT 
GACTATACGC 
CTGAGTATCC 
GAACGCAGGG 
GCCCAAGCCA 
ACCGACGCCA 
ACGGCCATCG 
AGAGCGGCCA 
GAG 



TCATCTGTAC 
TGACCCTCGA 
CTGCGGAACT 
GTGTATCGGC 
ATACCTACAC 
TTTTCGACGG 
CGCGCCTCTC 
ACGACCTCGT 
GCCGCCTCCA 
TCCTCGAGAT 
AGTGCATCAT 
TCGTCGTAGA 
CGGCCGGCGT 
ACGAGCAGGC 
CGGTATCCGG 
CCTTCAGCGA 
CCATCTTTTC 
CGGCAATCGT 
TGGCCGACCG 
TGCAAACCGC 
ACCTGACCAC 
TGACCTACCG 



GGCTGCCGTA 
AGAGTGCATA 
GGAGCAGTCC 
CGGAACCGGT 
CGACATCAAC 
TTTGCAGAGT 
CGTTCGCGAG 
CTATGCGCGC 
CCGGCAGACG 
GCAGTCGCGA 
CGCTCTGATC 
CGATATGCCG 
CTTCGCCCGT 
TGCCACCGAC 
AGGATGGAAC 
GCAGTTTCGG 
GGGATTCAGC 
CCGACGGGGC 
GGATGCCGCT 
TTACGAAGCC 
TCAGGCCAAT 
GCTCAAATGC 



60 
120 
180 
240 
300 
360 
420 
4Q0 
540 
600 
660 
720 
780 
B40 
900 
960 
1020 
1080 
ll'JO 
1200 
1260 
1320 
1353 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID MO: 72 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2886 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AHTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION 1 ... 28 86 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 



GCCATTTTTG 
AAGAGCATGA 
ATGGCAACTG 
TCTTCAAGAT 
GATTTC7\ATC 
CCCGTCCGGA 
ATCTCTTCCA 
GCCATTACGC 
ACCCCCGACC 
GCTTTTGCCA 
GGAGGGACTT 
GGACGCCCCG 
GAGATCAACA 
CAAATGATCA 
ACGAAAGGAG 
AATTTCGGTG 
AGAGGATGCA 
ATGAAGGCTT 
GMGTCCCTC 
TCGAGCGGTG 
AAGACTCCGG 
TTCTATTTCA 
AATCAGTKATA 
GAGTTCTACG 
CGCATGGACA 
TATCGTGACG 



TCGTATCATT 
ACAAATTTTA 
CACTGACCGC 
CCGCCGGAAC 
CGGAAGACCT 
TAGGACAAGT 
TCGGAGACGT 
TTTATTACGA 
ATGAAATTGT 
CAGAGCCGGT 
TGCCTGACAT 
TAACGGATAA 
TCAATTGTCC 
TGGTAAAAGG 
ACTTTACTCC 
TAACGCAATC 
GCAATGGTAC 
TCCTCCCGAT 
TGCGCTATCG 
CCGGTATTCA 
CTCTGAATAC 
AATACGATCA 
AGCACGTGGT 
GCAGACTGAA 
TCTATCTGGA 
GTTATAAGCC 



GCAAATTGAA 
CAAATCACTT 
TTCTGCACAG 
GCATTCATTC 
GATCGCACAG 
AATACCGGTG 
AGATGTATAT 
TGCATTCAAT 
GTTGGGAGCA 
ACCGGGGAGT 
CAAGATCTCC 
CCATTACGGG 
TGAAGGTGCA 
ACAGTATATC 
GCTGATCATT 
CGAGTTGGAT 
ATTGGCCATC 
CAAAGGTAAA 
TGTCTATTAC 
TCATCCGGCC 
ATGGATCTCC 
AGGTGGTACG 
CGGCACACTG 
CAGTCATTGG 
TCCCCAAAAC 
TTTGCCCTCT 



AAAATAACAG 
TTGCAGTCAG 
ATTTCGTTCG 
GACGATGCAA 
AGCCGTTGGC 
GATGTGGACT 
CGCCTGCAAT 
ATTCCGGAGG 
TATACGAACG 
GAGCTTATTA 
GGTGCGGGTT 
ATCGGTGAGG 
GACTGGCAGG 
TCAATGTGCT 
TCTGCCGGAC 
AAGTGGATCT 
TTCCGTGGCA 
TCCGATGGTC 
AATGGATGGG 
GGAGATGCCA 
TCCAGTGGTT 
GAAGGAGGAT 
ACCGGAGGTG 
AACGAGTATG 
AATGGCCAGA 
GTGCCCCGGC 



AGAATAAGTA 
GACTGGCTGC 
GAGGGGAACC 
TGACTATCCG 
AATCGCAAAG 
TTGCATCCAA 
TCAAGTTGGA 
GCGGACGCCT 
CCACTCATCG 
TGGATTATGA 
ATATATTCGA 
ACGATTCCGA 
CAGAGAAGAA 
CAGGCAACCT 
ACTGTGCTTC 
TCACTTTCCA 
ACAGTATCAT 
TCTTGCTGCA 
ACAGTACGCC 
TGAAGATTTC 
CCGGAGGGAC 
CGTCCGGTTC 
CCGGCAATTG 
CATCCGATGG 
CGACCATCCT 
TATTGTTGCA 



TAATTCAGAC 
CTTCGTGTCG 
CTTGAGTTTC 
CCTTACTCCG 
AGATGGCCGG 
GGCTTCGCAC 
AGGAGCC7W\ 
CTATATCTAT 
CCGCAACGGA 
AGTGTCTCGC 
CAAAGTCGGC 
TTCGGATTGC 
CGGTGTGGTG 
GCTCAATAAT 
CATAACAACC 
CTATGAAAAA 
CGGAGCTTCC 
ACTCAACGAT 
CGATATTCCC 
CATCCTAAAG 
TGACGATCAC 
TTCTCTCTTC 
TGGCGGGACG 
CAATACGAGC 
CAACGGAACG 
GTCTACAGGC 



60 
120 
180 
2-10 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
12 00 
12 60 
1320 
1380 
1440 
1500 
1560 
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GATCAGGTCG AATTGAATTG GACGGCTGTT 
GTCGAATACC ACATATTCCG AAATGGAAAG 
TCGGATGCCA TCGACGAAAG TATTATCGGT 
CGCTTCATTT ATCCCTCGCC GTTGGATGGA 
5 TCTGCCGACC TTGCCATAGG AGACATTCAG 

CCCGGAGGAG GAGTATCATT AAGCTGGAAA 
TTCGGAGAAA GCCCCAATCC TGTGTTCAAA 
GCCGCACAAA GCCCCAATCC TCCCGTTGGC 
ACATATCCCG AAAAGGCTGC TATCGCTGCC 

10 ACTTTCCACC TCTTCCTCAA GAGCAACACA 
TCCGATTGGC AGGCCGGAAC ATGGTTGAGG 
AATGACCATA TGCTTTTTGC CGGTATCAGA 
ATCCGTTATG TAAGAAATCC GGATAACCTT 
TVACAACGGAG TCTCTTTCGA AGGCTACGGA 

15 AAATATCTGG TGGTAAATAC CGATGCTCCG 
TATGCTAAGG GAACGAATGT GGCTCCATTC 
AACGGAACAT TTATCGGCAC ACAGGATCCA 
ACAGAGAGCG ATGAATACGA AATAAAACTG 
GTTGCTCAGA TTGAGAATAA CAATGCTGTC 

20 TTCAGCATTA AGAACGCTCA TATGGTTCAC 
CAGGTTCGTT CTTGGAACAA CCTCCGCAAT 
GCCGGTACTT ATATGCTCGT TATGCAGACG 
AAGCAG 

25 

(2) IIIFORI lATrOlI FOR SEQ ID MO: 73 



CCTGCCGATC AATATCCATC ATCTTATCAG 1620 

GAAATAGCTA CGACAAAGGA GTTGTCCTAT 1680 

AGCGGTATCA TTCGATACGA AGTAAGCGCA 17 4 0 

GTGGAATCTT ATAAGGATAC GGACAAGACT 18 00 

ACCAAGCTGA AGCCGGACGT AACACCTCTC 18 60 

GTTCCTTTCT TAAGCCAGTT GGTTTCCCGA 192 0 

ACCTTTG/\AG TGCCCTATGT TTCTGCCGCA 198 0 

GTAGTCATTG CAGACAAGTT TATGGCCGGT 204 0 

GTTTATGTAA TGCCATCCGC TCCGGACTCT 2100 

AACAGAAGAT TGCAGAAGGT GACAACTCCC 2160 

ATCAATTTGG ATAAGCCGTT CCCGGTGAAT 222 0 

ATGCCTAATA AGTACAAGCT CAATCGTGCT 228 0 

TTCTCCATTA CCGGTAAGAA GATTTCATAT 234 0 

ATACCCTCGC TCTTGGGCTA TATGGCTATC 24 00 

AAGATCGATA TGTCGCTTGT ACAGGAGCCT 24 60 

CCCGAATTGG TCGGCATATA TGTCTATAAG 252 0 

TCCGTCACAA CTTATTCGGT TTCAGACGGA 258 0 

GTATATAAGG GATCGGGCAT TTCGAATGGC 2 64 0 

GTTGCATATC CGTCTGTTGT AACAGATCGT 27 00 

GCTGCCGCCC TCTACTCATT GGATGGCAAG 2760 

GGCGTGACAT TCAGTGTTCA AGGACTTACG 2820 

GCAAACGGCC CTGTGAGCCA AAAGATCGTG 2880 

2886 



(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 2106 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



35 



(ii) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: HO 



(iv) AMTI-SEMSE: MO 

40 (vi) ORIGIIIAL SOURCE: 

(A) ORGANISM: PORY PHYROMOMAS GIMGIVALIS 

(ix) FEATURE: 

(A) MAIIE/KEY: misc_f eature 
45 ( B ) LOCATION 1 ... 2 1 06 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 



TGGAAAAAAA GTAATCCTCA ATACCGGGGC 

50 ACAGAAAGCT ATTTAAGCCT AAGGAGGAAT 
TTATCATTGA TGTTACTCTC TCTCTGGACG 
GATGGCGAGA ATACGCTTGT TTTACGCGTA 
ACAGAACCCG GTCAGGGCAT ATACAATGAG 
TATTTAGAGG GACAACTTCG TTGGCAGGTG 

55 GCCTATATCA TTCCGGTCAA AGAGCAAATG 
AGCATCTATG TAGTGGCCAA TCTCGATTTC 
CAATTTGTGG TAGAGAAATC TATTGAAGTC 
ATGCTTGCTC ATGGCAATAA GCAGATCAAT 
GATTATAAAC TCAAACGAGT GGCAGCAAAG 

60 CAAGGATATG AAGTGGTCGG AAATATACAG 
TTCCTTACCA CAGAAGCTCA AGAGATCCCA 
CTTGATATTG CAGAGTCGGC ACCTGCCAAT 
TGGACACTCT CCACACCGGA GAAGCGACCG 
ACAGGACAGC CGGACAACAC AGCCAAACCG 

65 GACAATCAGG TCAAGAGCAA TGTCCTCTAT 
TCTTTACAAG AGCCGGAAGC TGTTTCTGTA 
CTCCATCAGG ATGCATTCAA TCTGCCTGCC 
ATCTTCATGA ATAACGTGAA CACATACTCG 
AGCATTAGCA TACAGTCAGT TACCTTTAGC 

70 GATCTTGTAG CAAGTAGTAG CGACCAGTAT 
ATAATCACTT CCAAGATACC GGTTAATAAC 
AATGGGGTAG CCGGTTTGAA AGAGACTGTC 
GTCAATACAC TTGGCACAGC ATCGGCATGG 
AACAATAAAG CGATTTACCA TGTCGTAGTA 

75 GGGACACAGA CAATCATCGG TTATCCCCCC 



AGGACAGCAA ATTACCGAGA TCATTATAAT 60 

CAAACAATGA AATATCTTAT CAGACTCTTC 120 

GGCTGTACAC ACGAGGAGCT CTCTATTTGC 180 

GAGACCGGTA AAGCCCCAAA TGCTCGTGCC 24 0 

AATAAAGTAG GCTCCATTTC TGTGCTCTTC 300 

AAGTCTACAG ACTATCAAAT CCATGAAGGG 360 

CGACCAGTAT TCAATGGCAA CAACAACTTC 420 

AATGCTCCGG CCACAGAAGC TGCGCTTTCT 4 80 

TCTTCTACGA CAGCCCCTGC CGATTTCGTA 54 0 

ATGGCTACGA CAGAAGGGAA ACTGTTGGGG 600 

ATTCGCATGA T7\AAACCCAC CATCAATGTG 6 60 

GCAAAGTTTC GCAATTCGGT AACGAAGGGG 720 

GCTGCTGCAT CCTATAAGAC ATCGG7\ATAT 780 

TCTATCCATT TCTATTCTTA CTATAACAAA 84 0 

GAATTCTTCA TCATGGTCAA ATTCAAAAAG 900 

TACTACTACA GAGTGCCCCT CGAATCTCAG 960 

AATCTGAATG TGAAAATCGA AATCTTGGGT 102 0 

AACGGCACAC TCGCAATAGA AGAATGGATT 1080 

ACCAATTACT TGATAGTGGA ACAGCACGAA 114 0 

GTGAAATATC AAACTTCGCA GAAACCAATC 1200 

TACGTCTCTT CTGATGGCAC TCAGCACAAT 1260 

CCTACGATTA CAAGCGATAA TACAAGCATC 1320 

GTACCAAAGA AGATCGTTTT TGAGGTAACT 1380 

ACAGTACTCC AATATCCTGC ACAATTTATT 14 40 

AGACCAGACG GATCTTTGGC TCCGGGGCTT 1500 

CTGGTTCCAC CCGAGAATTT ATTTGAAGAT 1560 

ACTGAAACAA TTTCTTTTCA TTIAGAAAGAG 1620 
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15 



20 



25 



30 



50 



65 



AACAATACCT ATCCGATAGT ATGGTCTGAC ACAAATACGA CAAAACAGGA CCTTGAGACA 168 0 
TCAAGAATGA TTTCACCTTC CTTTGAGTTA GCCTCCCAAC TTGGGGCTAC TCTCCCGATG 

CCCTATCTCG AGTATTGGCC AGGGACATCA TATCTCCTTG ACTATTCGGG AAACTATAAT 1800 

AATAAGAGAT ACGCCTTGTT TAATTGCGCT TTTTACTGGG AGAAAAGAAA AGTTAATAAC IQr.Q 

GAAGAAATTA AATTCGATGA CTGGCGTTTG CCGACAGAAG CTGAGATCAA ATTGATAGAT 1920 

AAGCTGCAAC ATAATGAGCA GAGTGCTGTC CAAGCTATCA TGACAGGGAA TTATTATTGG 198 0 

GATAGTTACT CTGCAAATGG GTCTTATAAA ATGCAAGGAG GAGGGGGCCA AGGAAATTCC 204 0 

TCCA.AAGCCT ATGTTCGTTG CGTGCGGGAT GTGAAAAAGC CGATTCGTGA CAAGAAGTCA 2100 

GGTAAG 2106 



(2) IIJFORMATION FOR SEQ ID IJO:74 

(i) SEQUENCE CHARACTERISTICS: 

(Al LEIIGTH: 3936 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDIIESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: UO 

(iv) AIITI-SEIJSE: HO 

{vi} ORIGINAL SOURCE: 

(A) ORGAIJISM: PORYPHYROI lOIIAS GIIIGTVALIS 

(ix) FEATURE: 

(A) IWIE/KEY: misc_feature 

(B) LOCATION 1 . . .3 936 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7^ 

35 AAGAGGGTAG ATAGCTATGA ATGCAGACAA AAAGCTTGCA 7\ATGTGCAAT TTGTGTGATA 6n 

CAAAAATTTA CTAATGTAAA ACTAAATGAT ATGCG7WW\ TTTTGAGCTT TTTGATGATG 12 n 

TGCTCTCTGC ATTTAGGTCT ACAATCTCAG ACTTGGCATG GAGATCCGGA CTCAGTGGCA 180 

GCCCTACCTT CTATCGGTAT TCAAGAGTCA AGTTGTACCC GAATCACGTT CGAGGTTGTT 2^10 

TTCCCCGGAT TTTATAGTGT GG/^AAAACGA GAAGGCAACC AAGTCTTTCA GCGCATTTCC 30 0 

4U ATGCCGGGTT GTGGCTCGTT TGGGAATCTG GGCGAAGCTG AATTGCCTGT TTTGAAAAAG 360 

ATGATAGCCG TTCCGGAATT TTCAACAGCT AACGTTGCTG TAAAAATCAA AGAGACGGAG 4 20 

ACATTCGACA ATTATAATAT CTATCCTAAT CCTACCTATG TCGTAGAGGA GTTGCCTGAG 4 80 

GGGGGGACTT ATCTGGTAGA GGCTTTCGCG ATAAACAATG ACTATTATAG CC7W\ATGTA 54 0 

AGCCTCCCTT CTACTCACTA TGTCTATTCT CAAGACGGGT ATTTTCGCTC ACAAAGATTT 600 

4b ATCGAAGTTA CCCTGTATCC TTTTCGATAC AACCCTGTCC GACAAGAAAT TCTATTTGCA 660 

AAAAAAATCG AGGTTACAAT AACTTTCGAT AATCCTCAGC CACCTTTACA AAAAAACACC 7 20 

GGCATATTTA ACAAAGTAGC CTCCTCTGCA TTTATTAATT ATGAAGCTGA TGGCAAATCG 7 8n 

GCGATAGAAA ATGATATGGT GTTCAGTCGT GGTACAACAA CGTACATAAG CGG.AAATGTT 84 0 

GCCAGCAACC TCCCTCAGAA CTGTGACTAC TTGGTTATTT ACGATGATAT GTTCAACGTA 9 00 

AATCAACAAC CACACGACGA AATCAAACGG CTGTGCGAAC ATAGAGCCTT CTACAACGGC 960 

TTTGATGTAG CTGCTGTAAG TATAAAGGAC GTATTGAATA GCTTCCCATC AAATGCCACC 102 0 

TCATACATCA ACGAT^ACTAA ACTGAAAAAT TTCATTCGCT CAGTTTACAA CCAAAGCAAT lOSn 

GCGAAGAGGA CTTTAGATGG CAAACTGGGA TACGTGCTAC TGATCGGAAA ACCATTGAGC 114 0 

AAATATTTGG CTGACACTGA TMTACAAAA GTCCCAACCT CTTTTATTCA TAATGTCTCC 12 00 

OO TTAATTCCAA GTCATCCAAC TTTTGGTTCC ATATGCGCCT CCGACTATTT TTTTAGTTGT 12 60 

GTTTCGCCCC TTGATACTGT CGGCGATTTG TTTATCGGTC GATTTAGCGT CACCAATGCT 132 0 

CATGAATTGC ACAATCTGAT TGAAAAGACT ATCAACAAAG AAATCTCATA TAATCCTATT 13 80 

GCACACAAAA ATATTCTTTA CGCAGAAGGG AAAGGCTGCG ATGCTCCAAT CTTACGTTTA 14 40 

TTCTTAAAAG AAATCGCCTC TGGTTACACA GTCAACTCTA TCTTAAAATC TAATCAGGTC 1500 

DU TCTGCAATAG ACTCGATATT TGACTGCTTG AATAATGGTT CCCATCATTT TTATTTTAAC 1560 

ACTCATGGAA TGCCGACTGT TTGGGGGATA GGGCAGGGAC TCGACGTCAA TACTCTAACA 162 0 

GCCCGATTGA ACAATACATC TTCGCAGGGA TTATGTACGA GTCTATCATG TAGTTCGGCT 1680 

GTAGCAGATT CAACTATTAG ATCGCTTGGA GAAGTCCTGA CCACATACGC ACCTAACAAG 17 4 n 

GGATTCTCGG CTTTCTTAGG AGG;^:^GCAGA GCCACCCAAT ATGCCGTTTA TTTAGAAGGC 18 00 

CCCTGTCCTC CGTCAGAATT TTATGAATAT TTACCTTATT CTTTATATCA CAATCTCTCG 18 60 

ACTGTTGTTG GCGAAATGTT GCTATCATCC ATTATCAATA CTAATTCTGT TGATACGTAT 1Q2(-. 

TCGA-AMTCA ACTTCAATTT GCTTGGCGAC CCTGCACTAA ACATTATGGC TCATGGCATG 1^80 

GAGGTTAGTA ATTGTATTAC ACTACCAAAC AACACCATTA T/Qj^GCAGTCC GATAACAATA 2 04 0 

AAAAATGGTG GCTGCCTAAA. AATACCGGAA .A.AAGGAGTTT TGCATTTTAC TAATAATGGC 210^' 

/U TCCATACAAG TCATGTCCGG AGGAACTCTG GAAATAGGCA ATCAGGCTAA AATATCCGGA 216r. 

GAGACCGGTG CTAACCCCAC CTTTATTACC GTTTACGGCG ATGGTCTTGC GATTAACAAG 22 2 0 

CAGGTAGAGA TAGACAATAT AGACCGACTT AACTTGTTTT CTACGCATTC GGTCATGCCC 2^80 

AAATTTCATT TTGACAGTGT GAAATTCAAC AGTGCCCCGC TGTATACAAC GAACTGTATT 2 34 0 

GTGGAGATAA GCAATTGCGA ATTTACCAAT CGPJKGTGhCP, TTATTTCAAA GAATTGTGAC 2 4 00 

/O CTAAGCGTTG AAAACAGTAT GTTTAGCAGT TCGGGGATAA CGGTATTCAA GCCTATGGCT 2 4 60 
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ACAAGCTCCA TCACCGGATT ATCTACAAAA 
ACAGGAAACT TCGCCTACCA TATCACAAAC 
GCCATC7\AGT TAGACAATAT TCCTGAGTAT 
GATGAGGCTC TTGTACT7W\ TAATAGTGGC 
5 CGGAATGTGA TAAAAAACTG TAGGATTGGG 
AACCGAAATA AGATCAGTAA CAATCATATA 
TATTTCGATA ATGCTCCTGT AATCAATGAA 
ACTTGGCAGC TCTATTCATC AAACGGTACA 
CAGGGGGGAG ATACAGATAC ATGGATTTAC 

10 GTTTCAAATA ATCACTGGGG CAACAATGAT 
CCAGACTTGT TCATTTGGAT ACCTTTTTGG 
AGCTCTGCTG AAGCAGTAGA ATTCCAAACA 
CTTTCGGCAA AAGTGGCTCT CAAGATGATG 
ATAGCTGCTT TGAAGGAATT GTTCAGGATA 

15 TTGAAAGATT ATTTCAGATC CAATCCAACC 
GCTGATTTCC TGTCTGCGCG ATGCGATATT 
TGGTACGAAA ATCGCTTGAA TAGTGA7\ATC 
GACCTTGGTG ACATTTATTG GAATATGCAG 
TTGAACATAC TTTCCTGTGA ACAAAGGAAA 

20 TATTTGTTGT CAACTCTTCC CGAATCAACA 
AAATCAAGCC TTGATAAATC CAAGATAATC 
GTAACAATAA TCTACTATAC CGATAACCCT 
AATGGAGCCT CGGCTGATAT AACCGGGTTG 
ATACAGTTCA ATACATCCAA CTTTGATCCC 

25 CAGAAAATTA TAGATACGGA AAAATTACGA 



GCAAAGATTA CCGACAATAC TTTTTTTGCG 2520 

ACGCCAGGCT TAACAGCAAC CTCCAATGCT 2 580 

TACATTTCCG GTAATA7\AAT AGTCAATTGC 2640 

AACAGAACGA ACAGACTCCA CAATATCACA 27 00 

AGCACGCTTT ATAATTCCTA TGGTATTTAC 27 60 

GGAGTACGTC TCCTCAACAA CAGTTGTTTT 2 8 20 

GAAGATAAGC AGACGTTTAT TTCTAATAGG 2 8 80 

TTCCCTCTCA ACTTCCATTA CAACAGCTTG 2 9^0 

AACGACACGT ATACGAATCG CTATATTGAC 3000 

TTGTTTGATC CGAATCAGGT TTTCAATACG 3060 

GATGGATTGC CAAATGGGAG ATCGGGCAAT 3120 

GCATTGGACT GTATTGGCAA TAGCGATTAT 3180 

GTTGAAACCT ACCCGGAATC CGACTTTGCA 32*40 

GAGAAAA.TGT CAGGCAACGA TTACGAAGGC 33 00 

ATCATCTCTT CCCAGAACTT GTTCCCGACA 3360 

GTGTGTGAAA ACTATCAGTC TGCCATCGAT 3*120 

TCCTATCAGG ACAGTGTTTT TGCAGTCATT 34 80 

TTAGACTCAC TCAGAGGGAC TGGTATAGAT 35^0 

TCGCTCGAAA GCCATCAAAA TGTAAAAAAT 3600 

GGTACTCTCC TGCCTCCATT AGAATGCAAC 3660 

TCTATTTCGC CCAATCCGGC GAAAGCTGTT 37 20 

TCCTGTTCTG TAATAAAAAT ATATGGAATA 37 80 

CCCAAACATC TATCCGAAGG TTATTACAGC 38 4 0 

GGTTTCTACC TGGTAACGCT AAATGTTGAT 3 900 

ATCAAA 3 936 



(2) IMFORMATIOW FOR SEQ ID HO: 7 5 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281^ base pairs 

(B) TYPE: nucleic acid 

(C) STRAIfDEDNESS : double 

(D) TOPOLOGY: circular 

35 

(11) MOLECULE TYPE: DHA (genomic) 
(iii) HYPOTHETICAL: I^JO 
40 (Iv) AIJTI -SENSE: HO 



(vi) ORIGIHAL SOURCE: 

(A) ORGANISM: PORY PHYROIdONAS Glt-IGIVALIS 

45 (ix) FEATURE: 

(A) HAI'-IE/KEY: misc_feature 

( B) LOCATION 1 ... 2814 



(xi) SEQUENCE DESCRIPTION: SI 

50 

TCAGAAAATT ATAGATACGG AA7W\TTACG 
ATTGTTTTTA GAGCATTTCT AACGATTTTG 
GCTCAAGAGA TCTCAGGCAT GAATGCATCC 
ATCTTATATG AAAGTTTTGA GAATGGACCT 

55 GATGCTGATG GTGCCACTTG GGGAAGCCCA 
AATGGCCTTT GCACCTACTC CCATATACGT 
ATTACACCCA ATATAGAAGG AGCCAAACGG 
ACCAATCCGG AACATTACGC AGTAATGGTA 
GTTTTGTTGT TTGATGATTC CATAACAGGG 

60 ATCGTGGACT TACCGGAAGG GACCAAATAT 
TCACACACAG AATTCTTGAA ATTGGATGAT 
GAACCTGCTA CCGACTTCAC AGTAATCAAT 
AACTATCCGG AGGATTATCA ACCGGAAGGA 
TACAACATCT ATGCGAACGG TACACTACTG 

65 TATGTGGACA GCACTTACTC TTTGCGAGAC 
GCCGTTTACG ATGAAAGCAT AGAATCTTCG 
GATGCCATCC TTTATGAAAA TTTTGAGAAT 
GACGCTGATG GAGATGGATT TAGCTGGGGA 
GGCCATAATG GAGGCCATTG CTCCTTGTCG 

70 ACTCCCGACA ACTATCTGAT TACCCCCAAG 
GTAAGCACGC AGGATGCCAA TTGGGCAGCG 
GGGACTGCTG TCGGAGATTT CGTCATATTG 
GGCGCATGGT ATGAAAGAAC CATCAACTTA 
CATTACAACT GTACCGATAT ATATTTCTTG 

75 CCTGCATCAG AGCCCGAACC TGTTACCGAT 



ZQ ID NO:75 

AATCAAATAA TGGCTATCAT GATGAAAAGT 60 

CTCTCGTGGG CAGCGATCAC GAATCCGACT 120 

TGTCTGGCTG CTCCGGCTCA ACCGGATACT 180 

GTTCCCAATG GCTGGCTTGA GATAGATGCT 24 0 

TCAGGCTCTT TCTCTGTACC TTACGGACAC 300 

TCCGGTATCT CAACAGCGGG CAACTATCTG 360 

GTCAAGTACT GGGTATGCAA TCAGTATAGT 4 20 

TCGACAACGG GGACTGCCAT TGAAGACTTT 4Q0 

AAACCGACTC CTCTTGTATG GCGTAGACGA 54 0 

ATTGCATGGC GACATTACAA AGTCACCGAC 600 

GTCACTGTGT ATAGGTCGAT CGAAGGGCCC 660 

ATTGGTCAGA ATGTGGGACG ATTGACTTGG 7 20 

AAGGGGAATG AAGAGTTGCA GCTTAGCGGC 7 80 

GCACAAATAA AAGATGTCTC CATACTGGAG 84 0 

AATCCCTTGC AAGTGGAGTA CTGCGTTACA 900 

ACCGTATGTG GCACGCTGCA TTACGCCACG 960 

GGACCTGTTC CCAATGGTTG GCTTGTGATA 1020 

CACTATTTGA ATGCATACGA CGCTTTTCCC 1080 

GCTTCTTATG TTCCGGGTAT AGGCCCGGTG 114 0 

GTTGAAGGAG CCAAACGTGT CAAGTACTGG 12 00 

GAACATTACG CGGTGATGGC TTCGACAACG 12 60 

TTCGAAGAAA CCATGACAGC GAAGCCGACC 132 0 

CCTGAAGGGA CTAAATACAT CGCATGGCGG 1380 

AAGTTGGACG ATATCACTGT ATTCGGGACT 144 0 

TTCGTTGTCT CGCTTATTGA AAACAACAAG 1500 
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GGACGATT7\A AGTGGAATTA TCCTAACGGC 
CCATTGCAGC TTGCCGGCTA CAATATCTAT 
GACCCGACTG TTTTGGAGTA TATCGATGAG 
GTGGAATATT GTGTCACTGC CGTTTATAAC 
5 AAGCTGATTT ATGATTCTCA ATCGGACATT 

ATTCCTGAAG GCTGGTTGTT GATTGATGCT 
CCTTGGACTA TGTATGGACA TGACAGTGAG 
ATGATTGGCG TTTTAACTCC GGATAACTAT 
CTTGTCAAGT ATTGGGTAAG TGCGCAAGAT 

10 ATGGTTTCTA CTACGGGAAC TGCTGTTGAA 

ACCGCTAAGG CTAACGGTGC ATGGTATGAG 
TATATTGCCT GGCGGCATTA TGATTGCACC 
ACGGTTTATC GTTCTACTGA GACTGTTCCC 
ATTGAGAATA ACAAGGGTCG CCTGAAATGG 

15 ACTGATGATA AAAAACCATT GCAGCTTACC 
CTTGTTCACA TACAAGACCC GACTGTTTTG 
GACGGTCAGG TGGAAATGGA ATATTGTGTC 
CAATCGGTTT GCGATAAGCT GAACTATACT 
ACAAGCTTGA AAATATATCC TAATCCGGCA 

20 CGGAGCAAGT CGACAATCGA GTTGTATAAT 
ACTCATTCAG AGAAAACGGA AATCGATGTT 
AAAGTAGTCG GTGGAAATAA AACAACAACC 



TACGAACCCG ATAAGACTGA TGATAAAGAC 1560 

GCAAACGGCT CGCTCCTTGT TCACATACAA 162 0 

ACTTATTCTT CACGAGACGA TCAGGTGGAA 168 0 

GACAATATCG AGTCCCAATC GGTTTGCGAT 17 4 0 

ATCTTATATG AAGGCTTTGA GGCCGGAAGT 1800 

GATGGCGACA ATGTTAATTG GGACTATTAT 18 60 

AAGTGTATTG CATCCCCTTC GTACTTACCG 192 0 

TTGGTTACAC CCAGACTCGA AGGAGCCAAG 198 0 

GCTGTTTATT CGGCTGAGCA TTATGCTGTG 20-10 

GATTTTGTCC TCTTGTTCGA AGAGACAATG 2100 

CGAACTATTA CATTGCCTGC AGGAACAAAA 2160 

GATATGTTTT TCTTGCTCTT GGATGACATT 2 22 0 

GAGCCTGTTA CTGATTTCGT TGTCTCGCTT 2280 

AATTATCCTA ACGGCTACGA ACCCGATAAG 234 0 

GGCTACAACA TCTATGCAAA TGGCTCGCTC 2 4 00 

GAGTATATCG ATGAGACTTA TTCTTCACGA 24 60 

ACTGCCGTTT ATAACGACAA TATCGAGTCC 2520 

ATCACATCCT TGGATAATAT TCAATCTGAT 2580 

TCGTATGTGG TAAGGATAGA GGGATTGAGT 2 64 0 

GCGCTGGGAA TTTGCATATT AAGGGAAGAG 2 700 

TCACGTCTCA ATGACGGAGT CTACTTGATT 27 60 

GAAAAGGTAG AGATAAAGAG GCCG 2 814 



25 (2) INFORMATION FOR SEQ ID NO: 76 



30 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1818 base pair^ 
(E) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DMA (genomic) 
35 (iii) HYPOTHETICAL: NO 

(iv) AHTI -SENSE: NO 



( vi ) ORIGINAL SOURCE : 

(A) ORGA]]ISM: PORY PHYROHONAS GINGIVALIS 

(ix) FEATURE: 

(A) NA14E/KEY: misc_feature 

(B) LOCATION 1-..1818 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76 



AT/VATCTTCT GTACGATTCA TCACTCTGAG TTGGAAATAA TGAACAGCAT CATGAAATAT 60 

CAATTATATA CGGCCGTCAT AATGGCTCTC TCTGTATCAT CCGTTTGCGG TCAAACCCCA 12 0 

OU CG/VAATACAG AAACCAAACG CCCCGACACG CTGCGCAGGG AGCTTACTAT CGTTAATGAC 180 

CAGAGTGTGG AGATGGAGCA TGCGGATCCG CTTCCGGCTG CATACAAGGC CATCGAACCT 24 0 

CGATTAAAAC CTTTCCGTCC GGAATATAAC AAGCGTACAT TCGGATTTGT CCCTGAAGTT 300 

TCCTCTTCAG GCAGGAACAA TCTTCCGAAT ATCCTGCCGA CGGAAGGTCA TATGAAGCAC 3 60 

CGGGGGTACC TGAATATCGG TATCGGCCAT ACGCTA.AACC AGCGAATGGA TGCCGGCTAT 4 20 

Ob CGTCTGATAG ATGCAGAGCA GGAGAGACTG AATCTTTTCC TCTCCTATCG TGGGATGAAA 4 80 

TCGGCTTTCA ATACCGGTGA CTTCGACGGC GACAGAAAGG ATAGACGAAT GATGGCAGGA 54 0 

GTGGACTACG AGCAGCGCAG GCCTTCCTTT GTGCTTGCTA CCGGCTTGTA TTATTCGAAC 600 

CATTATTTCA ATAACTACGG ACGGGGAGCT ACCACCAATG TGGGCAGCAT CCCTCAGCTA 660 

TCGACACCTG TTACTCCTCA GATGGACAAC GGGACCCACA ACGTCCGTGT ATACTTGGGT 7 20 

uU GCAAAAAATG ATGTGATCGA TGCCAGGATC GACTATCGTT TCTTCCGTTC TATTCCCTAT 7 80 

CTGGGTACCG ATCCGATGAA GGCTCTCACA GAACATACGC CTGAACTGAA CGTGACGATG 84 0 

AGTAATGAGT TGTCCGATGA TATTAAGCTC GGTGTCGAAG TTCGTACGGG AGGATTGTTT 900 

TTTGCCAA/\A ACAGCGAAAT GATTCA.AACG GGCGTTCTGT CCGAAACCGA CCGCAACCTG 960 

TATTATGTGG AGGGCGCGCC CACAATCGGA TTTGTCGGAG ACTCGGACAA TATGCAATGG 102 0 

DO AACATACAGG CCGGAGTAGG GATTTCTTCC CATTTCGGAG CCAAAGGGAG GTTGTTTTTC 1080 

TGGCCTAAAC TGGATGCTTC GCTTAGTATC TTCCCTTCAT GGCGTGTGTA TGCGAAAGCC 114 0 

TTCGGCGGTG TGATTCGAAA TGGTCTCGCC GATGTTATGC AAGAGGAGAT GCCCTACCTG 12^0 

ATGCCCAATA CGATTGTACT CCCTTCGCGC AATGCTTTGA CCGCCCAATT AGGGGTGAAG 12 60 

GGGAATATAG CCGATGTGGT ACGTATGGAG GTTTATGGCG ACTTCTCCAA GCTGACAGGT 13^0 

/U GTGCCTTTCT ATACTCCGAC TCTACCCTTA TATAATCCAT CCGACTTGTA TCAGTATA.i\T 138n 

GTGAGTTTCT TGCCGATATA TGCCGACGGC AGCCGCTGGC GCGCAGGTGG TAAGCTGGAA 14 40 

TACTCTTATC GCGATATGCT CCGCTTTCTG GTAGACGCAT CCTATGGCAA GTGGAATTTG 1500 

GATGGAGGAC TTGTCGCCTC CATGCAGCCC GATCTTATAT TGAAGGCAGA AGTAGGTGTT 1560 

CATCCCATTG CCCCATTGGA TGTCAGACTC CGGTATACAC AGCTGAACGG ACGGTATCGG 1620 

/O TATTCTTTCG GCTCGGCTGG CTCGGAAGCC TTGGGTATCG GTAATGTACA TCTTCTTAGT 3 680 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 



53 /490 



GCGGATGTTT CATACAAGCT GAAAAAGAAC TTGAGCCTTT ATCTCAAAAT CGATAATATG 
CTGGCGGAAA CGACAGAGCT TATCGGTTAT TATCCTATGC AGCCGTTCCA TTGTTTCGCC 
GGTTTTAGCT GGACTTTC 



(2) IHFORHATIOM FOR SEQ ID MO: 7 7 

(i) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 1071 base pairs 

(B) TYPE: nucleic acid 

(C) STRAfJDEDMESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

iiv) AlITI- SENSE: HO 

(vi) ORIGIHAL SOURCE: 

(A) ORGAJJISH: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAJ IE/KEY: mi sc_f ea t ur e 
{3} LOCATION 1...1071 

SEQUEMCE DESCRIPTION: SEQ ID NO: 77 

AACTCCCATG CAACAGTCTC CGTTATATGT TCAATGATGG AAAAATGTAT CTTTGCTCAC 
TATCCACATA ACCTTGTGTT CATGATTCGC AAGCATTTCG GTATCATTTT GGGATTTCTT 
TCTCTTGTGT TTTCGGCAGG TGCTCAACAA GAGAAGCAGG TGTTTCATTT TCTGAACCTT 
CCGGCTACTG CACAGGCTTT GGCTGCCGGA GGCAAAGCTA TCACCATCGT AGACGACAAT 
CCCGGACTGG CTTTTGAGT^lA TCCGGCTCTG CTCGGATATG AATCCGGTGG CCGCGCCTTT 
CTTTCCTATT TATATTATAT GAGTGGTTCG CATATGGGCA ATGCCTGTTA TGCCTCGTCC 
GTCGGAGAGC GTGGCATGTG GGGTGTTGGC ATGCGTTTCC TGAACTACGG GTCTATGCAA 
GGATACGATC AGAATGCGAT TGCCACCGGC TCTTTTAGTG CTTCGGATAT AGCTGTACAA 
GGATTTTACA GCCATGAACT GAGCAACCAC TTCCGCGGTG GAGTCAGCCT AAAAGCATTG 
TATTCTTCTA TCGAGACGTA TAGTTCCTTT GGCCTTGGTG TGGATGTCGG TATCAGTTAT 
TACGACGATG ACAAAGGATA TTCCGCTTCC GCTCTGTTCA AGAACGTAGG GGCGCAACTG 
AAAGGCTATA ATGAAGAACG GGAACCGCTC GATTGGGATT TCCAGCTCGG CTTTTCCCGC 
AGTTTTATCA ATGCTCCGTT TCGCTTGCAC ATCACGTTGT TCAATCTGAA TCCGCACTAT 
TTCAAGCGTC TTGTACCACG CGATCTGTCC AAGATGCAAA AGTTCCTCCG ACACTTCTCG 
ATAGGAGCAG AATTTACTCC TTCCGAGAGG TTTTGGGTCG GGCTGGGATA TACGCCACAG 
ATTGCACAGG ATTTCGAGGT GGAAGGCGGC AACAAATGGG GAGGTCTTTC GGCCGGCGTC 
GGTTTCACTT CAGGTGTAGT ACGTGTAGGC GTATCTGCTG CCACCTATCA TCCTGCAGCT 
CTTTCGTTCA TGTGTTCGGT AGGTATCCGT TTGGACGATA AGAGCATCTT C 



(2) INFORMATION FOR SEQ ID HO: 78 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1011 base pairs 

(B) TYPE: nucleic acid 
{C) STRAMDEDNESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(vi) ORIGIHAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 ... 1011 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 

CCTCAGCCCG TCGGCCTTAA AGAAATAACC ATTAAACCCA TGTGCCTCGA ACCCATAATT 

GCTCCGATTT CATCCGAGTT GCTCGAGCAG GAGCTGACTG CCGATCGTTT TCTGCGGATG 

ACAAACAAAG CCGGCAATGA GATCTATGTT TTTACGGCCG AAGAAGCTCC GCATTGCATG 

AAAGAAGTAG GCCGACTGCG AGAAGAAGCC TTTCGGCATT ATGGCGGAGG TACTGGCAA.G 
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GCGATCGATA TAGACGAGTT CGACACCATG 
GATCCGCAAA ACAAGGCTAT ACTCGGAGGC 
TTCGATACCG ATGGCAAGCC TTTGCTGGCA 
TTTTTGCACG ATTATCTCCC CTACACAGTC 
5 TACCAATCGA CACGGATGGG CACAAAGGCC 
ATCGGAGCAC TCACTGTAGT CAATCCAGAG 
TACAAAGACT ATGATCGGCG AGCTCGCAAT 
TCCGATCCGG AAGGCTTGGT CAAGCCTATT 
GACGAAGCCT TGTTCTCCTC ATCCGACTTT 
10 GTGCGCAAGC TGGGTATCAA TATCCCTCCT 
GAGATGCGTG TTTTCGGCAC TGCAGTGAAT 
ATATTCATTG CTGTGGGTAA GATCCTGGAA 
ATCCTCAGCC GGAACGAAAA AAAAGGTCTC 

15 

(2) IMFORMATIOH FOR SEQ ID NO: 7 9 



CCCGGGAGCT ACAAACAGCT GATCGTATGG 300 

TACCGCTTTA TCTATGGGGG GGACGTTGCT 360 

ACGGCAGAGA TGTTTCGCTT CAGTGATGCT ^20 

GAATTGGGAC GTTCGTTCGT GTCGCTCCAG 4 80 

ATTTTTGTGC TGGACAATCT TTGGGACGGT 5^0 

GCACTCTATT TCTATGGCAA GGTGACCATG 600 

CTGATCCTGT ATTTTCTTCG CAAGCACTTC 660 

CATCCCCTAC CGATAGAGAT CAGTGCGGAG 720 

GACACCAATT ACAAGACTCT CAATATAGAA 7 80 

CTCGTGAGTG CATATATAGC TTTGTCTCCG 84 0 

GAGTCTTTCG GAGAGGTGGA GGAAACCGGC 900 

GAGAAAAAAC AACGGCACAT AGAGAGCTTC 960 

GACAGTAGCA ATGGCCGATC A 1011 



(i) SKQUEMCE CHARACTERISTICS: 

(A) LENGTH: 1698 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRAIIDEDNESS: double 

(D) TOPOLOGY: circular 



25 



(ii ) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) AIITI-SEIISE: NO 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROi-IONAS GINGIVALIS 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
35 (B) LOCATION l...i698 



(xi) SEQUENCE DESCRIPTION: SEQ ID HO:79 



TACGACGGAG CGAGACTGGT CTATACCCTA 

40 AAAACCATTG TAAGATACAG CCGCCTTCCG 
GTGCACTTGT CTGTCGAGGC GCAGATGCTC 
GTCCTGTCTC CCACCGAAAG GCAATACAGG 
GGGGCCGATC TTTTCCCGTT GAGCGATAAG 
TCGGCCTATG GCGATATTGC GGGCGACTAT 

45 CTCTCGCTCG AATCGGGTGG TCGCATCAGT 

GCTTCCTACT CACGTGGCAT GCACAAACGC 
GCCTACTATC CCTATTTGGT GTCCGATTCG 
CGGCTTGCCG GCTACTATTC TTTTCGCGCC 
TACAGGGGCG AAGTTGCTTA TCGGCTGACC 

50 TTGGAGCTTT CTTGTGCTAC CTCTTTGACG 
GCTGCGTATC TCTATCATAG ACAACACCTC 
GACAvAATTCT TCGTCAGCTA CGGTTTCGGT 
TTCGGTATCT CCAGAATGAA CTACGTCAAC 
CGTAGGGGCG ATGCCATCGG TCTCGACTAC 

55 TCGTCCATCA ATCTCTTTGC TTTGCTTTAC 
CTGTCGGACT TCGATTTTTC ATTTTCAGCC 
ATATACGAAG ACTACAAGCC GGATGATAAT 
ATTCGCCGCT GGTATATGCT CAATGAGTTT 
CGCACGGATA GAGGTTGTGC CCTGAGAGTG 

60 GAGACGTATC GCAAGCATGG ACATCATACC 
ATAGCCTATG ACCATGCCGG ATCCAAATTG 
CGAATGGTGG TGACGCATTC GTATAAGATT 
CAGCTGGCCT ATTTGCCCTA TGCCTATCGT 
CTGTACGTCT CGATTCCGAT GCAGAATACC 

65 GGCGACCTGA TGAAAAGAAA GGACGGTATA 

CATATCCTGT CCGATCCGCA AGCCGAACGA 
AATATCTCCT ACCTCTTC 



TTTCGGAATA GAAACGACAT TCACCCTATG 60 

GTCGCTCTCT TCTTTTGCCT TTTGGGAGCT 120 

AATACTCCTT TCGAGCTGTC GGATCAGATC 180 

GAGATTTGTG TGCAAACGAA AGAAAAAAGG 2 4 0 

CTGCGCGATT CGGCCTATGT TCGTTTCGGC 300 

CTTCCGTACA ACGGCAATAA CTACTCCTCG 360 

GTCCGTAACT ATGGCACATT GCAGGGCAGT 4 20 

ATCGGCTGGA ATGCTCTGCG CAACGCCGAA 4 80 

ACCGGCGGAG ACTATCATTT CGAAGACTAT 54 0 

GGCCGCTTGC CCCTCGGTAT AGGCTTCTCA 600 

GATCCGCGTA CGACCAATAC GACCGGTGCA 660 

CTGCCTCGAG AGAACAGGCT ATCGCTTTCG 7 20 

ACACAGTACA ACTGGCGTCC CGGGCAGCAG 780 

CAGGTGGATG TCAGCAACAG CCCTATCTGG 840 

GGATGGAAGC TTAGCTCCCG TCTGGATACC 900 

AGCGGCTACT TCCTCGATAC CGAaGAGAGG 960 

AATCGCCTGC GACTCTATGG TAGCTGGCAT 1020 

GACTATGCTC TGCGCCAAGG GATAGAGCGG 108 0 

TATCATATCT ACGACCTCCG TATCTTGGCC 114 0 

TCTGCCCAAG CCCAAGCCTC CTACCGTATT 12 00 

AGTGCCGGTA GTGATTTCTA CGGCTATGAT 12 60 

ATGAGCGGAA TGCTACGTCC TTTTGCCGGT 132 0 

GATTTTGGAC TTTCGCTTTC GGCTGCTTAT 138 0 

CGTACCATCC AGAAAGAGCA GCTCGACTAT 14 40 

AATAGAGAAG GCGTGGAGGT GCGTTCCTCT 1500 

CACCGCCTGA TGACAGAGCT GCGGTTGTAT 1560 

GCCTATGGCA AAACGCCCGG TGTCATCTCA 16 20 

ACGTCCGGCC ATACCATCGG GGCTATCTGC 1680 

1698 



70 (2} INFORMATION FOR SEQ ID NO: 80 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2457 base pairs 

(B) TYPE: nucleic acid 
75 to STRAIIDEDNESS: double 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(D) TOPOLOGY: circular 
(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AHTI-SEIISE: (JO 

(vi) ORIGII-IAL SOURCE: 

(A) ORGAIJISII: PORYPHYROMOIIAS GIHGIVALIS 

(ix) FEATURE: 

(A) HA1-1E/KEY: misc_feature 

(B) LOCATION 1...2 4 57 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 80 

AGGACGAACG TTTTCTTATC TTTGTCCCAT AAAATTGGGA GAAGGGGTGC TTCCTGTAGC 60 

AATCGGAATG CGTGGATGGC TGAGAACAAA CCCTCATCAC CTGAACCGGA TAATACCGGC 120 

GTAGGAAACT CTCCGTCTGA CTATCTTCAC GGCGAAGCAA TCATTCCCCC TCTCTCTTCT 180 

TTGTCCAACT TCAATGATAA GAGATTTATG AAAAAACTTC ACATGATTGC CGCCTTAGCC 2^0 

GTCCTGCCTT TCTGCCTGAC GGCACAAGCA CCCGTCTCCA ACAGCGAGAT AGATAGTCTT 300 

AGCAATGTGC AGCTCCAGAC CGTACAGGTC GTAGCTACTC GCGCCACGGC GAAAACCCCT 360 

GTCGCTTACA CCAACGTTCG CAAGGCCGAA CTTTCCAA.GT CCAATTATGG TCGTGACATC 420 

CCCTATCTGC TGATGCTGAC TCCCTCCGTG GTAGCCACCA GCGATGCCGG TACGGGTATC 480 

GGATATTCCG GCTTTCGCGT GCGTGGCACC GATGCCAATC GCATCAACAT AACTACCAAT 54 0 

GG AGTACCCC TCAACGACTC CGAATCTCAG TCCGTCTTTT GGGTGAATAT GCCCGACTTC 600 

GCCTCTTCCA TCGAAGACCT TCAGGTGCAG CGAGGTGTGG GTACTTCCAC CAATGGTGCC 660 

GGAGCTTTTG GGGCAAGTGT CAATATGCGT ACGGATAATT TGGGACTGGC TCCTTATGGC 720 

CGTGTCGATT TGAGCGGAGG TTCGTTCGGC ACATTCCGCC GATCGGTCAA ACTCGGTAGC 780 

GGACGCATCG GTCGCCATTG GGCAGTGGAT GCCCGCCTGT CCAAAATCGG TTCGGACGGC 84 0 

TACGTGGATA GAGGAAGCGT GGATCTGAAA TCCTATTTCG CACAGGTGGG CTATTTCGGT 900 

AGCAACACGG CTCTCAGGTT CATCACTTTC GGAGGAAAAG AAGTTACGGG TATCGCATGG 960 

/U\CGGTCTTT CCAAGGAGGA TGAAGCCAAA TATGGCCGCC GATACAACAG TGCCGGTCTT 1020 

ATGTACGTGG ACGCGCAAGG AGTACCGCAC TACTACCACA ATACCGACAA TTACGAGCAG 1080 

CGTCACTACC ATGCCATCAT GACGCACAGC TTCTCTCCTT CCGTTATCCT CAACCTCACG 114 0 

GCACACTACA CGGCCGGATA TGGCTATACG GACGAATATC GTACCGGACG TAAACTAAAG 1200 

GAATATGCAC TGCAGCCCTA TGTGGAAAAC AGTGTGACCG TGAAGAAAAC GGATCTCATC 1260 

CGTCAGAAGT ATCTGGACAA TGACTTCGGA GGACTCATCG GTTCGCTTAA CTGGCACACC 1320 

GGTGCATGGG ATTTGCAGTT CGGGGCCTCG GGCAATATCT ATAAAGGAGA CCACTTCGGC 1380 

CGTATCACTT ACATCAAAAA GTACAATCAG CCCTTAGCTC CCGACTTCGA ATATTATCGG 14 4 0 

AACAGGGCAG ACAAAAGAGA AGGTGCAGCC TTTGCCAAAG CCAACTGGCA GATCACTCCG 1500 

GAACTGAACA TGTATGCCGA CCTCCAGTAT CGTACCATCG GCTACACGAT AAACGGCATC 1560 

ACGGACGAAT ATGATGAGGT ACAGGGAAGT ATGCAGCACA TCGATTTGGA CAAGACCTTC 1620 

CGCTTCCTCA ATCCGAAGGC CGGTCTTACC TATAGTTTCG ACGATGCTCA TACTGCCTAT 168 0 

GCTTCTGTTG CGGTAGCACA CCGCGAGCCT AACAGAACCA ATTACACCGA AGCCGGAATA 174 0 

GGACAGTATC CTACGCCTGA GCGACTGATC GACTATGAGC TGGGCTACCG CTATGCTTCG 1800 

CCCCTCTTGT CGGCCGGAGT AGGTCTCTAT TATATGCAAT ACAAGGACCA ACTCGTGCTG 1860 

GATGGCCGTT TGAGCGATGT GGGACAGATG CTCACAAGCA ACGTCCCCGA CAGCTACCGT 1920 

ATGGGACTGG AGCTGACTCT CGGTTGGCAG ATCCTTCCTC GTTTGCTGCG TTGGGATGCT 1980 

TCTTTCACTA TGAGTCGCAA CAAAATCGAC CGCTACGTAC AATATACATC CGTATATGAT 204 0 

GCGGACTACA ACTGGCTCGA ACTCAAGGAG GAGACCCTCG AAAGCACGGA TATAGCCTAC 2100 

TCGCCCAATG TCATTGCCGG CAGCATGCTT ACCCTCTCTC ATGCCGGTTT CGAAATGGCT 2160 

TGGACGAGCC GCTTCGTCAG CAAGCAATAT CTGGACAATA CACAGCGCAG CGATCGCATG 2220 

CTTTCCTCCT ATTGGGTGAA CGACCTCCGC CTCGGCTATG TGCTGCCGGT TCACTTCGTT 2280 

AAGAGAGTGG CACTGGGCGT ACAGCTCAAT AATCTCTTCA ACCTCATGTA TGCGTCCAAT 2 34 0 

GCCTACATCT ACGATGCCGG TTACGTACAG GCATCCGGAG AACTAAGTGC ATATGCCGAT 2 4 00 

CTGCGTTATT ATCCTCAGGC CGGATTTAAT GCACTGGGTA GTCTGACAAT CGATTTC 2 4 57 

( 2 ) IN FORI IATI ON FOR SEQ I D IIO : 8 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1596 base pairs 

(B) TYPE: nucleic acid 

(C) 3TRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) IIOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: HO 
(vi) ORIGINAL SOURCE: 
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{A) ORGAIslISM: PORYPHYROMONAS GIMGIVALIS 

iix) FEATURE: 

(A) HAI^IE/KEY: misc_feature 
5 (B) LOCATION 1...15 96 

(xi} SEQUENCE DESCRIPTION: SEQ ID 110:61 



TACAGGCGAT CGGGTVAGGGT TTGTCCTCGT 

10 ACTCGGTTAC AGCCGAGAAG AGCGGTTCGA 
AGAAGGTTTC TATCGCTGTT ACTGCTGTAC 
CGGTTTCCGA TGGTGCAGGG AATCGAGTTG 
CGTCCTTGGC GCGCCATCGG TAAAACGATA 
CATTTCATCA TGAACGAGGA CTTTGCAGAC 

15 CAAACAGGCT TTGGCTGGGA CAATGACAAG 

CACGGATCGC TCTATTTCAA TGCAGCGAGG 
CCGTTTGCCT TCTTTGGCAG TCTCATGTGG 
ATCAACGACC TCTGTGCCAC CACCATAGGC 
CTGTCGGACC TGCTCATCGA CAATCGTACC 

20 GCTATCGCTC TGATCAATCC GATGCGCTTT 

TCTGTCGGGA GTCGCAGCGG ACAGATATTT 
GCCGGCTTTC GCTTTTTGGC AGACAAGCGG 
CTGAATCTGA GATTCGACTA CGGCGATCCA 
TTCTTCCAAT TCAAAGCCGG ATTGAGTTTC 

25 AATCTGATCG GAATCCTAAG CGGATGCCAA 

GGAGGTCTCT TTCAGCACTT CGACTACTAC 
GAGGAGGTAC TCGTCACCCC ATACCGTATC 
ATCTTCCAGC ACCACGGAAA ATTTCGACGA 
CTGAATGTCG TCCCGATGGG AGCCAGTCTG 

30 TATAACCTCG GCAGCGGATT GAGCGGCAAG 

TGGAGCTGGC TCTTGGGAGT CGAAAGCTAT 
CCGCACCAGA AAAATACCGA TGTCAGCTCT 
CGCCTACTGG TGACGAGTTC CGAGTTCGCA 
GTCGCTCGCC GTTTCATCCG CAAAACAGCC 

35 ACCGGCGACA TACAGCTGCG TGTCGGATTT 



ATCCTCCGGA ACAAACGTTC CTACGCAGAT 60 

CTACGAGCCG GCACCAGAAC AAAGATGAAA 120 

ATACTCTCTT CCATCAGCCT TTCTGCTCAG 180 

GATACCGATT CGCTTTTCTC TCTGCCCAAG 24 0 

GGCGTCAATC TGGCCGTATG GGGCTTCGAT 300 

ATCAGTTGGC AGACTATCAA GAGCAATTTC 360 

TTTGTCACCA ACCTCTTCGC ACATCCTTAT A 20 

TCGAACGGTT TGAGCTTCAG GCACTCTGCT 4 80 

GAGCTGCTTA TGGAAAACGA GCCACCGAGT 54 0 

GGTATAGCTT TGGGGGAGAT GGGGCACAGG 600 

ACAGGGTGGG AACGTATGGG GCGCGAGGTG 660 

CTCAACCGTC TGACAGCAGG AGAGGTGACT 720 

CAGTCTGTCC CCATAAACAT AGTCGTCGAT 780 

CATGCCCGAA CCGGTGCCAC GGCTCTGACC 84 0 

TTCCGAAGCG AGACTTTCTC TCCATACGAT 900 

TCCGAATCGC AACCTCTGCT GAGCCAGATC 960 

CTGCTCGCAC ACGAACGAAC GGTTTTGGTG 102 0 

AATTCGGAAA AACGAATAAG CAAAAATTCG 108 0 

TCGCAAGTGG CAGCTCTGGG AGGCGGTCTT 114 0 

CGTCCTCTGG AGCTATATGC CGAGACCTAC 12 00 

TCGGATCACT ACAACGTGGA CAATCGGGAC 1260 

CTATACCTTG GTGCTACGTA CAATGATCTG 1320 

CGGCTCTACA CATGGATCGG GTATGAAGAG 138 0 

TTTATGGTGC AGGGGGACGA AAGCAAGGCG 14 4 0 

TTTCATCCTG GCCCCTGGCA TGTAGCCATC 1500 

TATCAATTCT ACCCTAACGT ATCATTCGAT 156 0 

CACTTC 15 96 



(2) I M FORMAT lOM FOR SEQ ID MO: 82 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIIDEDNESS: double 

(D) TOPOLOGY: circular 

45 

(ii) MOLECULE TYPE: DNA {genomic) 
(iii) HYPOTHETICAL: NO 
50 (iv) AI-ITI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

55 (ix) FEATURE: 

(A} NAI'IE/KEY: misc_feature 
(B} LOCATION 1 ... 900 



(xij SEQUENCE DESCRIPTION: S! 

60 

AAAAGAAGAA AAAAACAGAT GAAACGACTG 
TTCACCCTTG CGAACGCACA AGAAGCAAAC 
ACTATAAAAG GTGTGACCGG ACTAAATGCC 
GGTGGAGAAA ACACGGTGGC AGGTAACCTC 

65 GATAAATGGA GTTGGGACAA CGGTTTGCGT 

AACAAGTGGA ACAAAAGTGT AGACAAGATC 
GGCAAACATT GGTACGGAAG TGCGCTTTTC 
GAGAAGCCCT CGGATCACTT GACAGGAGTC 
TATCTCACTC TCGGTATTGG TGCGGACTAT 

70 TCTCCTACAA CGGGCAAGCT GACTGTAGTA 
TTCGGGGTGA AAGTTGGTGA AAAGACAATG 
GCCAATATAA ATCTGATGGA GAATGTCAAT 
TATACGCACG ACTTTGGCAA CATTGACATC 
AACAAGTTCC TCACGGCTAC GATAGCCACC 

75 AACGATGGCC CGAAAATCCA GTTCAAAGAA 



i;Q ID NO: 8 2 

ATTGTTTTTC TGGCAATGGG TGGCTTGCTG 60 

ACTGCATCTG ACACTCCCAA AAAGGACTGG 12 0 

TCTCAGACTT CTCTGACCAA CTGGGCTGCC 180 

TATTTGAACA TAGATGCCT^ii CTACCTGAAA 24 0 

ACAGACTTCG GTCTGACCTA CACAACAGCC 300 

GAACTCTTCA CGAAGGCCGG CTATGAGATC 360 

ACTTTCCTCT CACAGTATGC CAAAGGATAT 4 20 

AAGCATATCT CTAATTTCTT CGCTCCTGCA 4 80 

AAGCCCAATG AGAAGTTCTC TCTCTACCTC 54 0 

GCAGACGACT ACCTCTCAAG TTTGGGAGCC 600 

TTCGAACTTG GTGCTTTGGT AGTGGGTTCG 660 

TTGATAACCA AGGCTTCATT CTTCTCGGCT 720 

AATTGGGAGG CTATGCTGGC CATGAAGATC 7 80 

AATCTTATCT ACGACGATGA TGTGAAGATC 84 0 

GTTGTGGGCG TGGGTGTTGC GTACACTTTC 900 
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(2) IIIFORMATIOH FOR SEQ ID NO: 8 3 



10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DIJA (genomic) 
(ili) HYPOTHETICAL: HO 
(iv) AIITI-SEMSE: NO 



20 



25 



30 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMOHAS GIHGIVALIS 

(ix) FEATURE: 

(A) HAi-IE/KEY: misc_feature 

(B) LOCATION 1. . -663 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 

ACGAGAGAGA GTGTGTTACA TTGTAGAACA AAACTCAAAA AAGTKACGAAA AATGAAGAAA 60 

ATGATTTTGG CAGCTACTAT GCTGCTCGCA ACAATCGGTT TTGCAAATGC TCAGAGTCGT 120 

CCTGCTCTTA GACTGGATGC TAACTTTGTC GGTAGTAACT TAATGCAAAA AGTCGCAAAC 18 0 

ACGAGCGTGA ACAATAAGAT GATCGTAGGC TTACGTGTTG GTGCTGCTGC TGAGTTCGCT 2 4 0 

CTTAGCAATG ATGGATTCTA TCTCGCCCCC GGATTGGCCT ATACGATGAG AGGTGCTAAG 300 

ATGGAATCAC TAAGTGAAAC GACAACTCGC TTGCATTATC TGCAAATACC GGTGAATGCC 360 

GGTATGAGAT TTAGCTTTGC TGACAACATG GCTATTTCAT TGGAAGCAGG TCCCTATTTC 4 20 

GCATATGGTG TCGCCGGAAC GATTAAGACT AAAGTTGCAG GCGTTACGGC TTCTGTAGAT 4 80 

GCCTTTGGTG ATAACGGATA TAACCGTTTC GACTTGGGCT TGGGCTTGTC TGCTGCCTTG 54 0 

AGCTACGACC GTTATTACGT ACAAATTGGA TATGAGCATG GATTGCTTAA TATGTTGAAG 600 

GATGCTCCGG ATAAGACTTC TTTGCGTAAT CATGACTTCT TTGTGGGTCT CGGTGTTCGC 660 

TTC 663 



40 (2) INFORMATION FOR SEQ ID HO: 84 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 4 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
50 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



55 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMOHAS GINGIVALIS 



60 



65 



70 



75 



(ix) FEATURE: 

(A) NAl^lE/KEY: misc_f eatur e 

(B) LOCATION 1 . . .74 4 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 84 



ATCAAACGAA 
ACTATGGCCA 
AATCAGAGCA 
GTCGGTGCCG 
AACTATACGA 
TATATTACGA 
CGGTTCGACC 
GGTATATATG 
GAGTTTTTTG 
ATAATAGCCG 
GTGGATATTG 
TCTTCGACGA 
TTCGTGGGCA 



TAGAAATGAA 
TCGGACAAAG 
TGAAAAGAGA 
CTGCCGAATT 
TGAAGGGCTC 
TGGTTTCCAC 
TGATGAATGA 
GTACATATCG 
GCCCAACGCT 
CAT TCC ACTA 
TGTCAGGTGG 
CCGCTCTAAG 
TAGGTTACCG 



AAGGATTTTT 
CCGCCCGGCA 
CGGATATGTG 
CATGATCGGA 
CAAAACCGAA 
TCGCTTGCAC 
CATGGCGGTT 
GCAGAAGTTG 
TGGTGGCCCA 
TAAGCGTTAT 
AGGTTCTGAT 
AGAAAAGGGA 
CTTT 



ACTGTAGCCC 
CTTCGCGTAG 
TGGGACACCA 
TCAAGAGGAT 
TGGGATATAC 
TATCTGCAAC 
TCGATCGAAG 
GAAGGATGGA 
ACAAATATCC 
TATATACAGA 
ATTCCCCGAC 
AATAACGAAT 



TTGTGCTACT 
ATGCCAACTT 
AAATGAATGT 
TCTACTTGGC 
CCGAAATGGT 
TGCCGATCAA 
CGGGTCCTTT 
AGCCGAACAA 
GCTGGGACAT 
TAGGCTATGA 
TGAACGACAA 
ACGCTTAT/^ 



TGCTTCGGTC 
CGTAGGCAGC 
CGGCCTGCGG 
TCCGGGTCTG 
TCCTGGTACC 
TGCCGGCATG 
CCTTGCATAC 
CTACAGCACA 
CGGGGCAAAC 
ACATGGATTT 
TAGGCAATCC 
TCGTGACTTC 



60 
120 
180 
240 
300 
360 
420 
480 
SAO 
600 
660 
720 
744 
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(2) INFORMATION FOR SEQ ID MO: 85 

5 (i) SEQUEUCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDHESS: double 

(D) TOPOLOGY: circular 

10 

(ii) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: MO 
15 (iv) AHTI-SEtJSE: MO 

(vi) ORIGINAL SOURCE: 

(A) ORGAMISll: PORY PHYROHOI iAS GIMGIVALIS 

20 (ix) FEATURE: 

(A} MAllE/KEY: misc feature 

(B) LOCATION 1...633 



25 



40 



50 



55 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:85 



AA7y\GAAAGA GTATGAAAAG AATGCTGCTG CTTCTCGTTG TATTATTATA TGGAATTGCA 6n 

GGCCGATTGG CTGCACAAGA CGTTATCAGA CCATGGTCAT TGCAGGTCGG AGCGGGATAC 12 0 

TCCGATACGG AGAACATCCC GGGAGGATTC ACCTATGGTT TCTATTTGGG AAAGCGTATG 18^ 

GGGAt^CTTTC TGGAAGTGGG GCTGTCCATG TACAACTCCA CACGTCAAAC AGCCAACAAT 2A0 

6U GCAGACTCCT TTGCATCGAA CGAAGGAGAC GGATCTTTTC AGGTAAATAT GTCTTCTCCG Sr.n 

AATGAGAAGT GGTCATTCTT CGATGCAGGC AGTGCCAACT GCTATATGAT CGTCGTCGGA 360 

GTCAATCCTC TCCATCTGTT TTGGCAGAAT AGCCGGCACA ATTTGTTTCT GGCAGTACAA 420 

GCCGGCCTGT CCAATAAGCA CAATATTCAT TTCATCTATG GAGACAAGGG AGCCAAAGTC 4 80 

AGTATCTACA CCAATTCGAA TACCTACATC GGTTACGGAG CACGTGTAGC CTACGAATAT 54 0 

CAAATTCATA AAAACGTGGG GGCGGGTGCC GCTGTAATGT ACGACCACGG CAATAAGATG 



600 



CTTACGGCCA TGGCCACGCT CTCCACTCAT TTT 63 3 

(2) IN FORMAT I OH FOR SEQ ID WO: 86 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2859 base pair^ 

(B) TYPE: nucleic acid 

(C) STRAMDEDHESS: double 
45 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: MO 

(iv) ANTI -SENSE: NO 

( vi ) ORIGINAL SOURCE : 

(A) ORGANISM: PORY PHYROMONAS GIMGIVALIS 

(ix) FEATURE: 

(A) NAI4E/KEY: mi£c_feature 

(B) LOCATIOIJ 1...2859 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 



ATCCGAATGA GAGTATCCGA TCTCTGTTCC AGACTTTCAT GGTTATTACC CGTAATCCTT 60 

GTCGGATTGC TCTGTGCTAC TTTGGTCGCT GCGGAACGTC CTATGGCCGG AGCAGTCGGA 120 

TTGCACCACC GTCGGCATGC TGCGCTGTCT GATTCTACAG CGAAAGACAC GGTGCCTCTC 18^ 

DD GCAAAACCTA TTCCTGACAG TGCTTTTCGA GATTCCCTTC CTGCCGATTC CACCGGATCG 24 0 

ATGCGGCAAG ATAGCGTGTA TGACGATGAA TTCGAATTGG AAGATATAGT GGAGTACGAA 300 

GCTGCCGATT CCATCGTTTT GCTCGGACAG AATCGTGCCT ATCTTTTCGG CAAGAGCTAT 360 

GTGAGCTATC AAAAGAGTCG CTTGGAGGCA AACTTCATGT ATCTCAATAC CGACAGCAGT 4 20 

ACGGTTTATA CTCGCTATGT CCTCGATACG GCCGGTTATC CGATGGCCTT TCCTGTTTTC 4 80 

/U AAGGATGGAG AGCAGTCGTT CGAAGCCAAG AACTTTACCT ACAACTTCCG CACGGAGAAG 54 0 

GGGATTATCA GCGGAGTGAT CACGCAGCAG GGCGAAGGCT ATCTGACTGC CGGTAAGACC 6r.n 

AAGAAGATGC CCGACAATAT CATGTTTATG CAAGGAGGGC GTTATACGAC CTGCGACAAT 660 

CACGATCATC CTCACTTCTA TATCAATCTT TCCAAGGCAA AGGTGCATCC GGAGAAAGAC 7:>0 

ATCGTCACAG GTCCGGTCAA TCTGGTTATC GCCGATATGC CGCTGCCGAT AGGTCTTC^T 7 8n 

/t> TTCGGCTATT TTCCCTTTTC CAACAAATAC TCTTCCGGTA TATTGATGCC CACGTACGGA 8 4r> 
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GAGGACAATC GCTATGGATT TTATTTGAGG 
TATATCGATT TGGCATTGCG TGGGGAGATC 
CAATCGAAAT ATAAGAAGAG GTATAAGTAC 
TCGAAGTCCG GCGACAAATA CGTGCCCGGA 
5 CGATGGACAC ACAGTCAGGA TCCGAAGGCC 
AATTTTGCCA CCGGGAGCTA TTTCCAGAAT 
CGTACTGCTA CGACACGAAG TTCGGCCGTG 
TTTTCGATTA CGGGTAGCAT GGATATCAGC 
ACCTTGCCGA ATCTTTCGAT TAATATGTCC 

10 GTAGGACCGG AGCGATGGTA CGAGAAGTTG 
AGTATCTTGA CAAAAGAGAA AGATTTGCTC 
GGTATGCGTC ATTCCGTACC GATCAGTTTG 
ACTATGGGGG TTAACTACAA TGAGTGGTGG 
GAGGATAAGA AAACATTCCT GCCTTCGGAC 

15 TACAGTCTGT CGGCAGGCTT ATCTACCACA 
TTTTCCTTCG GAGGCAATCT CATTATGATC 
TCCTATATGC CGGACTTCAC GAAACGCCGA 
GATCAGAACG GCAAGCTGCA TACGCTGCTC 
GCTCCCTCCA TGGGCAATGC AGGATCTGTC 

20 AAGATCAAAT CCAAATCGGA TTCGACAGGG 
ACATGGTCTA CATCCTATAA TATGTTTGCC 
TCGCTGGCAC TTCGCCTCTC CAAGAGCTTT 
TATTTGACGA AGTATTATGA GGGAGAAGAT 
CTGCGCATTT TTAACGGCAA GGGATTGGCA 

25 TATACGCTCA ACAAAGAGTC GCTCAGCGGA 

CGGAGAGATG AAAAGAAAAA CACAGGGGCT 
ATACTTGAGG GAGGAAGACC GCAAAATGAA 
CAGGGCGGAG CAGTGGATCA GGATGGTTAC 
TTCGACTATA GTTGGAATAT TGCTACCGAC 

30 CACTACTACC GGGTAACGCA GAATCTGAGC 
TGGAGCTTCG GATTCAATGC GAACTACAAT 
TGCAACGTCA CTCGCGACAT GCACTGCTGG 
GCATACAAGT CCTATAATTT CGTCATATCG 
TATCAGCAGA GCAATCGTCC CATCACGAAT 

35 



AATGGTGGAT ATTATTTTGC CTTCAGCGAC 900 

TTTTCCAAAG GGTCATGGGG CATTTCAGCC 960 

AACGGCTCGT TCGAAGCCAA TTATCTGGTA 1020 

GACTACAGCA AGACCACCAG TCTGAATATC 1080 

AATCCTTTGC AAACGTTGTC GGCCAATGTC 114 0 

TCGCTGAATA CCACCTATGA TGTCAATGCC 1200 

AGCTATTCGC GCAAGTTTCC GGGTACTCCT 1260 

CAGAACATGC GCGATACGAC GGTGAGCCTT 132 0 

ACGCGTTATC CTTTCAAGCG GAAGACCCGT 138 0 

AGTGTGGGCT ATTCCGGTCA GCTTCGCAAT 144 0 

CAGAGCAATC TCGTGCGCGA TTGGAAGAAT 1500 

ACTGTCCCTT TGTTGGATTA TATCAATCTG 1560 

TACACGAAAG GCATACGGAA GTCGTGGAAT 1620 

ACGACCTATA AATTCCGCAG ACTGTACGAT 1680 

TTGTACGGTA TGTTCAAGCC TTGGAAACCT 17 4 0 

CGTCATCGCT TCACGCCCAC TGTCAGTTTC 1800 

TATGGCTTTT GGGAGCTTCT TGAGCATACG 18 60 

TACTCTCCTT ATTTCGAGCA GATATTCGGT 1920 

AATTTCTCTT TTGACAACAA CTTAGAGGCC 1980 

ATCAAGAAGA TCAGCCTGAT AGATCAGTTC 204 0 

GATTCGATCC GATGGAGCAA TATCTCGGCT 2100 

ACCTTGCGCT TGTCCGGTCT GTTCGATCCC 2160 

GGGAAGATCA TTCCCTATAA GAGCAACGAC 2220 

CGCCTGATCA GTACGGGTAC TTCTTTCAGC 22 80 

TTGATAGCTC TTTTCAGTGG CAAAAAGGAG 2 34 0 

ACTCCTCATG AAGGAGACGA TGCTGCCGAT 24 00 

AGTGGGGGGT CGCTCCTCGA GCGCAACCGT 24 60 

TTCGCATATT CGATCCCATG GAGCCTGTCC 2520 

TACAATAGGT AC7\ATGTCAA TAAGATGGAG 2 580 

TTTCGCGGCA ATATCCAGCC TACACCGAAC 2 64 0 

TTCGACTTGA AGAAAATAAC ATCGCTTACC 27 00 

GCTATCTCGG CCAGTTTCAT CCCTATAGGA 27 60 

GTGAAGAGTT CACTCTTGCA GGATCTGAAG 2 8 20 

ACTTGGTAT 2859 



(2) INFORMATIOII FOR SEQ ID WO: 87 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 3753 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 

45 (ii) MOLECULE TYPE: DWA (genomic) 



(iii ) HYPOTHETICAL: WO 



(iv) ANTI-SENSE: NO 

50 



(vi) ORIGIIfAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



(ix) FEATURE: 
55 (A) NAIIE/KEY: mi£C_f eature 

(B) LOCATION 1 ... 3753 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 



60 



65 



70 



75 



AAATGGAAAT 
AACGAGGTAA 
TGCACGGTAT 
AAGCCTTGGA 
TTAATTCAAG 
CCTGTGAACG 
GAAAATCTAC 
GTTGCCAATC 
ACTAAAGCGG 
GGAGGGCAAA 
AAAATGTTGC 
CATGAGCCTC 
CGGGCTTTGA 
ATTATGGCCA 
ACGGAGCAGT 
GCGAATGATC 



TGGCATGCGC 
TAATAATGAT 
TTACCTTTCA 
TTCAGAAACA 
CTGAAATTGT 
AAATAAGGCC 
GGGCATCTCC 
CAATGGATCC 
TAACAGTACC 
ACATTGCCAG 
CTAAGGGGAC 
ATCGCTTATC 
CTCCTATTTG 
CTTCTACTGC 
ATCCTGTTAA 
CCATTGATTT 



TTTTGATTGC 
GAAACGATAT 
AATAAAAGCT 
TTCAATGGAT 
ATACCAATCT 
TGCAAATCGT 
CCCCGTAGTT 
TGAAAATCCC 
TGTCGATGTG 
ATTAAAGTCT 
GGCTACAGAA 
TGATTTTACC 
GGGAACACAT 
TGTGGATAAG 
AAATGTAACT 
GGTTATACAA 



GCGTGTTGTT 
ACAATAATTC 
CGCCCTTATG 
TCTAAATTGG 
GTTTCTGAAC 
TTCCCTTCGC 
CCCGTTGCCG 
AATGCCTGGG 
GTGATGGTTA 
GCCATTGCAT 
GGGGTGCGTA 
AAAGACACTG 
ACCCAGGGGG 
CATATCATAT 
ACTGCAGACT 
GGAGCAATTA 



TCGATCCATT 
TTGCAGTTTT 
AAAGATTTGC 
TGCCTGCAAA 
ATAGTGACTT 
ATAGGAAGTC 
TCGACAAGTA 
ATGTGACGCT 
TCGACCAGTC 
CGGGACAGCG 
TCGCTCTTGT 
CTTTTCTCTG 
GGCTTAAAAT 
TGATGTCTGA 
TCATTGGCAA 
ATTTCCCTAC 



TGTAGTAACT 
TCTTTTATTC 
AGATGTAGAG 
TAAGGGTAAC 
AGTTATTTCA 
TTTTTTTGCA 
TGCGGTACCG 
AAAAATCACT 
TTCGTCAATG 
TTTTGTGAAA 
GAGTTATGAC 
TCAAAAAATC 
GGCGAGAAAC 
CGGGTTAGCG 
AACTGGAAAT 
AAATTATGTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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TCCAACAATC CATCTACACC TCTTACCCCA AATTATCCAA CTCATTCTTC TAAAGTTGGA 1020 

CGGAGAAATC TGCCGGAATC CAAATTCGAT TATAGTAATC TGAGTGCAAG GATTACTTTT 108 0 

GATGGTGTTG CTGGCGCATT GGTCTATGAA CCGAGGTTTC CTCATCCCTA TTATTATTAT 114 0 

TTCCCTTGTA ACGCTGCTAT CAATGAGGCT CAGTTTGCGA AAAACTCTGG TTATACAATC 12 00 

5 CATACTATTG GCTATGACCT GGGAGATTTT GCCTTGGCCA ACAATTCGTT GAAACTMCC 1260 

GCTACAGACG AGAATCACTT CTTTACGGCG ACACCGGCCA ATTTAGCTGC AGCGTTTGAT 1320 

AATATTGCCC 7W\CTATTAA TATAGGTATA CAGAGGGGGG AGGTGACGGA CTTTGTAGCT 138 0 

CCTGGTTTCA TCGTTAAAAA TCTGACGCAA TCGGGAGATG TTACTCATTT GCTAAATGTT 14 4 0 

TCAAATGGAA CGGTGCACTA TGATGTCTCT ACTAATWKAC TGACATGGAC TACTGGTACT 1500 

10 ATCCTGAGCT CATCAGAAGC TACCATAACT TATCGTATTT ATGCCGATTT GGATTATATA 1560 

CAGAACAATG ATATTCCGGT AAATACTACT TCTGCTATCG GCCCGGATCT TGGTGGATTC 162 0 

GATACCAATA CCGAGGCAAA ATTGACCTAT ACCAATTCCA ATGGCGAACC GAATCAGCAG 1680 

TTAATTTTCC CACGTCCGAC GGTTAAGTTA GGTTATGGTG TTATTAAGCG GCACTATGTA 174 0 

TTGGTAAATA AAGACGGTCA ACCCATACAG GCAAATGGAA CAGTTGTCAG TTCCCTAAGC 1800 

15 GAGGCTCATG TTCTACAGTC ACAAGATTTC TTTTTGCCCT CAGGTGGAGG TCATATTGTT 18 60 

CCCAAATGGA TAAAGTTGGA CAAAACGACC GAAGCATTAC AGTACTATTC CGTACCGCCG 1920 

ACTAACACGG TCATCACTAC TGCCGATGGT AAACGTTATC GTTTTGTCGA AGTCCCAGGC 1980 

TCCACGCCGA ATCCGGGCCA AATCGGTATC AGTTGGAAAA AACCGGCAGG AAACGCTTAC 204 0 

TTCGCTTACA AGCTCCTCAA TTATTGGATG GGAGGAACAA CAGACCAACA GAGTGAATGG 2100 

20 GATGTGACGT CCAATTGGAC AGGAGCCCAA GTACCGCTCA CAGGAGAAGA TGTAGAGTTT 2160 

GCAACGACAG AAAATTTCGG TTCTCCGGCG GTAGCCGATT TGCATGTCCC GACAACCAAC 2220 

CCCAAAATTA TCGGTAACCT TATCAATAAT TCCGACAAGG ATTTAGTTGT TACCACAAGC 22 8 0 

AGTCAATTGA CGATCAACGG CGTGGTTGAG GATAACAATC CGAATGTCGG TACGATCGTC 2 34 0 

GTGAAGTCGT CGAAAGACAA TCCTACGGGG ACATTGCTTT TTGCCAATCC GGGCTATAAT 24 00 

25 CAAAATGTAG GGGGGACCGT CGAGTTTTAC AATCAGGGAT ATGATTGTGC CGATTGTGGT 2 4 60 

ATGTATCGCA GGAGCTGGCA GTATTTCGGT ATCCCTGTCA ATGAATCAGG TTTTCCAATT 2520 

AATGATGTGG GCGGAAACGA GACCGTCAAC CAATGGGTTG AGCCTTTCAA TGGCGATAAG 258 0 

TGGCGGCCAG CACCTTATGC ACCTGATACA GAGCTTCAAA AATTCAAGGG CTACCAGATC 2 64 0 

ACGAATGACG TGCAGGCACA GCCTACGGGA GTTTACAGCT TCAAGGGTAT GATTTGTGTG 27 00 

30 TGCGATGCCT TCCTGAATCT GACACGCACG TCCGGTGTCA ACTACTCGGG CGCCAACTTG 2 7 60 

ATCGGCAACT CATACACTGG AGCCATCGAC ATCAAGCAGG GTATTGTCTT CCCGCCGGAA 2 82 0 

GTCGAGCAGA CGGTGTATCT GTTCAACACG GGAACACGCG ACCAGTGGCG TAAGCTTAAT 2 88 0 

GGAAGCACGG TTTCAGGCTA TCGAGCCGGT CAGTACCTCT CTGTACCTAA GAATACAGCG 2 94 0 

GGTCAGGACA ATCTTCCGGA TCGTATTCCA TCGATGCATT CCTTCTTGGT GAAGATGCAG 3000 

35 AACGGAGCGT CTTGTACGTT GCAMATCTTG TACGATAAGC TGCTCAAGAA CACGACTGTA 3060 

AACAACGGTA ATGGTACGCA GATCACATGG CGATCCGGCA ACTCCGGATC GGCGAATATG 312 0 

CCGTCACTTG TGATGGATGT TCTTGGTAAC GAGTCGGCCG ACCGTTTGTG GATCTTTACC 3180 

GATGGGGGTC TTTCTTTCGG ATTCGACAAC GGCTGGGATG GTCGCAAGCT GACTGAAAAA 32 4 0 

GGTTTGTCAC AACTTTATGC GATGTCTGAC ATCGGTAATG ATAAATTCCA GGTTGCAGGG 3300 

40 GTTCCGGAGT TGAATAACCT GCTGATCGGC TTCGATGCGG ATAAGGATGG TCAATACACG 3360 

TTGGAGTTTG CTCTTTCGGA TCATTTTGCG AAAGGGGCTG TTTACCTGCA CGATCTTCAG 34 20 

TCAGGAGCCA AACACCGTAT TACGAATTCT ACGTCGTATT CATTCGATGC CAAGCGGGGA 34 80 

GATTCCGGGG CTCGTTTCCG CTTGTCATAT GGATGTGATG AGAACGTAGA TGATTCGCAT 354 0 

GTCGTGAGTA CAAATGGCCG TGAAATTATA ATTCTGAATC AAGATGCTCT TGACTGCACT 3 600 

45 GTAACCTTAT TCACAATAGA AGGTAAGCTT CTTCGCCGCT TGAAAGTATT AGCTGGTCAT 3660 

AGAGAAGTCA TGAAAGTGCA GACCGGAGGG GCCTATATTG TGCATCTTCA AAATGCTTTC 37 2 0 

ACTAATGATG TGCATAAGGT GCTTGTTGAG TAT 37 53 

50 (2) IMFORnATIOlJ FOR SEQ ID HO; 88 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 8 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRAIIDEDKESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA (genomic) 

60 (iii) HYPOTHETICAL: UO 

(iv) AUTI-SEHSE: MO 

(vi) ORIGINAL SOURCE: 
65 (A) ORGANISM: PORYPHYROMOHAS GINGIVALIS 

(ix) FEATURE: 

(A) NAIIE/KEY: mi£c_f eature 

( B) LOCATION 1 . . .1278 



70 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 



ACTATGGAAG TGAAGAAAAA CACAGTGGTG CTACGCCTTC TGATTTGGTT CGTGGCCATT 60 
CTTCTCTTCC ACTCCTCACG GCTGTGGGGA CAGGAAGGGG AGGGGAGTGC CCGATACAGA 120 
75 TTCAAAGGAT TCGTGGATAC CTACCATGCC GTACGCAGCT CTTCTCCTTT TGATTTCATG 



180 
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AGCTCGCGTA CGAGAGTGAG AGGTGAGCTG GAGAGGTCGT TCGGTAATTC GAAAGTAGCC 24 0 

GTATCGGTCA ATGCCACCTA CAATGCTCTA CTGAAAGACG AGACCGGCTT ACGTTTACGT 300 

GAAGCCTTCT TCGAGCATCA GGAAGAGCAT TGGGGGTTGC GCCTCGGACG ACAGATTGTC 360 

ATTTGGGGGG CTGCCGACGG TGTGCGCATC ACGGATCTGA TCTCCCCGAT GGATATGACC 4 20 

5 GAGTTTCTGG CACAGGATTA CGATGATATT CGTATGCCGG TCAATGCATT GCGTTTCTCT 4 80 

GTCTTCAACG AATCGATGAA AGTGGAAGTC GTGGTACTGC CTGTATTCGA GGGGTACCGT 54 0 

CTGCCTGTGG ATCCTCGCAA TCCTTGGAAT ATCTTCTCCC TTTCGCCCAT TGCTCAGGGG 600 

ATGAATATCG TCTGGAAAGA AGAAGCCGGC AAACCGGCCT TCAAGGTTGC CAATATCGAG 66 0 

TACGGTGCGC GATGGAGCAC TACGCTCTCC GGTATCGACT TCGCTTTGGC TGCATTGCAT 720 

10 ACATGGAACA AGATGCCCGT CATCGAAGTA CAGGGCATTG TGCCGACGGA AATCATCGTT 780 

AGCCCTCGCT ATTATCGTAT GGGATTTGTC GGCGGCGACC TCTCCGTACC CGTCGGACAG 84 0 

TTTGTTTTCA GGGGAGAGGC TGCGTTCAAT ATCGACAAAC ACTTCACCTA TAAGAGTCAT 900 

GCCGAGCAAG AGGGTTTCCA AACAATCAAT TGGTTGGCCG GAGCCGATTG GTATGCTCCC 960 

GGTGAATGGA TGATCTCAGG ACT^lATTCTCA ATGG.nAAGCA TATTCAGGTA TAGGGATTTC 102 0 

15 ATCTCCCAAA GACAACATTC TACCCTGATT ACTCTCAATG TTTCCAAG7\A ATTCTTCGGC 108 0 

AGTACACTCC AACTTTCGGA CTTCACCTAC TACGACCTTA CGGGCAAAGG ATGGTTCAGT 114 0 

CGCTTTGCAG CTGACTATGC CTTGAACGAT CAGATACATC TGATGGCCGG ATATGACTGG 1200 

TTCAGTAGTA AGGGCAGCGG TATATTCGAT CGCTACAAAG ACAATTCCGA ACTCTGGTTC 1260 

AAAGCCCGCT ACAGCTTC 127 8 



20 



25 



30 



35 



70 



75 



(2) IMTORMATIOtl FOR SEQ ID HO: 89 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1392 base pairs 

(B) TYPE: nucleic acid 
{C) STRAHDEDHESS: double 
(D) TOPOLOGY: circular 

(ii} MOLECULE TYPE: DUA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AHTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAI^JISM: PORYPHYROHONAS GINGIVALIS 



(2) IHFORIIATION FOR SEQ ID NO: 90 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 base pairs 

(B) TYPE: nucleic acid 



(ix} FEATURE: 
40 (A) NAliE/KEY: misc_f eature 

{B) LOCATION 1 ... 1392 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 

45 TACGGCAAAA GAAGAAAGCT CGGAACCTCC GTCCGACCTT CCGTTCTAAC CCAAATCAGA 

TTTATTTTGG ACTTACACCT TATTACTGAT TTTTTTGA^^G GGCTTCGGGT CAATCCTATC 120 

GGTGCAGCAG CCATAGTGGC TTTCATTATC GACCTGCTTC TTCTTTGCTG TTCGGCTTTT 18 0 

ATGTCCTCCT GTGAGGTGGC TTATTTTTCA CTAAAGCCGA TCGATCTGCA GAACATCCGC 24 0 

GAACGGAATC ACTCTTCCGA CATCGCGCTT TCCAATTTAT TAGACAATTC GAATCAGCTA 300 

5U TTAGCTACTA TTCTGATCGG GAATAATGTG ATTAATGTAG CCATCGTTAT CCTTTCCAAT 360 

TATGCCATCG AGCAGACATT CGTTTTCTCT TCTCCGATCA TTGGATTTCT GATCCAGACG 4 20 

ATACTCCTGA CCACTGTTCT TTTGCTGTTC GGAGAGATTC TGCCGAAAGT GTATGCGCGG 4 80 

AAGAATCCGC TGCAATACTC GCGCTTTTCT GCTGCAGCTA TGTCCGTTAT CTATAAGATA 54 0 

TTGTCACCGT TTTCAAAATT GCTGGTCAAA AGTACCGGCA TCGTTACCAG AGGTATCAGC 600 

AAGAAGAAAT ACGATATGTC CGTGGATGAG CTCTCGAAAG CGGTAGCCCT CACCACTACG 660 

GAGGGAGAGC CGGAGGAGAA AGAAATGATT AACGAAATCA TCAAATTCTA TAATAAGACA 7 20 

GCCTGCGAAA TCATGGTTCC GCGTATCGAT ATTGTGGATG TGGATCTGAG CTGGCCATTT 7 80 

CGTAAGATGC TTGACTTCGT TGTTTCGTCG GGTTATTCCA GACTTCCCGT TTCAGAGGGG 84 0 

TCAGAAGACA ATATCAAAGG GGTGATTTAC ATCAAAGATC TAATCCCACA CATGGATAAA 900 

bU GGCGATGAAT TCGACTGGCA TCCTCTGATT CGTAAAGCAT ATTTTGTCCC CGAAAACAAG 960 

CGCATAGATG ATTTGCTCGA GGAGTTCAGA GCCAATAAGG TGCATGTCTC CATCGTTGTG 102 0 

GATGAGTTCG GTGGCACTTG CGGACTGATC ACAATGGAGG ACATATTGGA AGAGATCGTC 1080 

GGCGAGATTA CGGACGAGTA CGATGAGGAA GAACTCCCCT TTAAGGTTTT GGGGGATGGC 114 0 

AGTTATCTTT TCGAAGGAAA AACGTCTCTC TCCGATGTTC GACACTATCT TGACCTTCCG 1200 

Ob GAAAATGCTT TCGGTGAATT GGGGGACGAG GTAGATACGC TAAGTGGGCT CTTCTTGGAA 12 60 

ATCAAGCAGG AACTCCCCCA TGTGGGCGAT ACAGCAGTGT ACGAGCCATT CCGCTTTCAA 132 0 

GTGACCCAAA TGGACAAGCG CCGAATCATC GAAATCAAGA TTTTCCCTTT CGAGCGCACT 1380 

TGGGAGGTCG AA 13 92 
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(C) STRANDEDIJESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA (genomic) 

5 

( i i i ) KY POTHETICAL : NO 

(iv) AHTI -SENSE: NO 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORVPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) IW1E/KEY: misc_feature 
15 (B) LOCATION l..,7 98 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 

ATAATTTATA GATCAACAAT GAAACTATTA CTTTATCTCC TATTGGTCTT GTCGACTCTA 60 

20 TCCCCGATGT ATTCGCAAAT GCTCTTCTCA GAGAATCTCA CAATGAATAT AGACAGCACC 12 0 

AAAACCATAC AAGGAACGAT ATTGCCCGTA CTGGATTTCA AAACCGA7W\ GGAAAATGTG 18 0 

TTCACCTTCA AAAATACTGC CAATCTCAAT CTGCTGATAA AGCACGGTCA AGTAATCAAC 2*10 

TTAATTAATA AGCTTGAGTT TTCTACCTAT GGCAATAAAG TAACCGTAAG TGGAGGATAT 300 

GTACACACCG AATACCGCTA TTTGTTGCAT CATGTTTTTG AGGTTTATCC TTATGTCGAG 360 

25 TCGCAATGGG CAGAAAGTAG AGGAATGAAA TATAAGGTTT CTPCGGGATT ACAGTCGCGT 4 20 

TATCGGCTGG TAAATAGTGA TAACTGTCTC ATGTTTGCAA CATTGGGGGT ATTTTTCGAA 4 80 

TTCGAAAAGT GGGAACAGCC AGCCACTAGC CTCTTTGCAG GAACGTATGC ATACAGCCGA 54 0 

AGTATCAAAA GCCACCTGTC TATCAGTTTC AGACATCGGT TGGGTGAACA TTGGGAATTT 600 

ACAACTACGG CTATTCACCA GGGAAAGCCT GACAGTTATT TTAAGAAGGC ACGTTTTGGA 660 

30 GGAGCTATCG ACCTCAAATA CCATATCACA CCTACGATAG GAATACGCGG GGCCTATCGG 7 20 

ATCATCTACG ATACTGCCCC TATTGTACCT GTGCGGATKAG ATTACAACAC CGTTGATGTT 7 80 

GGTATCGATA TTTCGTTT 7 98 

35 (2) INFOR14ATION FOR SEQ ID NO: 91 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2721 base pairs 

(B) TYPE: nucleic acid 
40 to STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

45 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



55 



(ix) FEATURE: 

(A) NAJvIE/KEY: misc_f eature 

(B) LOCATION 1...272I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 



ACTATATGCG TTGCCTTCTT ATCCGCTCCC GTTGCGGCAT TGCGCGAGTC GCCGCCAATG 60 

GGGGCGGAGC GGAAGACTCC CTCTCTCCTA CCCCTTCTTT TCGAGCATTC TGATAAAGGA 120 

60 AAGGGGTTCG ACTACAGACT ATTCACAAGC AATAAACTTA AAGTGTTTGC CACAGGTAAC 180 

AGCCGATACA TACACAATAA ACCAACAATC ATCCAAGCAA TGAAACGAAT CGTTTTATCA 24 0 

TCTTTCCTGT TCGTTCTGTC CATACTTTCT TTGATGGCAC AGAACAATAC CCTCGATGTA 300 

CACATATCCG GTACGATCAA GGATGCCTCC TCCGGCGAAC CAGTGCCCTA TGCCACTGTA 360 

AGCATCCGGC TGACAGGAGC AGATACCACA CAGGTGTTCC GACAAGTGAC TGACGGCAAC 4 20 

65 GGCTACTTCG TCATAGGCCT GCCGGCAGCT CCCTCCTATC ACCTGACAGC TTCGTTCGTA 4 80 

GGTATGAAAA CCCATACCAT GCAGATTAGT CGGGGAAATG GACAGCACGA CATCAAATCC 54 0 

ATCGACATTT CTCTCGAATC CGAGGACAAA CAACTCTCCA CCGTCACCGT ATCGGCAGCA 600 

CGACCACTGG TGAAGATGGA GATAGACCGC CTGTCCTATA ATATGAAAGA TGACCCCGCA 660 

GCCAAGACGA ACAACCTGCT CGAAATGCTG CGCAACGTTC CTTTGGTAAC GGTGGATGGT 7 20 

70 CAGGGCAATA TCCAGGTGAA AGGATCTTCC AACTTCAAAA TCCACCTCAA TGGCAGGCCC 7 80 

TCGACCATGG TGAGCAGCAA CCCGAAGGAG GTCTTTCGCT CCATTCCTGC CCATACGATC 84 0 

AAACGGGTGG AGGTCATCAC CGATCCGGGT GTAAAGTACG ATGCGGAAGG CACAAGTGCC 900 

ATCCTGGACA TCGTCACGGA AGAAGGTAAG AAGCTGGAAG GATATTCAGG TTCCATCACG 960 

GCCAGTGTCA GCAACAATCC CACAGCCAAC GGTAGTATCT TTCTGACGGC AAAGTCCGGC 1020 

75 AAAGTCGGGC TGACTACCAA CTATAACTAC TACGGTGGCA AAAACAAGGG CTCTCGCTAC 1080 
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TTTACCGAAC GTACTACATC CATGCTCCAA 
TTTGGCGGAC ACTTCGGCAA TGCCCTCCTC 
ACGGTGGGCG GCAATGTACG CCTTTGGGAG 
AGCTTTGCCG GCAGCAACCT CATGTCCTAC 
5 GCCGGATCAT ACGAGCTCAA TGCCGACTAT 

CTCACCGTTT CCTACCGCTT CACTCACAAT 
TGGAAGCGCG ATCCGCTCAA CACAGCTAAT 
TCCGATGCGG GCATGGACGA ACATACGGCA 
GCACATTCTT TGGAAGCAGG GCTGAAGTAC 

10 TATGAGATAC GACCATCCGA AGATGCTCCG 
CCGTCGAACG GAAAGTTCCG CCACGATCAA 
TATCGTAAGG ATCAGTATTC TTTGCAAACC 
GCACTCTTTC CCGAAAACGC AGCAGCAGAT 
CAGCTCACGC TCGGCTATAC CCCCTCGCCC 

15 CGAATCCAAC GTCCTGCAAT CGGCCAACTG 
CAAGTACAGT ATGGTAATCC CGACCTAAAG 
TATAATCAAT ACGGAGCCAA GGTCATGCTT 
AACGCCATCC AGAATTACAC CTTCTCCGAC 
TATGGCAATA TCGGACGAGA GCATTCTTTC 

20 GCCGTATGGG TCAGGATTAT GCTCAACGGA 
GCACTCGGCA TTGATGTCAA TTCATGGTCC 
CTGCCGAAGG ATTGGACTGT GAATCTCTTC 
CAGACGAAGT ATGATGGCAA TGTATTCAAC 
AAAAAATTGA GAGTCTCGCT GAGCGCAAAC 

25 AGCCGGACCA TCGGCAATGG ATTTACTATT 
GTTTCCCTCA GCCTCACCTA CAGCTTCGGT 
CGTACGATCG TCAACGACGA CCTCAAGCAA 
CAAGGAAATC CTACCGGCAA T 

30 

{?.) IMFORMATION FOR SEQ ID HO: 92 



ACGATAGAAG AAGGCAAAGG GCAAGAAACC 114 0 

TCATTCGAGA TAGATTCGCT CAATCTCTTT 1200 

ATGACCACCG ACCGGAACAG CGTAGAAAAA 1260 

ATAGACAGAA AACTCAAAAC ACAGATGGAT 1320 

CAGCACAGCA CTCGCCTGCC GGGCGAATTG 1380 

CCTAATAATA GCGAGACCTT CATTGACCAA 14 'JO 

ACGATCCAGT ACGCCGGCCA GCACTCCAAA 1500 

CAAGTGGACT ATACACGTCC CTTAGGACAA 1560 

ATCTATCGTC ATGCCACGAG CGATCCTCTC 1620 

TGGCAGCCCG GCTCTCTATA TGCACAGAAT 1680 

TACATCGGAG CAGCCTATGC CGGCTACAAC 17 40 

GGCCTCCGAG TGGAAAGCAG CAGGCTGAAA 1800 

TTCTCCCACA ACTCGTTCGA CTGGGTGCCA 1860 

ATGAAGCAGC TTAAGCTGGC CTATAACTTC 1920 

AATCCCTACC GGCTACAGAC CAACGATTAT 1980 

TCGGAGAAGC GTCACCACGT CGGTCTCTCC 2 04 0 

ACAGCATCGC TCGACTACGA CTTCTGCAAC 2100 

CCGGCCAATC CCAATCTGTT CCACCAGACC 2160 

AGCTTGAATA CCTATGCCAT GTACACGCCG 2 220 

AATATCGATC GCACATTCCA AAAGAGCGAA 22 80 

GGCATGGTAT ACTCAGGCCT GATGTTCACC 23 4 0 

GGAGGTTATT ATCATGGGGG AAGAAGCTAC 2 4 00 

AATATCGGTA TAGCCAAACA GCTTTTCGAC 2 4 60 

AACATTCATG CGAAGTATTC GACATGGAAG 2520 

TATTCGGAAA ATGCCGGTAT ACAACGGAGT 2 580 

AAGATGAATA CACAAGTGCG CAAGGTAGAG 2 64 0 

ACCTCATCCC AAGGACAGCA GGGTGGCGGA 2 7 00 

2721 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 1350 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

40 

(iii) HYPOTHETICAL: NO 



(iv) AMTI -SENSE: NO 

45 (vi) ORIGINAL SOURCE: 

(A) ORGAN I SI I: PORYPHYROMOHAS GINGIVALIS 



lix) FEATURE: 

(A) NAIIE/KEY: mi sc_ feature 
50 (B) LOCATION 1...1350 



(xi) SEQUENCE DESCRIPTION: S] 

TGGTGCCAAT CCGACCCAAT ACCTCATTGC 

55 CAACAAGGGC GATCAGACCA AGACCGTTTA 

TGCCTTGGGC GGTATGAAGG ATTTGGTGAA 

TTGATTGTGA TGCTGCCTGC TGTGCTTAGC 

CGACTGGAGA CGGACAGCAT TCGTCCGAAC 

TTCTTTCGGA ACAATGAATA CAATGCACGT 

60 CGGGTTTCCG CTTTTGCCTC TTACTCGCTG 

GGAGTATCTA CCCTGAACTA CTGGGGGGCA 

GATTTACCTT ATTGGACGGA CTATAACGAC 

CAGGCCATGC TGAAGCCGAC GGCCACGACT 

ACGGCTCACG GACTGATCGA ACCGATCTAC 

65 GAAGCCGGTG TGCAATTTCG GGGTGATTGG 

TGGATGAGCA TGATTTTCAA AAATGACAAT 

ACTACTTCGA AATTGTTATC GGGTGAAGGC 

ATTGCCACGC ATCGCGGCGG GGAATACAAC 

GTCAATGGAG CTGTCGGACT TAAGCTTTCG 

70 ATTTGGGGAT CTGCTTATGG TGTGGCAGCC 

AGAGGGTGGG GCGGTTATCT TTCTCTCGGA 

GACTATTGGT ACGGCAGGCA TTACGTTTCT 

ACGTATGACA AACAGCCTCT TACGAACGGT 

TATTCGTGGC GGATGGCACG AAGTGTTTCG 

75 CCTTCGGATC GTTTTGCGAT GAGCCACGCC 



;Q ID NO: 92 

CATGCAGTAT ATCGAAACGC TCAAGGACAT 60 

CCTCCCGTTC GAGGCTACCG GTATGCTCGG 12 0 

AGGATGAGAC TCTCTGCCAT TCTTATCGCT 180 

GGGCAGCATT ATTATTCCAT GGCGGGAGAG 24 0 

GAACTCTCGG CATCGATCCG AAGTGCGCTT 300 

TCGGTCAAAG GTTATACGTT GCCGGGTGCA 360 

CCGGCAGCAC ATGGTGTGAA GCTTTCGCTC 4 20 

AGTCGCTATC CGGCCGGTAT CGCTTATTCC 4 80 

TATGTACGCT TGCGTATCCT GCCTTATGTA 54 0 

GCTCTCATGC TGGGCAATAT AGCCGGTGGT 600 

AATCCTGAGT TGGATTTGAC GGCTGATCCT 660 

ACACGTTTCC GAATGGATGT TTGGGTC7VAT 7 20 

CATCAGGAGT CGTTTGTCTT TGGCTTGTCC 7 80 

AAATGGCGAC TCGAACTGCC CTTGCAGGCT 84 0 

TGGGCGCAGC AGGATACCGT GCATACATGG 900 

TATCGCCCTC GTACCGACAA ACCCATGCAG 960 

TTGTCAAGCG GAGGATACTT CCCTTACGAA 102 0 

ATGGACTTGG AGCACTTCGC TTTTCGTACC 1080 

CCCTTTGCTG CACCTTTCGC CAATTCCCTG 114 0 

TGGGGCGATT ATATTCGTCT CTATGCCGAC 1200 

TTGGCGGCTG TTGCTCGGGT ATGGTTCCAG 1260 

TTGGAACTGA CGATGCGTAT CGATCCCAAA 1320 
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TTCCCAATAG CTTTTCTGAA AGGCAATCAT 



1350 



10 



15 



20 



25 



30 



35 



40 



45 



50 



(2) iriFORI'IATIOH FOR SEQ ID HO: 93 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LEliGTH: base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDHESS; double 

(D) TOPOLOGY: circular 

{ii) IIOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: WO 

(iv) AIITI-SEIISE: IIO 

(vi) ORIGIt/AL SOURCE: 

(A) ORGAdlSl I: PORYPHYROMOHAS GIMGIVALIS 

(ix) FEATURE: 

(A) NAIIE/KEY: misc^f eature 

(B) LOCATION 1 . . . 1341 

(>:iy SEQUEliCE DE3CRIPTIOH: SEQ ID HO: 93 



CCGTTTTCAC 
TCGACAGTCT 
GACTCGTCGC 
ATACCCGACC 
ATGCCGGCCA 
GAACGTGTAT 
TCGTTCTACG 
GAGCTGCGGA 
CGTCTCAACC 
ATCGCAGGTG 
GCGCGTTTCG 
CTTGTGGATT 
GAACAGCAGC 
CACGTGACAC 
AACGGTACCC 
GCCCAAGAGC 
ATCAACGGAT 
ACTGCTCGCT 
AAGTGGAACT 
GGCTATGCGC 
GGAGGGGTGG 
CGGCATTACC 
ATCTATCAGA 



CGCCATGGGT 
CAAGTACCGC 
CGGCATATTC 
AACGTATGAA 
TCGCCCAACA 
TCGGTCTGGA 
ACCAGCCGCT 
TGTCTGCTCA 
GTGCCGCCGA 
TGGACTGGCA 
GAGGGATAGA 
ACATGACCTG 
TGCAGCTACA 
CCGATGTCGC 
TCCTCGGAGG 
GTAATATGTG 
ACCTCGACCT 
ACGGC7\AGGG 
TCCGCATTTT 
CTGCCCAATA 
AATATTACCC 
GGTACAGTGC 
TACCTTTCTT 



ATGTGCTGGA 
CGCCTCATCA 
GCCCATTGGC 
CAAATCGCTA 
GACAGGACCG 
GCAGAAGCAG 
TGTCGATGAC 
TGGCGAACTG 
CGGCACTTCG 
CCCGAACGAC 
ATACGACATG 
CTATACTTCG 
GGTACTCT^T 
TACCGCCACG 
AGCACTGCAT 
GTACTTCACT 
CACCTACTCG 
CAAGACCCTC 
CGATCAGGTC 
CGGCGAGAGC 
TACGGAGACC 
GACCGAGACG 
A 



GCGGCTTCCT 
CACAGGCGAT 
TCTTTGTCCT 
TTATCATTGG 
GCCGAACGCA 
AAAAAGCTGA 
GAATCCCATA 
AACCGCCACC 
TTTGCCGACA 
AAGGTGTCTT 
AACCCCGTAG 
GGCGTGAACT 
GCTTACAACA 
AGCTACCCGC 
ATGCGTTACG 
GCGGGCAACC 
ATCGAGGGAT 
ACGGACGTCA 
AATCTCTTCC 
AGCCACACGC 
AACTTCCGTC 
GAAAGCACCA 



CAGTACGCAC 
CGGAGCACAT 
CATCTCACTC 
CATGCCTCAT 
GCGGCGAGCC 
AGGTGTACTT 
TCGGACACTT 
TCAGCTTCGA 
ATCTCTCCAA 
TCTTCTTCGG 
AGATCTACCA 
TCGCATGGAA 
ACCGCTTCGC 
TCCTCTACTC 
CCGTGTCGAT 
TGTTCAATCC 
TGGACGACAA 
AGTACTATGC 
TCAAAGGCAT 
GCCACTCCTA 
TGTTCGTCAC 
ATGCTCTTCG 



ACGGCCATGC 
GCGATCGGCG 
ATTTGATTCG 
CCTGTGCGGT 
TTCTCTGGCC 
AGGCATACAG 
CAAGGTACAG 
CTGGCGACAA 
TGCCATCGAC 
ACGTCAGTAC 
GTACAGCGAC 
CTTCCACCCC 
CGACCGCTAC 
GGCACAGTGG 
GGCTCATCAG 
GGGCTVAACGG 
AGGCATTATG 
TCTGGTATCG 
GTACGAGAAC 
CGGCTATATG 
CTACATAGGA 
CGCCGGTCTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
114 0 
1200 
1260 
1320 
1341 



55 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID HO: 94 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 base pairs 

(B) TYPE: nucleic acid 
to STRA!1DEDHESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE; NO 

(vii ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROHOUAS GINGIVALIS 

(ix) FEATURE: 

(A) NAI4E/KEY: misc_f eature 

(B) LOCATION 1 . . . 681 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 
TATAAGCAAG CTATAATGAA ACATTTGTTT AAGTCGACAT TAGTACTTCT TTGTGCTCTT 
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TCTTTTTCCG GTACCTATAC CTTTGCACAA GAAAATAATA CAGAAAAGTC ACGATTTGAT 120 

TTTTCTGTTA GGCTGGGACA GGGATATATT GCAGGTTCAA CTACCAACCT GATGTATGGG 180 

TATACATCTG CTAACGATAG ACTTTTGTCT GGTGCAATTT ATCTGGGCTT GACACCAAGT 24 0 

AAGAAAGAlAA ATGCAACCGG CGTAGCATTT CGTTTCTTAT CHCCCTCTCC GGGTTATTAT 300 

5 GTCGATATAT CCGGCAAAGA AAATACCTTG AATTATGCGT TTTACGTTGT CGGAGCATAT 360 

AATAGAATAG CCATTCCTAT ACGCCCTATC AAAAATTTTA ATTTCATCTT CTCTACAGAA 4 20 

GTCGGAATGG CTTGGATGAG TCGTCATGAG CAAATTTACA ATTCTACTTC GCAGACTTGG 480 

GATAAGCAGC GCAAGTCGAG GTCGGGACTG GATTTTGGTC TCGGGATGCA TCTGCAATUC 54 0 

CACATTAATA AGACCGTTTA CTTTATGGCA GGAACCGATC TTACGTCTTG CATGTTCGGA 600 

10 AAAAGGATCA ATGACTACCA GCAAAAGGAT CGAACCTTCA TTGCACTTAT CGACAACAGT 660 

ATTGGCATAG GATTAAACCT C 681 



15 



25 



(2) IHFORMATIOIJ FOR SEQ ID HO: 95 



(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 1218 base pairs 

(B) TVPE: nucleic acid 

(C) STRAHDEDMESS: double 
20 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DHA (genomic) 
(iii) HYPOTHETICAL: MO 
(iv) AIJTI-SEIJSE: MO 



(vi) ORIGINAL SOURCE: 

(A) ORGAI-IISH: PORT PHY ROMOf IAS GIMGIVALIS 



30 

(ix) FEATURE: 



<A) NAl'lE/KEY: misc_featur€ 
iB) LOCATION 1...1218 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95 



ATATTCATAG ACCCCGATAA GAATACAAAA 
ATGCTGAAAA ATAAATTGGC CCCCTTGGCC 
AAGGCTCAGG AGCAACTGAA TGTGGTACAC 

40 GATGCACGTG CGGCCGGTAT GGGGGATATA 
CAGTATTGGA ATCCGAGTAA ATATGCTTTC 
TATACACCCT GGCTGTCCAA GCTGGTCAAT 
TACAAATTGG GAACAGACGA GAATCAGGCT 
GGAAAGTTGG AGACTTTCGA CGAATTGGGC 

45 TTTGCTGTCG ATTTGGGCTA TAGCCGCCAG 
CTGCGTTACA TCCGCTCAGA CCAAAGCACT 
GCCTTTGCGG CGGATATAGC CGGTTATTTG 
AGCTTGTGGT CGTTGGGTTT CAACGTAAAG 
GGTGTCACGA GTTTTTTCAT CCCTACTTCG 

50 ATCGATGACT ATAACAGCAT CAATTTCAAC 
CCTCCTATCA TGGATCAAAA CGATCAGGCC 
GAAACTTCTT CGATCAGCGG TATATTCTCT 
GAAGAATTCC GTGAGATTAC ATGGGGACTT 
TTTGTTCGTG CCGGATATTC ATACCTGCAC 

55 GCCGGTGCCG GCTTCAAAAT GAACATATTC 

ATCCAGAGTA ATCCGTTGGA TCAGACTCTG 
TTGCGCAATT TGTTCCAC 



CAAAACGAAC GAAATATGAT TATCAAGAAA 60 

ATACTGTTCC TTTTTGCTCC AAAGGCTATG 120 

ACCTCTGTGC CATCGCTGAA TATCAGTCCG 180 

GGTGTGGCAA CGACGCCGGA TGCGTATTCA 2 40 

ATGGATACGA AAGCCGGTAT TAGCTTCTCA 300 

GATATTGCCC TGATGCAGAT GACCGGTTTC 360 

ATTAGTGCTT CTCTGCGTTA TTTCACATTA 4 20 

G7\ATCCATGG GAGAGGCCCA TCCCAATGAA 4 80 

TTGTCGGAGA ACTTCTCCAT GGCTGTTGCA 54 0 

CACAACACCG GAGAGAATCA GGCCGGAAAT 600 

CAGAAGTATG TGCTACTGGG TAATGCGGAG 660 

AATATCGGAA CGAAGATCTC CTATGACGGA 720 

TTGAATCTCG GGACGGGGCT GTTGTATCCG 7 80 

CTTGAACTTA GCAAGCTGCT TGTACCCACT 84 0 

GGGTATGAGG CTGCACTCAA GAAATATCAG 900 

TCTTTCGGTG ATGCGCCGGG AGGACTCAAG 960 

GGGGCTGAAT ATAGCTATGA CGATAAATTT 1020 

CCCACCAJVAG GCAATTTGCA GTACTTCACG 1080 

CGTATCGATG CTTCCTACCT GTTGTCTACG 114 0 

CGGTTTACGC TTGCTTTCGA TATGGATGGA 1200 

1218 



60 (2) INFORMATION FOR SEQ ID NO: 96 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1356 base pairs 

(B) TYPE: nucleic acid 
65 (C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

{ii) MOLECULE TYPE: DNA (genomic) 
70 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: MO 



75 



(vi) ORIGINAL SOURCE: 

(A) ORGANISil: PORY PHYROI-IONAS GIIiGIVALIS 
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(ix) FEATURE: 

(A) nAT-lE/KEY: misc_f ea ture 
^ (B) LOCATION 1. . .1356 

(>:i) SEQUENCE DESCRI PTIOl I : SEQ ID MO: 96 

CTAATAATCG AAAAGGAAAT GAAAACAACA GTTCAACAAA TTATTCTGTG CCTGGCTTTA 60 

ATGATGTCAG GTGTATTGGG CGGAAACGCA CAGAGCTTTT GGGAAGAAAT AGCTCCTCCT 12 0 

lU TTTATCAGTA ATGAGCCTAA CGTCAAGTAT ATAATTCCCA ATATGGGGAT TGATTCAAAG 180 

GGAACAATCT ATGTAACCGT GACAAAAAGG ATTCAGCAGG GAGCAAATTA TACTTCTGAG 24 0 

CAATTGGGTA TGTACTATCG ACCATTAGGT GATAATGAAC AGTGGTGGAA ACATGATCCG 300 

TATTTTGATG ACAAGATAGT TGCGGATATT CAGACAGATG CATATGGCAG AGTTTATGTA 360 

TGTACGACTT CTTCTCGAGA TCAAGAGTAT CAACTTTATA TAAACGAGCA GAACGAATGG 4 20 

15 AGGTGTATAT TCAAAACTTC TGTGTCTACA TATGAGCATG GTATGGCTGT TTTTCGCTCT ^80 

TCGACAGGGG TGACTTATAT AGGTACCAGG CATCACATCT TCGCATCAGG TGTAAATGAT 54 0 

TTCGAGTTCA ACACTATCTA TGAAGACTCT ACACCTATGA GCTGTCGCTT TGCAGAGGCT 600 

ACGAATAGTG GCACCATCTA TCTGGCATTG ATGCATGAAA CCACAATGTC TACGACTATC 660 

CTTACTTATC AAAACGGTGA GTTCGTCGAT ATCTCGGAAA GTGAATTGAG TAACTCGATT 720 

ZU ATTGCATCCA TGTGCTCTAA TAAAGAAGGT GATATAATAG CTCTTGTTAC TTCATATACA 7 80 

GGATTTATGA GTGGAACCCT TGCGATCAGA AAAGCAGATG AAGGCAAATG GCAACTTGTT B-JO 

GGCGGAGATA TACAGAATGC GATCGTTCAA AATATATGCA TGATGGACGA CAACAAGATT 900 

GCTTGTGAAG TCTTCGGGAC TCCTAACGGA GTAGATGGTC GGACAAGGGT TTGTGTTTCT 960 

GACGCATCTG TCTTTGATTT TGAGTGGTAT G^J\GATGI\7^J\ TATACGGAGG CCTGATATTT 1^20 

Z5 GACACTTTCT TCTATAGCCC TTGGGACAAA CTTCTTTATG CGAAATTTGG TGGGATTATG 108O 

CTCAGGAGTA AAGAGTCTTV TATAACCTCT TTCATTTCTC CGACAGTTGT ACAAGGAGTG 11 AO 

GATGTCTATA CTTTGGCCGG GAAGATAAGG ATCGAAAGTG AAACTCCGGT GTCTGAGGTG 12 00 

TTGCTTTTCG ACCTGGCTGG CAGGATGGTA CTTCGGC7W\ CCATTGATAA TAAAATCTAT 12 60 

TCGGACATAG ATACTAACGG ACTAAAGCGA AGCGGTATTT ACGTAGTCTC GGTGCGGCTC 132 0 

ciU TCTTCGGGAC AGGTATTCAG TCATAAGGTG CAGGTA 135 6 

(2) J IJFORtlATIOI'4 FOR SEQ ID MO: 97 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 993 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDMESS: double 

(D) TOPOLOGY: circular 

40 

(ii) MOLECLfLE TYPE: DHA {genomic) 

(iii) HYPOTHETICAL: HO 

45 (iv) AUTI -SENSE: NO 

{vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROflONAS GINGIVALIS 

50 (ix) FEATURE: 

(A) NAl-lE/KEY: misc_feature 

(B) LOCATION 1 ... 993 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 



GGCCTGTACG GT ATGTCTGT TGTACCTATA ATAATTTATC TTTGCGGTAT ATCAAATTAT 60 

GCAAGACTCA TGATAATCCG GTGTCTTATC CGTCGTCCGA GAACCGTCCT GTTCGGGTTG 120 

ATATTCGTGG TAGGTCTTTT CTCTGCGATG GCGCAAGAGA AAAAGGATAG TCTCTCTACG 180 

GTTCAGCCAG TGCCGAATAG CAGCATGGTG GAGCAGACCC CTCTTCTCTC CATTGATCAC 24 0 

OU CCCGTCCTGC CCGCTTCTTT TCAGAATACC CGTACACTGA AAAGGTTTAG AGACAAACAT 300 

CTTTCCGATG CTTTGCTCAA TGGATTGAAG CCTCATCGCT CATCTTTGCA ATTGAATGAG 360 

GAACTCAACT TCGCGGCAGA GCGTCGGGAT TTCGTTTCTC CCCTCTTGCA AACTCGCCAC 4 20 

GCTGCCGGTG TCCTTTCATG GCGACCGACC GATAGGATGC ATTTTTATAC ATCGGGCAAT 4 80 

ATCGGTCTTG GCCATGATTT ATTGACCGGT GTGCGCAAGG ACTTCGGATG GAATGCTGGT bAO 

Ob GCCGACTTCT TGCTGAGTCA AAATCTTACG GCACATGTCC AAGGCGGTTG GCAGCAGAAT 600 

TTCGGCTTTA TACCTATGAC GGCTGTCAAT GGCCAACTGC GTTGGCAAGC CACCGAGAGA 660 

TTGAGTTTTA CCACCGGTAT CGATTATCGA CAGGTACAGT GGAATGCTTT CGATAATAGA 7 20 

ACGTTCTCGC TTAAAGGAAG TGCTCGATAC GAAGTGATGG ACAATGTCTT TGTCAATGGA 7 80 

TTTGGCAGCT ATCCTCTCTA CAGCAGTACG CGCTCAGGAC TCAATATGGC TGTTCCGATG 84 0 

/U CATGGATTCG GCCCTCAGTA CGGTGGATCG CTTGAGCTGA AAGTCTCCGA GCGATTCGGC 9O0 

TTTGCCGTCG GTATGGAGCG CGAATACAAT ATCTGGACTC GTCGGTGGGA AACGCATTAC 96n 

TTTGCTTATC CTGTATTCTA TGGCGATAAG AAG 993 

75 (2) INFORMATION FOR SEQ ID NO: 98 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 987 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRAIJDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

10 (iii) HYPOTHETICAL: NO 

(iv) At ITI -SENSE: NO 

(vi) ORIGINAL SOURCE: 
15 (A) ORGAHISM: PORYPHYROMOt JAS GIMGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 . . . 987 



20 



40 



55 



SEQUENCE DESCRIPTION: SEQ ID NO: 98 



GAGACGAACT CTTGGGTATC CAGCGATTGC AATTCGACGA CGATGAAAAC GAATAGACGA 60 

TACGCATTTG TTTTGCCGCT TCTGCTACTC ACCGGATTGT TGGCATGGGG GCAGGATTCT 120 

25 TCCCACGGTA GCAATACAGC GTTTGCAACT GATTCTTCGA GTAGAGAGTT GCCCACGGAG 180 

CAGTCCGCCT ACCGCATTCA TTCTGCCTAT ATGGTCGGTG GTGGCGGAAG CATAACGCGC 2 AO 

GACACCTATT TGTCACCCCT TCGTTATGGA GGATGGACAC TGAATTTGTT GGGAGAGAAG 30 0 

ACGTTCCCTC TCAAAGCCTC CGATTCCCGT TGGATGATCC GTACCGGGCA TGAGCTGGAT 360 

TTTGCCCTGA TGGACAATCC GGCCAATAAT GCTCATTTCT ATTCCCTGCT GTATAACGGT 42 0 

30 TCCGCTGCGG CTCTTTACCG CCTTGGCGCT AAGCATCTGC GAGCCGCGTG GATGGACAAT 4 80 

CTGCGCTTGG CATTCGGCCC GGGCTTGGAA ATCGGGCTTG GAGGAATTTA TAGTACACGC 54 0 

AACGGCAATA ATCCTGCGAC ATTGAAGCTC TACACCAATG CCATCGCCCA AGCCTCGATA 600 

GGATACTACG TCCCCTCCGA AACTTTTCCC CTGTATTTTC GGTTGCTCTC CCAGATCAAT 660 

CTCTTCGGTA TAGCCTATGG AAATGGTTTT GGTGAGAGCT ATTACGAGAA TTTTTTGCTC 720 

35 AATAACGGCA TTGCAGGCTC CCTGCATTTC ACTTATCCGG GCAAGTTTAC TCGGTTCACG 7 80 

ACACTCATAA CGGCGGATAT TCCCATTCGG AACTTCTGTA CGCTTCGTGT CGGTTATCGC 84 0 

TATTCCCATT TGGGCTCTTC GCTTAACGCA TTGGATACTC GAATCCACAG TCATACGGCT 900 

TTTATCGGTT TCGTCACGGA GTTTTACCGA TTCCGTGGGC GCAAAGCCAT GAATACCGGT 960 

CGGAGAACCA GTCTTTACTA TCATGAT 987 



(2) INFORMATION FOR SEQ ID WO: 99 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 957 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS : double 

(D) TOPOLOGY: circular 

50 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANT I -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROHONAS GINGIVALir 



(ix) FEATURE: 
60 (A) NAME/KEY: miEC_feature 

(B) LOCATION 1 . . . 957 

(xi) SEQUEt4CE DESCRIPTION: SEQ ID NO: 99 

65 TTTACGTCCG GTACGATATT CGTCCGTATA GCCATATCCG GCCGTGTAGT GTGCCGTGAG 60 

GTTGAGGATA ACGGAAGGAG AGAAGCTGTG CGTCATGATG GCATGGTAGT GACGCTGCTC 120 

GTAATTGTCG GTATTGTGGT AGTAGTGCGG TACTCCTTGC GCGTCCACGT ACATAAGACC 18 0 

GGCACTGTTG TATCGGCGGC CATATTTGGC TTCATCCTCC TTGGAAAGAC CGTTCCATGC 24 0 

GATACCCGTA ACTTCTTTTC CTCCGAAAGT GATGAACCTG AGAGCCGTGT TGCTACCGAA 300 

70 ATAGCCCACC TGTGCGAAAT AGGATTTCAG ATCCACGCTT CCTCTATCCA CGTAGCCGTC 360 

CGAACCGATT TTGGACAGGC GGGCATCCAC TGCCCAATGG CGACCGATGC GTCCGCTACC 4 20 

GAGTTTGACC GATCGGCGGA ATGTGCCGAA CGAACCTCCG CTCAAATCGA CACGGCCATA 4 80 

AGGAGCCAGT CCCAAATTAT CCGTACGCAT ATTGACACTT GCCCCAAAAG CTCCGGCACC 54 0 

ATTGGTGGAA GTACCCACAC CTCGCTGCAC CTGAAGGTCT TCGATGGAAG AGGCGAAGTC 600 

75 GGGCATATTC ACCCAAAAGA CGGACTGAGA TTCGGAGTCG TTGAGGGGTA CTCCATTGGT 660 
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AGTTATGTTG ATGCGATTGG CATCGGTGCC ACGCACGCGA AAGCCGGAAT ATCCGATACC 7 20 

CGTACCGGCA TCGCTGGTGG CTACCACGGA GGGAGTCAGC ATCAGCAGAT AGGGGATGTC 7 80 

ACGACCATAA TTGGACTTGG AAAGTTCGGC CTTGCGAACG TTGGTGTAAG CGACAGGGGT 8 40 

TTTCGCCGTG GCGCGAGTAG CTACGACCTG TACGGTCTGG AGCTGCACAT TGCTAAGACT 900 

5 ATCTATCTCG CTGTTGGAGA CGGGTGCTTG TGCCGTCAGG CAGAAAGGCA GGACGGC 957 

(2) INFORMATION FOR SEQ ID NO: 100 

10 (i) SEQLfBMCE CHARACTERISTICS: 

(A) LENGTH: 1842 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: circular 

15 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: MO 
20 (iv) AIJTI -SENSE: NO 



25 



30 



65 



70 



75 



(v-i) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GIMGIVALIS 

(ix} FEATURE: 

(A) NAME/KEY: mi£c_teature 

(B) LOCATION 1 ... 1842 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 



CCATATAATG TCCAATCTAT TAGCAACAAG ACGATTAAAA AACAAATGGA AAACTTAAAG 60 

AACATTCAGC CCAGAGAGGA TTTCAACTGG GAAGAGTTTG AGGCCGGTGG CGTCCATGCT 12 0 

GCCGTGAGTC GTCAGGAGCA GGAAGCTGCT TATGACAAAA CGCTCAATAC CATCAAGGAA 180 

AAGGAAGTGG TAATGGGTAG GGTAACTGCT ATCAACAAGC GTGAAGTGGT TATCAATGTA 24 0 

Jo GGGTACAAAT CGGAAGGTGT GGTACCTGCA ACAGAATTCC GCTACAATCC CGAACTCAAA 3 00 

GTGGGAGACG AAGTGGAAGT TTATATCGAG AATCAGGAAG ATAAGAAGGG CCAGCTCGTC 3 6n 

TTGTCTCACC GCAAGGGTCG TGCCGCTCGC TCTTGGGAGC GCGTGAACGA GGCTCTCGAA 4 20 

AAAGACGAAA TCGTAAAGGG CTATCTG/s^AG TGTCGTACCA AGGGTGGTAT GATCGTCGAT 4 80 

GTATTCGGTA TCGAGGCTTT CCTCCCGGGA TCACAGATCG ACGTGCGCCC CATTCGCGAC 54 0 

4U TACGATGCAT TCGTTGAGAA GACGATGGAG TTCAAGATTG TGAAAATCAA TCAAGAATAT 600 

AAGAATGTAG TTGTTTCCCA CAAGGTGCTC ATCGAAGCAG AGCTCGAACA ACAGAAGAAA 660 

GAAATCATCG GCAAGCTCGA AAAAGGGCAG GTACTCGAAG GTATCGTCAA GAATATTACT 7 20 

TCTTACGGAG TATTTATCGA CCTCGGTGGA GTGGATGGTC TTATCCATAT CACTGACCTT 7 80 

TCATGGGGTC GTGTGGCTCA TCCGGAAGAA ATCGTACAGC TGGATCAGAA GATCAATGTC 84 n 

4b GTTATCCTCG ACTTTGATGA AGATCGCAAG CGTATCGCTC TCGGACTCAA ACAGCTGATG 900 

CCTCATCCTT GGGATGCTCT CGACAGCGAG CTTAAGGTAG GCGATAAGGT GAAGGGTAAA 960 

GTTGTGGTGA TGGCAGATTA CGGTGCTTTC GTTGAGATTG CACAGGGCGT TGAGGGTCTT 1020 

ATCCACGTAA GCGAAATGTC ATGGACACAG CACTTGCGTT CTGCTCAGGA CTTCCTGCAT 108 0 

GTAGGCGACG J^JKGTGGAAGC CGTGATCCTG ACGCTCGACC GCGAAGAACG CAAAATGTCG 114 0 

OU CTCGGTCTGA AGCAACTCAA GCCGGATCCT TGGGCTGATA TCGAAACTCG TTTCCCTGTA 12 00 

GGCTCTCGTC ACCATGCTCG TGTTCGCAAC TTCACCAATT TCGGTGTATT CGTTGAGATC 12 60 

GAAGAGGGCG TAGATGGCCT TATCCATATT TCCGACCTTT CTTGGACGAA GAAGATCAAA 1320 

CACCCCAGCG AGTTTACGGA AGTAGGTGCT GATATCGAAG TTCAGGTAAT CGAGATCGAC 1380 

AAGGAAAACC GTCGTCTCAG CTTGGGTCAC AAACAGTTGG AAGAGAATCC TTGGGATGTA 144 0 

TTCGAGACGG TATTCACTGT AGGATCTATC CACGAAGGAA CGGTAATCGA AGTGATGGAC 1500 

AAGGGTGCTG TCGTTTCTCT GCCTTACGGT GTGGAAGGTT TTGCCACTCC GAAGCACATG 1560 

GTGAAGGAAG ATGGCTCACA GGCTGTACTC GAAGAGAAGT TACCTTTCAA GGTTATTGAG 162 0 

TTCAATAAGG ATGCCAAGCG AATCATTGTA TCTCATAGCG GTGTATTCGA AGATGAGCAG 160 0 

AAAATGGCTC AGCGTGAAGC CAATGCAGAG CGTAAGGCTG AAGCCAAAGC GGCTCAGAA.i\ 17 4 0 

DU GAAGCTGCTG CCGAAGCTGC CAATCCTGCA CAGGCTGTAG AGAAAGCCAC TCTCGGAGAC 18 00 

CTCGGCGAGC TGGCCGCTTT GAAAGAAAAG CTTTCAGAAA AC 184 2 

(2) INFORMATION FOR SEQ ID NO: 101 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 729 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) AtlTI-SEHSE: MO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVAL IS 

5 

(ix) FEATURE: 

(A) NAI'lE/KEY: mi sc_f ea t ure 

(B) LOCATION 1 ... 729 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 

TCGACAATAA TGAAAAAAGC TATTCTTTCC GGAGCGGCCT TGCTCCTCGG CCTATGTGCC 60 

AACGCACAAA ACGTGCAGTT GCACTACGAT TTCGGTCATT CCATCTACGA CGAACTAGAT 120 

GGACGTCCCA AACTGACTAC CACAGTGGAA AACTTCACAC CCGACAAATG GGGAAGCACC 180 

15 TTCTTCTTCA TCGACATGGA TTACACGGGC AAGGGTATCC AGTCGGCCTA TTGGGAGATT 2^0 

TCGCGCGAAC TGAAGTTTTG GCAAGCTCCC GTTTCCATTC ATTTGGAGTA CAACGGAGGC 300 

CTCTCCACAA GCTTTACTTT CGGACACGAT GCTCTAATCG GTGCCACCTA CACCTACAAC 360 

AACCCCTCCT TTACACGTGG ATTTACGATC ACGCCCATGT ACAAGCATCT GGGTGCGCAC 4 20 

GACTTCCACA CCTATCAGAT CACCGGCACT TGGTACATGC ACTTTCTGGA CGGTCTGCTT A 80 

20 ACCTTCAACG GCTTCCTCGA TCTTTGGGGT TTCCCCCAAG AGAACCCAAT CGGGGGCCCT 54 0 

GTGCTCAAAG AAGGGGATAA GTTCGTATTC CTGTCCGAAC CGCAGTTCTG GATCAACCTC 600 

AATCGCATCA AAGGCATCGA CAAGGATTTC AATCTCAGCA TAGGGACAGA GATGGAAATC 660 

AGCAGGAACT TCGCTCGCAT GGACAAATTC TCCTGCATCC CTACTCTTGC GGTCAAATGG 7 20 

ACTTTCAAC 72 9 



25 



40 



65 



{ 2 ) I M FORI 1AT I OIJ FOR SEQ ID 1)0:102 



{i} SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 705 base pairs 

(B) TYPE: nucleic acid 
(CI STRAHDEDNESS: double 
(D) TOPOLOGY: circular 

35 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(1 V ) ANT I - S EN S E : NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIMGIVALIS 



(2) INFORMATION FOR SEQ ID NO: 103 



(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1308 base pairs 
(B) TYPE: nucleic acid 
iC) STRAHDEDNESS: double 
70 (D) TOPOLOGY: circular 



75 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



60 



(ix) FEATURE: 
45 (A) NAI^IE/KEY: misc_feature 

(B) LOCATION 1 . . .705 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 

50 ATGAAAACAA TTAGTAAGAA CCATGCGGCA CGGATCTGTG CCGGCATTGC TTTGTTTGCA 

GTGTGTAACG GCCGGATAGC TGCTCAGGAT TTTCTCTATG AAATAGGAGG AGGTTTTGGT 120 

GCTGCTCAGT ATTTTGGCGA TGCAAACAGA GGCTTGTTCG GTTCATCCGG AGTAGGTTTG 180 

GAGTTGGTCG GACGTTACAA TTATAATTTT CGCTGGGCTT TCAGTACCAT GTTGGATTGG 2 40 

CGTACATTGA GAGGCGATAC CGATAAGTCC GGGAATGTCT TCCCCGATTT TGCTCAAGCG 300 

55 GATTTTAAGG TCGGCTTGAC TCAGCTCCAC GTTAGAAGCG AATTTAACTT TCTCCCTTAT 360 

AGCGATGGCT ATAAGTATCT TGGTACAGCT CGGCTGTCTC CTTATGTAGC GGCCGGGTTG 4 20 

TCTTTGGGTT TTGCTTCGGG TGCTAAAGGT TCGGCTTTTG CTCCCGGGAT TACTGCGGGA 4 80 

ATGGGAGTGA AGTATAAGCT TAAACCGCGG ATCAATGTCG GTATCGAGTA TTCTTTCACG 54 0 

GGGTTACTTA CCGATGCTTT GGATGCGCTG ACGGATAAAA GTGTTTGGCT CGAGGATCCA 600 

60 TATAAGATCA ATGACTCCTG GGTCAAAAAC AAGGATGCTA CAGGGGCCTT AGTGCTTAGG 660 

ATTACGTATG ACTTCGGCCT GCGTAAGACT TTTTGTAATA AACAA 7 05 
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(iv) AMTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAMISM: PORYPHYROMONAS GIIJGIVALIS 

5 

(ix) FEATURE: 

(A) NAJ'IE/KEY: misc_f eature 
(B) LOCATION 1 . . .1308 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 

AATATAATGT ATAAAGACTA TAAGGGTTTG TATGCGTCGC TTCGGTGGTA TGCCCTGATC 60 

ATTGGGTTGC TATTTGCAGC AGACGGTATA CAGGCTCAGA ACAACAACTT TACCGAGTCG 120 

CCTTACACTC GCTTCGGCCT TGGCCGTCTC GGAGAACGGA CGACTATTAG TGGGCATTCC 180 

15 ATGGGAGGAC TCGGCGTCGG TCTGCGTCAG GGCACATACG TCAATGCCGT CAATCCTGCT 24 0 

TCATACTCGG CTGTGGATTC GATGACGTTT ATCTTCGATT TCGGTGCATC TACCGGAATT 300 

ACGTGGTATG CCGAG7U\CGG GAAAAAGGAC AATAGGAAAA TGGGAAACAT TGAGTATTTC 360 

GCCATGCTTT TTCCTATTTC CAAATCCATT GCTATGAGTG CGGGAGTGCT TCCTTACTCC 4 20 

GCATCCGGGT ACCAGTTCGG ATCCGTTGAT CAAGTGGAAG GAGGCAGCGT CCAGTACACC 4 80 

ZU CGTAAATACT TGGGGACAGG CAATCTGAAC GATCTCTATG TCGGTATAGG TGCAACCCCG 54 0 

TTCAAAAACT TCTCAATAGG AGCCAATGCT TCATCCCTTT TTGGGCGATT CACACACAGC 600 

AGGCAGGTAA TCTTCTCCAC GGAGGCTCCT TACAATCCCG TACATCTCTC GACGCTGTAC 660 

TTGAAGGCTG CCAAGTTCGA CTTCGGTATG CAGTATCACC TTCTTCTCAA ATCAGATCGT 7 20 

TCGCTCGTTA TCGGTGCCGT CTATTCTCCG CGGGTGAAGA TGCATAGCGA GCTGACTCAG 7 80 

Z5 ATAAAGAATC AGGTTCAGAA CGGTGTAGTA GTGGAGAGCG AAACCCAAGA ATATATCAAG 04 0 

GGAATGGACT ATTATACCCT GCCTCATACA TTGGGGATAG GTTTTTCTTA TGAAAAGAAA 900 

GATAAACTTC TCTTAGGAGC AGACGTCCAA TATAGTAA.AT GGAAAGGCGA GAAATTTTAT 960 

AAATCCGATT GCAAATTCCA GGACAGAATA CGGGTATCTC TCGGCGGAGA GATCATACCG 102 0 

GATATAAATG CCGTTGGGAT GTGGCCTAAA GTTCGCTATC GCTTCGGTTT ACATGGTGAA 1080 

AATTCTTACC TGTWXGTGCC GACTAAAGGC GGTGTATATC AAGGATACCA TATCGTAGGT 114 0 

GCTGTATTCG GTATAGGAAT CCCGCTCAAT GACAGACGTT CGTTCGTAAA TGTCTCTCTT 12 00 

GAATATGACC GATTGATCCC GAAGGAGGGT ATGATCAAAG AAAATGCTCT GAAATTGACC 12 60 

TTCGGCCTCA CGTTCAACGA GTCATGGTTT AAAAAGCTGA AACTGAAC 1308 



35 



40 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 104 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2835 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA {genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAI4E/KEY: mi sc_f eatare 

(B) LOCATION l..,2835 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 



-^TACCATTT CGGAGAATGG GAACGATAGC ACCTATTTTT CTTTTCAAP.C TTTTCTTATG 60 

bU CGATCGATTT ATCAATTACT GTTGTCAATA CTCCTTGCTT CTCTTGGTTT CGTCGGGCTG 120 

GAAGCCCAAC AAGCCGGAGT AGCAGGTAGA GTATTGGACG AAGAAGGCAA CCCCATGATT 180 

CAAGCCAACG TACAGCTTGT ACAGAGTACC GGCCAAGTAG CCGTTGCCGC AGGTGCCACT 24 0 

AATGAAAAAG GGTTGTTCAG CCTGAAAACG TCACAGGAGG GTGACTACAT TCTGCGCGTT 300 

TCATATGTAG GTTACACTAC CCACGACGAA AAAATATCTC TTAGAAACGG GCAAACCATT 360 

bt) ACGCTCAAAG ATATATCCAT GAACGAAGAT GCCCGTCTTC TACAGAGTGT GACGGTGCAG 4 20 

GCTA7VAGCGG CAGAGGTCGT GGTACGCAAC GATACGCTCG AATTCAATGC CGGATCCTAT 4 80 

ACCGTAGCAC AGGGAGCTTC TATCGAGGAA CTGATCAAGA AGCTACCCGG AGCAGAGATC 54 0 

GGATCCGATG GGAAGATCAC CATCAACGGC AAGGACATTA GCAAGATCCT TGTCGATGGC 600 

AAAGAGTTTT TCTCCAAAGA TCCACAGGTG GCAATAAAGA ATCTTCCGGC CGATATGGTC 660 

/U AATAAAGTAC AGGTACTGAA CAAACTGAGC GAGCTGTCGC GGATGAGCGG TTTCGATGAT 720 

GGAGAAGAGG AGACCGTAAT CAACCTGACG GTGAAGCCCG AAAAAAAGAA AGGCCTCTTC 7 80 

GGAACGCTTC AGGCCGGCTA CGGTACCGAC CAACGCTATA TGGCCGGAGG GAACGTCAAT 84 0 

CGGTTCGATG GAAATAAGCA ATGGACATTG ATCGGTAGTG CGAACAATAC GAACAATATG 900 

GGCTTTAGCG AGATGGACAG CGAGATGGGA TCCATGACCT TCTTCTCTCC CCAAGGCGGT ^60 

/O GGTCGACGCG GCTTCGGCAA TAGTGGAGGT GTTACGTCTT CGTCGATGCT GGGCGGCAAC 102r. 
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TTCAGTGTCG AATTCTCCTC TGCCCTTAAT ACAGGAGGCG ATGCACGCTA CGGATACAAC 1080 

GACAAGGCCA TAGAGACGAC CAAACGCGTG GAAAATATCC TCGCCGAAGG GAATACTTAT 11-40 

ATGGACGAAA ATATATTGGA ACGCTCTTTC TCTCACAATG GTCAGGCGCG ATTTAGGATG 1200 

CAATGGAAAC CGTCCGAACG TACCGAAGTG GTATTCGAGC CGGATCTTTC GATATCCAAG 12 60 

5 ATCGATGGGT TCTTTAACGA CACATACGAG ACGAAAGATG CCACCGGAAT CTCTATCAAC 1320 

AAAGGTTCTA TCCACCAAAC TACACAAGGA AACAACTTCA GACTGAACGG AGAATTGGAT 13 80 

ATCAGTCACA AGCTCAACGA CGAAGGCCGT ACGATCAGTG CCTCCGTCAG TGGCGGTCTG 14^0 

ACCGACGAA.G ACGGAGATGG CATATATCAG GCTGTGCTCC AAAGCGTGGA GACGAATCAA 1500 

AAGCAATTCA ACGACAACTC CAACCTGCAA TATCGGCTTC GCCTCTCGTA TGTGGAACCG 1560 

10 TTGGGTAAAA ACTACTTCGC ACAAGCGATT CTGAACAGAC GTTTCTCCCG TCGCAATTCG 1620 

GATCGTGAGG TGTACCGACT GGGCGATGAC GGGCAATACT CCATATTAGA CAGTCAGTAC 1680 

GGACTCTCCT ACAGTAACGA GTTCACCCAG TATCGCATCG GACTCAACCT CAAGAAGATT 17 4 0 

GCCAAAACGT GGGACTACAC CGTAGGATTC AATGTGGATC CCAACAGAAC TGTCAGCTAT 18 00 

CGGAGCGTAG CCGGAGTAGA GCAGGACAAA CTGGCTTTCA ATCGTGTCAA TCTCTCCCCG 18 60 

15 ATGCTCCGAA TCAACTACAA ACCGAGCAGG ACTACCAACC TCCGAGTGGA CTACCGAGGA 192 0 

CGCACGACAC AACCATCCAT CAATCAGATC GCTCCCGTTC AGGACATCAC GAATCCGCTA 1980 

TTCGTGACGG AAGGCAATCC CGGTCTGAAG CCGAGCTATT CCAACAATGT GATGGCCATG 2 04 0 

TTCTCGGACT TCGATGCCAA AAGTCAGCGA GCTTTCAACA TTGTTTTCTT CGGCAACTAT 2100 

ACATTCGACG ACATCGTCCC CAATACGCAC TACGATCCGT CTACAGGGAT CCGTACCACT 2160 

ZU CGTTACGAAA ACGCCTCCGG TACGTGGCAA GCGAATCTTC ATGGGACACT ATCGCTTCCA 22 2 0 

CTCAAGAACA GGGCATTTTC TTTCAGGATG TCCTTGTTCA ACAGGTTGGC CGAAGGACAA 22 8 0 

AGCTTCATCA ATGACGATAA GAACAAAGCT CTCTCTTTCC GAACGAGGGA ACGCCTGACG 23 4 0 

CTGACCTATC GCAACAATTG GATCGATACG AGTATCGGTG GCAATATCGG ATTCTATATG 2 4 00 

GCGAATAATA GTCTGAGCGG ACAGAAAGAT TCTCGCACAT ACGATTTTGG CGGCAATTAT 24 60 

CAAGTTGCCC TAACGCTTCC CTATGGATTC CGTATCGACA GCGATGTTGA ATACAATACG 2520 

AACTCCGGTT ACAGCGGAGG ATTCAGTCTG GACGAATGGC TTTGGAATGC TTCGCTTTCA 258 0 

TACAGCTTCC TCCGTGACAA GGCCGGTACA CTGCGTGTCA ATGGCTATGA CATCCTCGGT 2 64 0 

CAGCGGTCAA GTATCAGCCG TTCTGCTTCG GCCATCAATA TAGAAGAGAG CATGTCCAAT 27 00 

ACGATCGGAC GCTACGTGAT GGTGGACTTT ATCTACCGAT TCAACGCCTT CAGTGGTGGT 27 60 
JU GGATCTCGCA GCGATCATCA GCGTGGCAAT ATGAATCGTC CGGGCCCACC TTTCGGCGGT 
GGCAGACGAC CGTCC 



35 



40 



45 



50 



60 



(2) II]FORMATIOH FOR SEQ ID NO: 105 

(i) SEQUEIJCE CHARACTERISTICS: 

(A) LENGTH: 1236 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: HO 

(iv) AIJTI-SElfSE : NO 

(vi) ORIGINAL SOURCE: 

{A) ORGANISM: PORYPHYROHONAS GIMGIVALIS 

(ix) FEATURE: 

(A) NAIIE/KEY: misc_feature 

(B) LOCATION 1 . . .1236 



55 (-i) SEQUENCE DESCRIPTION: SEQ ID NO: 105 



2820 
2835 



GGAGAGTATC CTGCAAACAG TAACGACAAA AAAGAGATGG TTATGAAGCT GATTAAAAGA 60 

AGTTTGCTCC TGCTTGGAGC GGTACTGCTG ATTACGCTTC CTGCGTACTC GCAGAATGAT i-^^O 

GACATCTTCG AAGATGACAT CTATACATCG CGAAAAGAAA TACGTAAACA A7VACCAAGTT 180 

AAAGACTGGC AAAACCAAGA GGACGGATAC GGCGACGATA CGGAATATAC AGTGGCTTCC 24 0 

GATCGGGACA TTGACGCCTA CAATCGTAGA GATGGCCAGT CCTACGATGG GAAAAAGTTG 300 

TCC/<AAGACA AGAAAAGAGA CTCCACTCGT TCTTCTGTTC CCGGTCGCTA TAGTCGCCGC 360 

TTGGCTCGAT TCTATAAGCC GAATACGATC GTCATTTCAG GTGCCGAC^ TGTATATGTA 4 20 

ACTGATGATG GTGAGTATTT CGTCTATGGA GACGAATACT ATGATGACGC GTCGTCTGTA 4 80 

DO AACATTTACA TCAACAGTCC TTGGTGCGAT CCGTTCCCTT ATACGTCATG GTATCCATCT 54 0 

TTCTCCGGCT GGTACAACTA TACGTGGAAC TATCCATGGT TCTACTACGG TAGCCATATC 600 

GGATGGGGCG GTTATTACCC CGGATATAAT TGGTATTGGA GCT ACT ACTA TGATCCTTTC 660 

TACAATCCCT ATGGAATCGG TATGGGTTGG GGATATCCTT ATGGCTGGGG CAGCTATTAC 7 20 

GGTTGGGGTG GCTATCCGGG AGTGATACAT CACTACCACC ACTACCCCAA GAAGACCTAT 7 80 

/U TCCAATGGTC AGCATTCCGG AGCTTACTAT TCTTATGGCC GACCGAATCG TATCAAAGGT 84 0 

GGAACGTCCG GTGCCAAACT TGGGACAGGA CGCTACGATA GAATTCAAAA TTCGT-^TT^G ^00 

CAAAAAAATA AGTTCGGATT GCAGTCGAAC AAACCCAATA ATAATCTGCA AAATGTCAAG Q60 

TCGGGACGTA CCGGCCGAGC CAATAGAGAC CGAAATATAG AAACGGTA/^.C TCCPJKACAAC 10^0 

GGGCAAAAGC AGAATCGTCC CGTATTCCAG CAG/'vATCAGT CCGGCAATGA CCGACCGACC 108 0 

/O GGACGGAATA TCCGCAGCGA GAGACAGGGG GAAAATAACG ATAGGACATT TTCGA^TCCT 114 0 
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TCTCGTAGCA ATAGTAACGG TGGCTTCTCC ACGCCTTCTC GCTCTTCTTC CGGCTCTATG 
AGCGGAGGTG GCGGACGTAG TGGCCGGGGA CGCAAT 



1200 
1236 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



(2) IllFOFU'IATION FOR SEQ ID HO: 106 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1803 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DIIA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AtlTI-SEIISE : NO 

(vi) ORIGINAL SOURCE: 

(A} ORGAIIISM: PORY PH YROMOI IAS GINGIVALIS 

(i>:) FEATURE: 

(A) NAI4E/KEY: mi^c_feature 

(B) LOCATION 1 ... 1803 

(Ki) SEQUENCE DESCRIPTION: SEQ ID NO: 106 

AGTAATAGCA GCTCCCACAA GTGGTTAATT TATTATCATA TAGAAAAGAC TAAAAGTATT 60 

ATGATCCGCA AGTTGATCTT ACTGCTTGCT CTGATGCCGG TAGCCTCTGT GGCTTTTGCG 120 

GTACCAACGG ACAGCACGGA ATCGAAAGAC AATCGTATCC TTACAAGCAT GCAATCCTCC 180 

TCTTTGAATA GGGATGATGC TCCGGATAAA TGGCAACCTA TGCATGCCAA TTTCAGTATT 24 0 

CAGAGCGATA TGCTGCTTTC TACTGCCCAA AAGTCCAAGA ACACCTGGTT CGGCAACTCC 300 

TATATCATGG GTATAATCAA GAACAATTAT CTGGAGTTTG GTGCCCGTTT CGAGGATCTC 360 

TATAAGCCCC TGCCCGGACA TGAACCCGAG ATGGGGCGTG GCGTTCCTCA CATGTATGTG 4 20 

AAGGGAAGCT ATCATTGGGC GGAGCTGACT ATGGGAGACT TCTACGATCA GTTCGGTAGC 4 80 

GGTATGGTAT TCCGCACCTA TGAAGAGCGC AACCTCGGTA TAGACAACGC GGTTCGCGGC 54 0 

GGAGGTATAG TACTCACTCC TTTTGATGGA GTGCGTGTCA AGGGTATTGC AGGACAGCAG 600 

CGTAACTACT TCGACCGCAC GGGCAAGGTA TTCAATTCCG GCCGAGGCTA CCTACTGGGT 660 

TCTGATCTGG AGCTGAATGT AGAGCGTTGG AGCAGTGCCA TGCGCGACAA TGACTATCAT 720 

TTGGCTATCG GGGGATCGTT CGTTTCCAAA CACGAAGCAG ACGAAGATAT ATTTGTGGGT 7 80 

GTAGGCGAAG ATCGCAAGCG ACTCAACCTG CCGCTCAATG TCCCGATTAT GGGCCTGCGC 84 0 

ACCAACTTTC AAAAAGGAGG TCTCGCCCTC TACGCAGAGT ATGGATACAA ATACAACGAT 900 

CCCTCGGCAG ACAATGACTA TATCTACCAC GACGGACAGG CTGCACTCCT CTCTGCCTCA 960 

TACTCCAAAA AAGGGATGAG TATCCTGTTG CAGGCCAAAC GTTGTGAGAA CTTTGCTTTC 1020 

CGCAGCAAGC GAAGTGCCCA GCTCACACCG CTTATGATCA ACTATATGCC GGCTTTTACC 1080 

CAAGCTCACA CTTATACGCT GGCGGCCATC TACCCCTATG CTACTCAGCC TCAGGGAGAA 114 0 

TGGGCTTTCC AAGGTGAACT GCGTTACAAC TTTGCTCGCC GGACAGCTCT CGGTGGACGC 1200 

TACGGTACCG GCTTGCGTAT CAACGTTTCG CATGTGCGTG GTCTGGACAA AAAGATGCTC 1260 

AAAGAGAATC CCGACGAACT GATCGGAACG GATGGCTACA CCGTTTCTTT CTTCGGCATG 132 0 

GGCGACCTCT ATTATTCGGA TATAGATGTG GAGATTACTA AAAAGGTAAG CCCAGGATTC 13 80 

AACTTTACGC TCACCTACTT GAATCAGATC TACAATAACA AGGTACTGCA CGGTGCAGCC 14 40 

GGAGAGAAGC CTGAGAAGAT CTATGCCAAT ATCTTCGTCT ATGATGGTAA GTATAAGCTG 1500 

AGTAATAAGG TAGCCCTCCG TACCGAACTG CAATATTTGC ACACGAAGCA GGATCAGGGT 1560 

GACTGGATCT ACGGCATGGC CGAGCTCTCT ATCCTGCCTT CTCTGATGCT TTCCCTCTCG 162 0 

GAGCAGTATA ATATCGGAGA GACCAAGAAA CATTATGTCA TGGGGTCTGT CACCTATACT 16 80 

CACGGAGCAC ATCGAGTAGC TTTCTCTGCA GGCAAAACCC GTGCAGGGAT GAACTGCTCG 17 4 0 

GGAGGTGTAT GTCGTGTGGT CCCTGAGACT CAGGGATTCT ACCTTTCTTA TAGCACCAAT 1800 

CTG IQQ2 



(2) INFORMATION FOR SEQ ID NO: 107 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 756 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 
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(A) ORGAIIISII: PORY PH i ROMONAS GIIJGIVALIS 

(ix) FEATURE: 

(A) NAI'IE/KEY : misc_f eature 
5 (B) LOCATION 1...756 

{xi) SEQUENCE DESCRI PTIOt J : SEQ ID MO: 107 

AGGGGGTCGA GTTCGGGCAT TTCAGCCCGT GGACGCGACA TGCGTTCTTT ATTTTTGAGC 60 

10 GCGTTGCGCA GCTCCTCTCT CCATGGTTCA GAGCGACGCA GTCGGATAAG TTCTTCTGTA 120 

GTCATGTCAA TAAGGCAGAA AATAAGGTTA TTCCATCTCT CGGTATGCGC CCAAACGCAT 180 

GATCATCTCA TCGAAATCCA CTTGGTGTGC ATCGAATTCG GGGCCATCGA CACAGACGAA 24 0 

TTTCGTCTGT CCTCCCACGC TTATACGACA AGCCCCACAC ATACCGGTGC CATCCACCAT 300 

AATTGTATTG AGAGAAGCTA TGGTCGGTAT CTCGT7U\CGT TTGGTCAGGA GAGAAACGAA 360 

15 CTTCATCATC ACAGCCGGCC CGATCGTAAC GCAGAGGTCT ACCGTTTCCC GTTTGATAAC 4 20 

GCTTTCCACT CCATCCGTTA CGAGGCCTTT CGTCCCATAA GACCCATCGT CTGTCATGAT 4 80 

GATCACTTCA TCGCTATTGG CTCGCATTTG TTCTTCAAGG ATAACCAGAT CTTTAGTTCT 54 0 

GGCAGCCAAT ACGACAATTA CACGGTTGCC TGCTTTGTGG AAAGCCTCCA CGATCGGGAG 600 

CAAAGGAGCC ACACCCACAC CGCCTCCGGC ACAAACCACT GTGCCGACCT TTTCGATATG 6 60 

AU CGTACTCTGT CCCAGCGGAC CTACCACATC CGTGATATAG TCGCCGACTT CGAGTTCGGC 7 20 

CAATTTCTTG GAAGATTTGC CCACGGCCTG AACCAC 7 56 



25 



35 



40 



45 



(2) IIJFORHATIOH FOR SEQ ID NO: 108 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2370 base pairs 
(E) TYPE: nucleic acid 
(C) STRAMDEDNESS: double 
30 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DIJA (genomic) 
(iii) HYPOTHETICAL: HO 
(iv) AMTI-SElviSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAI-IISM: PORYPHYROMONAS GIHGIVALIS 

(ix) FEATURE: 

CA) WAl-IE/KEY: misc_f eature 

(B) LOCATION 1. . .237 0 

ixi) SEQUENCE DESCRIPTION: SEQ ID rJO:108 



CTCTTGTTTT CTTCTCCTTT ACCCCGAATG GATCGTCCTA AGCCTTCATA TATTGTTCGA 60 
ATAGCAGCCA TTCTCTGCTT GTTTGTCGGC AGGCCTTTGT TTGCGCAGAG CTATGTGGAC 120 
TACGTCGATC CGCTGATCGG GACGCTAAGT TCTTTTGAGC TGAGTGCGGG CAATACCTAT ISO 

OU CCGGTGATCG GTTTACCGTG GGGAATGAAT AGCTGGACAC CGATGACCGG TGTACCCGGT 24 0 

GACGGCTGGC AATATACCTA CTCGGCACAC AAGATTCGCG GATTCAAACA GACCCACCAA 300 
CCCAGTCCTT GGATCAACGA CTACGGCCAA TTCTCCCTTC TTCCCCTTAC GGCACCGCAG 360 
AAGCCATCAT CGAACGACTC CATAGCTCTG ACTAAATGGT GCAAGCAACT CTTTTCGGAC 4 20 

GAACAGACCT CGTGGTTCTC GCACAAAGCG GAGACGGCGA CGCCATACTA TTATAGTGTC 4 80 

OO TATTTGGCCG ATTACGACAC ACGCGTGGAG ATGGCTCCGA CCGAGCGTGC AGCTATCTTT 54 n 

CGCATACGTT ATTCCGGCAA TACCGAAAGT GGCTCCGGTC GATGGCTTCG TCTTGATGCC 600 
TTTACCGGTG GTTCGGAGAT TAGCATCGTG GATCCTCACA CCGTAGTGGG CATATCTCGC 660 
AAGAATAGCG GAGGTGTGCC GGCTAACTTC GCCTGTTATT TCATCCTGCA GTCCGATACT 7 20 

CCTATGGCCG ATGTCCTGCT TGAGACAGAT ACCGGCAA.GT CAGACGAAGG CACAAGGGCA 7 80 

OU TGGGCAGCCT GTCGCTTCGA TTCGCAAGAA GTTACCGTCC GGGTGGCATC TTCTTTTATC 84 0 

AGTGTCGAGC AGGCCGAAAG AAATCTTGCG GAAGTCAAAG GGCAGAGTTT CGACCGGATC 900 
AGACTTGCCG GTCGCGAAGC TTGGAATAAG GTGCTCGGAC GCATACATGT GGAAGGAGGA 960 

ACGAAGGATG AGCGCACTAC ATTCTATTCC GCACTCTATC GCTGTCTGCT TTTTCCCCGT 102 0 

CGCTTCTATG AGGAGGATGC TTCCGGCAAT TTTGTGCATT ACAGCCCCTA CAATGGAGAG 1080 

bb GTACTTCCCG GTTATCTCTA TACCGATACC GGATTTTGGG ACACTTTTCG AGCCCTTTTC 114 0 

CCCCTGCTCA ATCTGCTGTA TCCCGATGAA AACATTAAAA TTCAGGAAGG TCTGCTGAAT 12 00 

GTATATCGCG AGAGTGGCTT TTTCCCCGAA TGGGCCAGTC CGGGCCATCG GGATTGTATG 1260 

ATAGGCAACA ACTCTGCTTC TGTTCTGGCG GATGCCTACC TCAAGGGTGT TCGGGTAGAA 13-^0 

GATACCCGTA CACTGATGAA CGGACTCTTG CATGCTACGA AAGCCGTCCA TCCGAAAATC 1380 

/U TCCTCCACGG GTCGCAAAGG TTGGGAGTGG TACAACTCCT TAGGTTATGT TCCGGCTGAT 14 4 0 

GCAGGCATCG ACGAAAGTGC TGCCCGTACG CTCGAATATG CTTATAACGA TTGGTGCATC 1500 

CTCCGACTGG GGCGCACATT GGGTTGGGAT AGAGCTGCAT TGGACACGTT GGCTCATCGT 1560 

TCGATGAACT ATCGTCATCT GTTCGATCCG GAAACCA.AAC TCATGCGCGG TAGAAATCAG 162 0 

GATGGTAGTT TCCGGACACC TTTTTCCCCT TTCAAATGGG GAGATGTATT CACGGAGGGC 1680 

/b AATGCCTGGC ACTACACTTG GTCGGTCTTT CATGATGTGC AGGGGCTTAT CGACCTGATG 17 4 0 
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10 



GGAGGAGATC 
GATGAGAGCT 
GGCAATTATG 
GGTCATCCAT 
ACTCCGGATG 
TCTGCTTTAG 
ATTTTTTCCA 
GCCAACAGTG 
TGCAATTACC 
ACGAAACCCA 
GAGCAACAGC 



GCCCGTTCGT 
ATTACGGATT 
CTCATGGCAA 
GGAAAGCTCA 
GGTATTGCGG 
GCTTCTATCC 
AGGTAATACT 
CCGATACGCC 
TGACTCACGA 
ATTATAATCG 
GTCGCGCTAA 



GTCTATGCTC 
TGTCATCCAC 
TCAACCCATA 
GGAGAGACTA 
CGATGAAGAC 
TGTTACACCC 
CTCTTTTCCC 
TTACATCCGC 
ACAGCTTCGC 
TGGTATGAAG 
TCACAGTAAT 



GATTCGGTAT 
GAAATCAGAG 
CAGCATATGA 
CGCGAAGTGA 
AACGGACAGA 
GCTACGGATC 
GACGGACACA 
TCGATCAGCG 
TCTTCTGCAT 
GAAAGTGACA 



TCAATACTCC 
AGATGCAAAT 
TATATCTGTA 
TGGGGCGGCT 
CTTCGGCTTG 
AGTATGTGCT 
AAACGGTGTT 
TAGAAGGAAA 
CCATTCAATG 
GACCTTATTC 



TCCTATGTTC 
AGCGGATATG 
TAATCATGCC 
CTATCGTCCT 
GTACGTTTTC 
CGGTTCGCCG 
GCATGCTCCG 
AGAATGGAGC 
GATGATGGAC 
CTTCTCCACG 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2370 



15 



20 



25 



30 



35 



40 



45 



50 



{2} IMFORJIATIOH FOR SEQ ID MO: 109 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DIJA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROI-IONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAl IE/KEY: mi£c_feature 

(B) LOCATION 1 . . . 858 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:109 



ATTTGTGGCA 
CGCAGCGTGC 
AACAAAGCTG 
CCTCTGCGGG 
TCGGGAGATG 
GATTCCTCTT 
CGCTCTAACA 
TACAAGCAGG 
ACATATACTT 
CGCACCACCG 
CTGTCCGCAC 
TT GG ATGCAA 
GATGACAAAG 
GCTGTCTATG 
CTAAAGCGTT 



GTAAAATGAA 
TGTTGCTACT 
CCGAATCGAA 
CAGACATAGA 
ATATGCTCAC 
CGCCGGAGAA 
ATGCCGAAGC 
AAAATACCAT 
ACGAAGGCGG 
GCAAGGTGCT 
TCATCATAGG 
TAGGTTTCTT 
GTCTCACATA 
TCCGTCTCGG 
ACTTGCCG 



TCTATCGGGA 
CTTTCCATTG 
GTCTGTCTCT 
TACGCCATCG 
AGAGATTTTC 
TGCCATGGAA 
CAATCTGC/VA 
CGCTTATTGC 
TGCACATACG 
CGAAGAGCGA 
ACAATTGGTG 
CAACGCAGAA 
CTGTTTCAAT 
ATATGACGTA 



TTACAATCTT 
TCTTTGATCA 
TTCGATTCGG 
CTGCATGTGA 
AACGGTTTGC 
GGCTATGCAC 
GGGCTTCCTT 
GATACGGGAT 
GAGAATACAG 
GATATATTCA 
CACGATTTCG 
GAAATACAGC 
GAGTATCAGA 
TTGGCTCCTT 



TCACTATGAT 
CTGCTTTGGG 
CCTATCTCGA 
TGATCAGCTA 
TCTTCGGCGA 
AGATGCTGGG 
CTGACCTTTT 
TGATCTCCAC 
TCCGGTTTGC 
AGATCGACTA 
GCAAGACCAC 
CCAATGGCAA 
TAGCTGCTTA 
TGCTAAGGGA 



GAAATCCATG 
CTGTAGCAAT 
ACGCTACATC 
CGTCTATCCT 
CAGCCTGATG 
AGAAGACTAT 
GGACTATATC 
GCGCATCAAT 
CAACATCCTT 
TGCGGAAAGG 
ACCTGCCGAA 
TTTTATGATC 
TGCCAGAGGT 
TGATTCCCCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
858 



55 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID NO: 110 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1134 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DlIA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AMTI-SENSE: NO 

(vi) ORIGIliAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .1134 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:110 
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10 



15 



20 



GGGATAATAT 
GCTGCTGGCA 
AGGTCTATGG 
AGCGGTCTGT 
AAAAAGAGAA 
AATAATTTGA 
AAAAAAACTA 
AATGCCCAAT 
ACGGCCGGAA 
GTACACCACA 
GCACACTGGG 
GGTATTGTTG 
CTCGGCCGCT 
GGATGGACCA 
GACACCTACA 
AGCAATGGCT 
AAGAGCAAAT 
TATCCGTATC 
ATGAACCACA 



CTGTTCTTTC 
CGCATAGTGT 
GTATTATCTC 
CGAGGGCAAA 
AACTGCAAGA 
TACCTTCGCT 
CTTTGACAGG 
CGAAGTTAAA 
ATGGTTTTGG 
GCATTCTGCC 
AAGAGGCCGG 
AAAAGGTAAT 
TCCTCATCAG 
ATGACGCCCA 
CAAGAGCTGG 
TTTCACCGGG 
ATGATTTTCG 
GTAATTTCAA 
TCGGCCATGT 



ACATGTGGTT 
GTATTCGATT 
AGCCTGCCGT 
TGTAGGTATT 
AAAAAATGTT 
CCCATTTAGA 
ATCGATATGT 
GATCAAGAGC 
TGGCAATATC 
CGAAGGGTTG 
AAGTCCGAAG 
AGTCTTCGAA 
TTTGGGGATA 
TATTCTTTTC 
CAGTGACCTT 
GATCGGACTC 
CCTCGGTTTC 
GCTGGATGGA 
GGGCTTCAAC 



GACCGTCCAC 
CTACATCCCT 
ATCGCAATTC 
CTTTCGTACA 
TTCCTCCAAA 
ATCGATAACA 
GCTTTACTCC 
ATTGAGGCAG 
TTCGGCATGG 
GATTTTTCGG 
CTCATGTATA 
GACGCAGAAG 
TCCTATACCA 
TCACCGATAC 
GTACTTCAGT 
A.AAAGTTCTA 
AGCCTGGGCT 
AATAAGCCGC 
TTTACCGTGG 



AATGGGGTGC 
CCGCCGGTAT 
TTGCCGGCAA 
ATCCAGAAAA 
TCCGGCTCCG 
CAAAAAAAAT 
TGTTTTTGGG 
CTACCACTTT 
ACATGAGCAT 
TAGGAATACA 
CGAATGTCCC 
ACTTTTTTGA 
AGCATCTGGG 
CCAAGAGCAA 
CCGAAGATGT 
TTTGGTGGAA 
ATGAGTATCT 
TTTCAGCACT 
GTCTTTGGAC 



TTCTCCCGAA 
TATCCGGATC 
GCACCCCCGT 
CACGCCCGAG 
TCAATCATTT 
CACTGAAATG 
TCTCTCGGCC 
CAGTTCGGCC 
ACGGATGAGG 
TGAAAGAAGA 
AAGTATCATT 
CAAAAAAGCT 
AGCGTATTGG 
GGTCCACTAT 
TGCCACAGTG 
AATGCCCATC 
GAACCTGCTA 
ATCTCCTCGC 
TAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1134 



25 



30 



35 



(2) IMFORtlATIOM FOR SEQ ID HO: 111 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDriESS: double 

(D) TOPOLOGY: circular 

(ii) riOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) AMTI-SEJISE: NO 



40 



45 



50 



55 



60 



65 



70 



75 



(vi) ORIGINAL SOURCE: 

(A) ORGATUSII: PORY PHYROHONAS GINGIVALIS 

{XX) FEATURE: 

(A) MAI-IE/KEY: misc_feature 

(B) LOCATION 1 ... 3807 

(Mi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 



GTAAAATGTA 
TCGGATTATT 
AAG7\AACCCC 
TCCTCACAAG 
GCAGTCGTTG 
TTTCCTGCTC 
GAACGTCTTA 
GGCCTGGATA 
CTAGAGGGTC 
AGTAAACTAG 
CAAATCCGTA 
GGTAACCAAA 
CTAAGAAGTA 
CTTGAACTAT 
GCAACGCTTG 
TCGTTAACAA 
CTCACCTCGC 
GAACGTCTCA 
GGCCTGGAAC 
CTAGAGGGCC 
AGTAAACTAG 
CAAATCCGTA 
CGCAACCAAA 
GTTTCGGGCA 
CAAACTTTAG 
CTCTCTCCTT 
AAACAGAAAA 
CATTCTTCGG 
AATACACATG 
GGGGGACAGG 



TGGGCAAGTA 
ATACCTTTGA 
AAGCCATTTT 
AAGGGATAAG 
AGCTTTGCTT 
TAAAAAAGCT 
CTTCGTTAAC 
GTCTCACCTC 
TGGAACGTCT 
AGGGTCTGGA 
AGCTGGAGGG 
TCCGTAAGCT 
ACCAAATCAG 
CGGGTAACCA 
AACTGTCGGG 
AGCTTCGTCT 
TAACAAAACT 
CCTCGTTAGC 
GTCTCACCTC 
TGGATAGTCT 
AGGGCCTGGA 
AGCTGGAGGG 
TCAGTAAGCT 
ATGATATTCA 
AAAAACTGAG 
ATGATAATCA 
AGACTTCAGT 
GTAAAACAAC 
TGTTGTCGAT 
ACTATTATCA 



TAAAAGAGCT 
GGGAGTTACT 
AGACTTAGAG 
CTGGTCGGTA 
GCGAGAATGT 
TGATCTATCG 
AAAACTTCGT 
GCTAACAAAA 
CACCTCGTTA 
ACGTCTCACG 
TCTGGAACGT 
AGAGGGTCTG 
TAAGCTAGAG 
AATCCGTAAG 
TAACCAAATC 
AAGAAGTAAC 
TTCTCTCTCC 
GGAGCTTTAT 
GTTAACAAAG 
CACCTCGCTA 
ACGTCTCACG 
TCTTGATGGT 
GGAAGGACTA 
ATCTATTGAT 
AATCCATGAC 
TTTGCCGGAG 
TGAATATCAC 
ATTTCTTAGT 
ACATCGAAGC 
TGGGATTTAC 



AAGTACCGCT 
TTTTTATGCG 
A.AGGCCTATA 
AATAGATATT 
CAGATAGAAA 
TATAACCAAA 
CTAAGAAGTA 
CTTTCTCTCT 
GCGGAGCTTT 
TCCTTAGCAA 
CTCACGTCCT 
GAACGTCTCA 
GGTCTGGAAC 
CTGGAGGGTC 
AGTAAGCTAG 
CAGATCAGTA 
GATAACCAAA 
CTTTTGGATA 
CTTCGTCT7U\ 
ACAAAACTTT 
TCCTTAGCGG 
CTTGCTTCCT 
GACAGACTAA 
GATATTAAGC 
AATCCATTTG 
ATTAAAGCTC 
CCATTTTGCA 
CAATACGATA 
AATAACCCTA 
CAAGCCTTTT 



ATTGGCTTTT 

CATCTGACGA 

ACATTGAAAT 

TCAAGCAAGA 

GCATGACTTG 

TCAGTAAGCT 

ACCAAATCCG 

CCGATAACCA 

ATCTTTTGGA 

CGCTTGAACT 

TAGCAACGCT 

CTTCGTTAAC 

GTCTCACCTC 

TGGAACGTCT 

AGGGTCTGGA 

AACTAGAGGG 

TCAGTAAGCT 

ACCAAATCCG 

GAAGTAACCA 

CTCTCTCCGA 

AGCTTTATCT 

TAACAAGGCT 

AGGTTTTGAG 

TATTGGCTCC 

TTGCATCATC 

TTCTTGAAAA 

AAGTAATGCT 

CAAATTATAC 

ATGCGATCTT 

TTACCACCCA 



TCCTTTCTGT 

TATGACAACC 

TCCTGATCTC 

TTCCTCCGGT 

GCTTATTGAT 

AGAGGGTCTA 

TAAACTAGAG 

AATCAGTAAG 

TAACCAAATC 

ATCGGGT7VAC 

TGAACTATCG 

AAAGCTTCGT 

CTTAGCAACG 

CACGTCCTTA 

ACGTCTCTCT 

CCTGGAACGT 

AGAGGGTCTG 

TAAGCTGGAG 

AATCAGTAAA 

TAACCAAATC 

TTTGGATAAC 

TAGTCTAAGO 

AAAACTTGAT 

GATTCTGGAG 

AGGCTTGATA 

AGAAAAAGAA 

ATTGGGAAAT 

GTATCAGAAA 

TTACGACTTT 

ATCGTTATAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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CTTCTCTTTT GGGATGCTAA GAAGGATCGA AACTTTGTGA GCGTAGATGA TAAAGAATAT 18 60 

CAGACTCTTA ATTTCAATCG CCCCTATTGG TTAGGACAGA TAGCCTATGC CTGCAATCGT 192 0 

TGTATGTCCG TTGGAGGAAA TCCTGATGGC AAGGACACAC CACAGACCAC AGACGATACA 198 0 

ATTATCATTC AGACTCATGC CGATGAAACG GGCGCTAAGC AGCAAACCTT AGGCTGTGCA 2 0^10 

5 GCCGAGAATG GAGTATTGGA AGAAATCTAT GTATCCTTAG AGCCCAAGGC GAATAGTGCC 2100 

GTACATGCGC TCAACTATCT GAATGAGCGG GTGCGAGAAG TTGTCGCAAG CAGGAGTAAA 2160 

TCAATTCAGA TCACAGAAAA AGATAAGGGA TTGTACGAAG CTCTTCCCAC AATCGCCGGT 222 0 

GATAATAAAC ACATCCCTAT CTCTCTCGAA GCTCTTGCGG CTCAATTGAA TAAGGGAAGA 228 0 

GCTGAAAATG ATCTTTACAC CATAGAGTAT CTACAGACCG AATTGAACCA GCTTAGTCTG 234 0 

10 CGAGGGGAGG TGCTTTACTA TCGTGAGAAT GAGAAGCTGA ACAATTATGT CTGGTTAGAT 2 4 00 

CCGGCAGCTT TTGTCCAAAT GATTCATGGA GAAATCCTCC AAAAAGACAA CATCAATAGA 24 60 

GGAACAGTTC CTAAAGACAT TTTTGAATGC AAACTGCATA ATCTAAGTTC CGGAAGTATA 2520 

TTTGAAGAAG ATGGCCAAAA TGGTAATATG ATCTTGCAGC TATTATTGGA AGAGCTGATC 2580 

GTATATGAAG ATAAGGACTG CTATGTGATA CCGGGCTATC TCCCTTTGCA TTCCGATGAC 264 0 

15 GAAGCCTATA AATGGCTTAC TTTGGGATTC GAGAGGCCCA ATTTTGTCCT CAAATTCGAA 27 00 

CGTTTTATCC CCTTTGGCCT GATCAACCAG ATTATAGCCT ACTATGGCCG GGAAGAAGGT 27 60 

GCTCTAAAGC GGTATTGGCG AGATCAGGTC ATCTTCACAG CAGGCCGTGA GATGGATAGG 2 82 0 

CAAACGCTTG AGCAAGAAGA AGAGAAAGAG GGTTTGCCCA AGACGAATGC CGAGGATTAT 28 80 

CAGATCTGGA TCAAGCTCGA CTTTACCGAC TTGGCCATAT CCGTATTCAT CAAAGAGCAG 2 94 0 

^0 AGAAAGACAT CAGCTAAGGA TATGCAGCGG AAAGAGGCTA CTATCCTCAG TGATATGTTG 3000 

GATATGTATT GGAA.CAATAT CCCTCCGAGG GAGCAAATAG GAGATAAGGA TACGGAGCAA 3060 

ACGAGAAGCA CTATTCGTGA AACAAACAGA AAGAAGAGAC CCATCCAGGA TCTCTACCTC 312 0 

TCCTGTGCCC AAGCGGATAA AGATTTGACG GAGTCTCATT ATATCCATTT GGGCACGCTG 318 0 

GACGATGAAA GCAAGACTAC GGCGAGGATT GCAGCCTATC CGTTGAAGAA CGGCGTTATC 32 4 0 

GATAMGAGC GGGTGCGAGA AGTATCGACT CGTCCCTACA AACATCTTTC CGTCAATAAA 3300 

AATCTGGCTA CTGCAAAACA GATCTTTATT TCCTATTCCA TVXGAGGATCA GACTGAACTG 3360 

GAGACCTGTC TGQAATTTTT CAAACCCTTG GAGAAGAATG GTCAGATCGA GATCTACTAT 34 20 

GATAAGTTGA CTAAGTTTGA AACACCTATT CACCCTGAAA TAAGAAAGCG TATTGTCGAA 34 8 0 

GCCGACTGTA TAATCGCTTT GATCAGCCAA CGCTATCTGG CCACGGATTA CATCCTGGAT 354 0 

OO CATGAGTTGC CTGTATTTCG GGAGTATAAC AAGACCATAG TGCCGATATT GATCAAGCCT 3600 

TGTACATTCG AAGACGATGA GTTCCTTCGG GAGAAATATT TTGCTCAGAA AGCTCAAATA 3660 

ATCAATCTTG GAAAAGAGGG A7W\ACCATT AAAGCTTATG ATAGTATTAC GGCATCAGCC 37 20 

CATCGTGATG AAAATTGGGT GGCAGTAGTC AGAGAGTTCA AAGAGAAGAT ATTAAGAATA 37 80 

ACAAAACAGG AGGTAAATAC AGATGAA 38 07 

35 

(2) IHFOKHATIOM FOR SEQ ID MO: 112 

(i) SEQUEtJCE CHARACTERISTICS: 

(A) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDHESS: double 

(D) TOPOLOGY: circular 

45 (ii) MOLECULE TYPE: DMA (genomic) 

( i i i ) HY POT H ET I CAL : NO 
(iv) AUTI -SENSE: I JO 

50 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHOl JAS GINGIVALIS 

(ix) FEATURE: 

(A) NAIIE/KEY: misc_feature 

(B) LOCATION 1 ... 693 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 

60 AAGTTTATGA TGAAAAAAGC ATTTGTTTTC GTACTACTGG TTTGCCTATT CTCCTCGTTC 60 

AGCAGTTCCG CCCAAACAAC GACGAACAGT AGCCGGAGTT ATTTTACAGG ACGAATCGAG 120 

AAGGTGAGTT TGAACTTAGG GGTCCCCCCC GTAAGCACAG AGGTTTGGGG AATGACCCAT 180 

GATGCGAACG GTCI'CCCTTT CGAAATACCT ATCTCTTTCA GTCGTTTCAA CAGCCAGGGA 24 0 

GATATAGCTA CCACTTATTA CATAGCGA/^T AGCGAGGCAA CTTTGAATGA ATGGTGCGAC 300 

DO TATGCACACC CGGGCGGCAT CGTGAGGGTA GAAGGTCGTT TTTGGAAAAT GACTTACAAC 360 

ATACCAACCT ACAATGCAGT CTGCACCCGG ATTACATTCG AAAATCAAGA AATAGAAGGA 4 2n 

ACGATCGTCT TGATACCCAA GCCCAAAGTC TCGCTGCCTC ATGTGTCGGA ATCGGTGCCT 4 80 

TGCATCCGAA CCGAAGCCGG GAGGGAATTT ATCCTTTGCG AAGAAGACGA CACCTTTGTG 54 0 

TCTCACGATG GTAACGAAGT AACGATAGGC GGTAAACCTT TCTTGCTCAA TACCAACGTA 600 

/U AAGATTGTGG GGGACGTATC TCAAAAGTAT GCCGTGGGGG TAGGAGAAAT TCGATTCCTG 660 

CAGATTTGTG CCCAAACAGT ATCACAAC7\A AAA 693 

(2) INFORMATION FOR SEQ ID NO: 113 

75 



40 



55 
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(i) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 1689 base pair^ 
{B) TYPE: nucleic acid 

(C) STRAMDEDMESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AMTI-SEMSE: MO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAI'lE/KEY: misc_feature 

(B) LOCATION 1 ... 1689 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113 

TCGCTAATCA ACAATTCAAG AGACTGGAGG GCTCCTGTCC GATTGAGTCT CAAAAAAAAG GO 

ACAAAAACTA TGAAGACAAA AGTTTTACGC AAATTCGTGG TGGCGGCTTT CGCCGTCGCA 120 

ACCCTCTGTC CTCTCGCCCA AGCGCAGACG ATGGGAGGAG ATGATGTCAA GGTGGTCCAG 180 

25 TACAATCAGG AAAAACTGGT ACAAACGAGG ATGAGTGTGG CGGACAACGG ATGGATCTAT 24 0 

GTAATGACCC ACAGTGGATA CGACACCGGC AATAGCAATG TGAAGATCTT CCGCTCCAAA 300 

GACCAAGGTG CCACATACCA AAAGTTGAGG GATTGGGATC CATCGGATGA TTATCAGTTT 3 60 

CAAGACTTCG ATATCGTGGT AACGGGTAAG AATGAATCCG ACATCAAGAT TTGGTCGGTA 4 20 

GAGCTCATGA ATAAGCCCGG AGGATATAAG AGTAGAGTTG CGGTCTTCAG TCGCGATGCC 4S0 

30 AACGCGCAGA ATGCGAAACT CGTGTATAAG GAAGACTTCT CCAATGTGCA GTTGTACGAT 54 0 

GTGGATATAG CCTCCAACTA TCGTTCGCCT TCTTCTCTTA ACAATGGTGG CAACCCTTTT 600 

GCTTTGGCTT TCGCTTACAC CGGCTTCAAC AATACGCACA AAATAAGTTT TGTGGACTAT 660 

GTGTTCTCTC TGAATGGAGG GCAAAATTTC AATAAA7\ACT TACTCTTCAG TCAAGATGGA 7 20 

GAGAAGAAAA TTGACAAGGT GGATCTCTCA TTGGGTAGCA CCTCTGAATC CATGGGTCAC 7 80 

35 AATGCCTGGC CGCTAATGGG TGTGGTATTC GAAATGAATA AACAAGGGGG AAAAAGCGAT 84 0 

ATCGGTTTCT TGTCGAACTT TGTCGACAAT GATCCCGAAT TTCAGTGGTC AGGCCCTATA 900 

AAAGTGAGTG AAAGCGACAT GTCGTTCAGC CCCAAAATCC AAATGTTGCT GGACGAGGAT 960 

AACAATACGA TCAATGGGGA GAGTTGCCAC AACTTCATGA TTACGTACAG CGATTATGAT 1020 

TCTGAATATT CGGATTGGGA CATTCGGTAT GTATATCCCA AGAAATCGTT CAAGTATGAA 1080 

40 AAAGGAAAAA CTCCGACTAT GGATGATCTG GTGGAAGCTT TCCTTACAGC TTCGTACCAG 114 0 

AGTGAGACCA ACTCGGGGCT GGGGTATGAC AAGAACGCCA ATCACTACCT GATTACATAT 12 00 

GCCAAAAAAG AAGAGAACGG TACGAACACG CTGAAATACC GCTGGGCCAA TTATGACAAG 12 60 

ATTCATAACA AAGATTTGTG GAGCGACACA TTTACGTATA CATCATCTGC CAATGCTCTC 1320 

TACACACCTC AAGTAGACAT CAATCCGACC AAGGGTCTCG TGTGCTGGTC ATGGGTGGAA 1380 

45 TATCTGCCGG GCAAACGGAT CGTTTGGTCT GATACGCAGT GGACCCATGC CAACGGTGTA 14 4 0 

GAAGACATCG TAATGCAAGA AGGCAGCATG AAGCTCTACC CGAATCCGGC TCAAGAATAT 1500 

GCTGTGATTA GCCTGCCGAC GGCAGCAAAC TGCAAGGCTG TTGTTTACGA TATGCAGGGC 1560 

AGAGTAGTCG CTGAGGCTTC TTTCTCCGGC AACGAATACA GGCTGAA.CGT GCAGCACTTG 1620 

GCTAAGGGTA CGTACATACT CAAGGTCGTA TCCGATACGG AGCGTTTCGT AGAGAAGCTC 1680 

50 ATCGTGGAA 168 9 



(2) IIIFORJ'IATION FOR SEQ ID NO: 114 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2358 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: circular 

60 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

65 (iv) AtlTI-SENSE: JJO 

(vi) ORIGINAL SOURCE: 

(A) ORGAMISH: PORYPHYROMONAS GINGIVALIS 

70 (ix) FEATURE: 

(A) NAME/KEY: misc_f ea tur e 

(B) LOCATION 1...2358 



75 



(:-:i) SEQUENCE DESCRIPTION: SEQ ID NO:114 
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ATCGTTTACC TTTGTCACTG TATGAACCAC 
AACTTCCTCC TCTTTTGTTG TCTGTCGCTG 
GTCTCTTCGG GTAGACGACT GACAGAATAT 
GGTACCACGA ATCCGGGAGC AGTATTGCCC 
5 GTCAGCGGAT CGACAGAGAA TCGCTTCGAC 

TCGGCCGACA ATAGTTACTG CATCGGTTTC 
CCCGAACTGA GTGGAATACT GCTGATGGCC 
TGCTATGGCT CTTCGCTCAG TCGAGAATAT 
GACAAATACG GTATAGATGC AGCCGTGACC 

10 GCTTTTCCCG AAGGAGAAGG CCATATCCTG 
TCGGGAGCCT CTGTTCGATT CTTAAACGAC 
ACGTTCTGCT ACAATCCGCA AGCAGTTTTT 
CGACCGATCT CTGCCGGCTA TTGGAAGAAG 
GATTCGACTG CAGGGAAATA TAAGCAGTAC 

15 GACATCGGTG TCCGATTCTC GTTCAACTGC 
GCCGTTTCAT TCGTCAGCGA AGCCAATGCG 
GTGTTCAAAA GTGTCGGAGG GAATCCGGCC 
ATAGAGCGTT GGGAGGAAGC CCTCGGTACG 
AAGACGATAT TCTATACCGC ACTCTATCAC 

20 GCCAATGGAG AATATCCTAT GATGGGCAGT 
TACACCGTGT TCTCTCTTTG GGACACGTAC 
TATCCGGAGA AGCAGTTGGA TATGGTACGG 
TGGCTGCCGA GATGGGAGCT GTACGGACAG 
CTTATCGTCA TCAATGACAC TTGGCAAAGG 

25 TATGAAGCCA TGAAAAAAAA TGCTTCTTCG 

AACGACGACT ATCTCACCCT CGGCTTCGTA 
TCGCATGCGC TGGAATACTA TCTGGCCGAC 
GGGCATAAAG AAGACGCAGC TCTATTCGGA 
AATAAGGAGT ATGGTATGCT GTGTCCATTG 

30 GATCCCAAAC AGGGTGAAAA CTTCGAGCCT 

AACTATGCCT TTTTCGTTCC CCACGATATA 
A^i.GGTTTTTT CGGAAAGGTT GCAGAAAGTC 
GAGCCGGACA TCGCCTATCC TTACCTCTTC 
CAGA^iATTGA CCCGGGAGTT GATAGACAAA 

35 GGTAATGACG ATGCCGGTAC GATGAGTGCT 
CCTGACTGTC CGGGCAGCCC CACCTATACA 
ATTCGGCTCA ATCCGCAGTA TTATCCTCAG 
AATCAACCGA CAGATTCCAT TTACATCCAT 
CATGGAACAA GGCATATCAG CCATGCCGAT 

40 CTAAGCAATC GTCCTCGA 



AGACGATCAA AAACCATGCT GACGATCCGA 60 

ATAGCGTTTG CTGCCGATGC ACAAAGCTCT 120 

GTCAATCCCT TTATCGGAAC GGCCAATTAC ISO 

AATGGGTTGA TGAGCGTTAC CCCTTTCAAT 2 4 0 

AAAGATTCGC GTTGGTGGAG TGCGCCTTAT 300 

AGCCATGTGA ATCTGAGTGG AGTAGGCTGT 360 

ACTTCCGGCA CATTCGATCC TGATTACTGC 4 20 

GCGCGCCCGG GAGAATACAA GGCTGTATTG 4 80 

GTAACCGAGC GGACTGCTTT GACCGAATTT 54 0 

CTGAACCTGG GACAGGCCCT AAGCAATGAA 600 

TCCACAGTCG TCGGCAGCAG GCTGATGGGG 660 

CGTCAGTATT TCGTACTTCA GGTGAGTCGG 7 20 

CAGCCTCCTA TGACAGTGGA AGCCCAATGG 7 80 

GACGGCTACA AGCGTGAGAT GAGCGGTGAT 84 0 

GATCAGGGGG AAAAGATCTA TGTACGATCG 900 

CTCTATAATC TGGAAGCGGA GCAAGAAGAG 960 

AAGGCTTTCT CCGCTATACG CTCTCGCGCT 1020 

GTGGAAGTGG AAGGAGGCAC ACCGGATGAA 1080 

CTGCTGATAC ATCCG/VATAT CCTACAAGAT 114 0 

GGCAAAACGG GTAATACGGC TCACGACCGC 12 00 

CGCAATGTAC ACCCGCTGCT CTGCCTCCTC 12 60 

ACACTGATCG ACATGTACCG AGAGAGCGGG 1320 

GAGACCCTGA CGATGGAGGG CGACCCCTCG 138 0 

GGCCTTCGTG CTTTCGATAC GGCAACGGCC 14 4 0 

GCAGGAGCGA CCCATCCGAT CCGTCCTGAC 1500 

CCGCTTCGCG AACAGTACGA CAATTCCGTA 1560 

TGGAATCTGT CCCGGTTTGC CCACGCACTT 1620 

AAACGCTCGT TGGGCTACAG ACACTATTAT 1680 

CTGCCGGATG GATCATTCCT CACTCCTTTC 17 4 0 

AATCCCGGTT TCCACGAGGG CAGTGCTTAT 1800 

CAAGGGCTTG CCCGGCTGAT GGGAGGAGCA 18 60 

TTCGATGAAG GATATTATGA TCCGACCAAC 192 0 

TCCTATTTCC CCAAGGAAGC ATGGCGAACG 198 0 

CATTTTTGCA ATGCTCCTAA CGGCTTGCCC 204 0 

TGGCTTGTCT ATTCCATGCT GGGATTCTAC 2100 

CTGACCTCGC CGGTATTCCC CCGAGTTAGG 2160 

GGGGAGTTGA TCATTACGAC CT^TACAGAG 222 0 

ACGGTTTCTC TTGGCAATAA AACACTTCCG 2280 

TTGGTGCGCT GCGGTCACCT CCGTTACGAA 234 0 

2358 



45 



50 



65 



(2) IM FORMATION FOR SEQ ID NO: 115 

( i ) SEQUEIJCE CHARACTERISTICS : 

(A) LENGTH: 24 4 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIJDEDMESS : double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DMA (genomic) 

(lii) HYPOTHETICAL: NO 

55 (iv) AMTI-SEMSE: HO 

( V i ) OR I G I UAL SOU RCE : 

(A) ORGAiaSM: PORYPHYROMOIJAS GIIJGIVALIS 



60 (ix) 



FEATURE: 
{A} MAI-IE/KEY: misc_ feature 
(B) LOCATIOM 1 ... 2442 



(xi) SEQUEIICE DESCRIPTIOM: SEQ ID NO:li: 



CCCCGTTTTC ATCCCGGAGG GACACACGCT CAGCTTTGCA GAAATGGGAG AAGAAACCAA 60 

AAATCAAATC AGTCATCGTG CTTTGGCGGT GGCACAGCTC CGCGATTTTT TATTATGTGC 120 

AAAATAAGAT TCAGCCTCTT GCAGGCTCTT GTCGTCTGCT TATTGTTCAC CTCTTTTTCT 180 

CTCCAAGCTC AAGAGGAAGG TATTTGGAAT ACCCTCTTGG CTATCCACAA GACGGAAAAA 240 

/U GCCGTAGAAA CGCCCAAGAA AGTCTTTGCC GTAGCCAACG GAGTACTTTA CTCGGTGGGC 300 

AAAGAAGCTC CCCATGAGGC AAAGATCTTC GACCGTATCA GCGGACTCAG CGATACATCG 3 60 

GTAAGCAGCA TAGCCTACTC CGAGCAACTA AAATCCTTGG TCATATACTA TGCATCAGGC 4 20 

AATATCGACA TCTTGGACGA AGCAGGCCGT GTGACCAACG TACCTGCATT GAAAGACAAT 4 80 

ATCGATCTGA TAGACAAAAC GCTCAATCGC CTTTTGATCG TAGGCAACAG GGCTTATTTG 54 0 

/O GCAGGAGGAT TCGGCCTCTC CGTTCTGGAT GTCGCCGAAG CTCGCATACC GGCTACCTAC 600 
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GCCAAGGGAA CTAAGGTGAC CGATGTGGCT AAGTTGGACA ATGATCGCTT GCTGATGCTG 660 

AAAGAAGGGC AGCTCTTCAT CGGAAAAGAG ACCGATAACC TGCAAGATCC GGCCGCATGG 72 0 

ACAGCCTTGT CTTTGAATTT GCCGATGGGC TCGGTCACCG GTCTGGGCAT TGTCGGGGAA 78 0 

GACATCTGTT TCCTGCTCGC CGATGGCCGT GTATATGTCG CTGCAAACCA ATCGTTTGAG 8^0 

5 CCGGAGCTAT TGCTCTCTTC CTCCGCCGAT TCACGACTGT ATGTGACGGA TCGTGGTCTG 900 

TTCATCTGTG CCGAGAATCG 7\ATTTATTTC ATAGAAAA=\G GTCGCAAAAC GACACAATTT 960 

CCTATAGCCG ACGTCCTTGG TGTCGGTGCC ATGAACGA.A.^\ GCAATACGGC ATACATAGCA i02 0 

TTGGGAG.AAG AAGGTTTGGC TTCACTTCTT CTCGCAGAGG GAAGTACGGC CGAAGCCATG 108 0 

CCTGTAGCAT TCGACGGACC GGGGGACAAT GATTTCTACG AGATGCGGTT TAGTCACGGA 114 0 

10 CGTCTGTATG CAGCCAGCGG ACTCTGGGGA ACAAACCTGA TGGGACATGC CGGTATGGTG 1200 

AAGCTATACG ACGGCAACCG ATGGACTAAC TTCGACAAGA AGACCGTACA GGAACAGTTG 12 60 

GGCGGCGGAT TCAGTTTCAA TGATGCTATC GATATAGCTG TTTCCAACGG AGACCCCGAT 1320 

CACTTTTTTG TCGGTACATG GGGAAACGGT CTGTTCGAAT TCAAGGATGG CAAAGCGATA 138 0 

GCTCGCTATT CGGGAAACGA AACTGCTATC GCAGAATGTA ATCCCGGAGA TGCCCGTGTG 14 AO 

15 AAAGCGATTG CCTTTGACAA TAAGGGCAAC CTCTGGGGGA CGCTCGGTGC CGTAGGCAAG 1500 

AACATCTTCA TGTACGATCC GCAGAGTAGC ACATGGCATT CTTTCAGCTA TCCGGATGTA 1560 

GCCAATCTGG CCTCCTTCGG CAATATGATT ATCCTACCCA ACGGAGACAA ATGGGTAAAT 1620 

ATCCTTCACC GTAGTGGCGG ATCCACGCGC AAAGGTGTCT TGATCTTCAA CGATCGGGGT 1680 

ACACCGGAAA CGACTTCGGA CGACAGCCAT CTTTACGTCG AGCAGTTTGT CAATCGCCTC 1740 

20 GGGGCAGCCA TAGGACATAA GACTATCTAT GCAATGGCCG TCGATCATAA CGGCTCTGTC 1800 

TGGATGGGAT CGGATATAGG CATTTTCGGC GTCTACAATG CAGCCGGAGT ATTGTCCTCG 1860 

ACTTCTACCC CTATCGCTGT TCGGCCGGTC GGAGGAG.AAG AACCCAATTT GTACTATGTG 1920 

CTGGACAAGG TGACGGTGAC AGACATCGTC GTGGACAAAC TCAATCACAA ATGGGTTGCC 1980 

ACCCAAGGGA CAGGACTCTA TCTCCTTTCG GAAGATTGCA GTAAGATCCT CGCGCAATTT 2 0-4 0 

25 ACCGTAGAAA ACAGCCCTTT GCTTTCTAAC AACATACTAT CCCTGGCCTT AAATGACGAT 2100 

AACGGACTGC TGTACATCGG TACGGCGGAC GGACTGATGA CGTTCCAAAC GGGTACGGGG 2160 

AGTGGATCAG CTTCCGAACT GGACGGCGTC TATGTATACC CCAATCCGCT AAGGCCGGAA 2 2 20 

TATCCCGATG GCGTCACCAT TGCCGGACTG CAAGCCGGCT GTAGTGTCAA AATCACCGAT 22 80 

ACCACCGGCA GACTGCTATA CCAGACTGAG AGCGTAACCA CCGAAGTCAA ATGGAATGCT 2340 

30 CGAGGTGCCG ATGGCAATAG GGTAGCTTCG GGCGTATATG CCGTTGCAGT GTACGATCCG 2 4 00 

GTATCGAAAA AGTCCAAACT AATTCGCTTC GCAGTGATTC GC 2 4 4 2 



35 



40 



(2) IMFORIIATIOH FOR SEQ ID MO: 116 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 34 86 base pairs 

(B) TYPE: nucle-ic acid 

(C) STRAIIDEDMESS: double 

(D) TOPOLOGY: circular 



45 



(ii) IIOLECULE TYPE: DHA (genomic) 
(iii) HYPOTHETICAL: HO 
(iv) AMTI-SEHSE: HO 



50 



55 



60 



65 



70 



75 



{ V i ) OR I G 1 14 AL S OLf RCE : 

(A) ORGANISM: POKY PHY ROMOMAS GIIIGIVALIS 

(i>:} FEATURE: 

( A ) NAME/ KEY : mi sc_ feature 

(B) LOCATION l..,34 8 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116 

GCTATTTCTC AGATGAAACG AATACTTCCA ATAGTCGCAT TCCTTTCTCT CTTCCTTGCC 60 

CTTGCTTTGC CTGCGAAAGC GCAACGAGCT ATGGGGAAGA CGGCCGACCG TTCGCTAATG 12 0 

GCTTCGGGAC ATTGGGTCAA GATACGTGTC GATGCAAGTG GAGTGTATCG CCTTACGGAC 180 

GAACAGCTCC GTGCCAATGG CTTCTCCGAT CCGTCCAAGG TAGGTGTGTT CGGTTATGGT 24 0 

GGAGGGGTGC TTCCCGAAGA TCTGAGCCGG ATCACGACAG ACGATTTGCC TCCGGTACCG 300 

GTACTCCGTC AGGGCAATGC GCTGTATTTC TATGCCGTGG GCCCGGTGAC ATGGTTCTAC 36 0 

AATCCGGCCA AAACCACCAT GGAGCATACG GTGAATACAT ACAGTACGCA TGGCTACTAC 4 20 

TTCCTGTCGG ATGCTGCCGG AGCACCTTTG CAGATGTCCC AATATACGGG TGGAGGTGCG 4 80 

TCGGCCGAGG CTTTGATCGA CTACTACGAT GAGCTGATGC TCCATGAACA GGAATTGTAT 54 0 

TCGCCCAAAG AATCGGGACG AGATCTGTAT GGCGAGTCTT TCAGTGCAGT CAATACGCGT 600 

ACGGTCAAGT TCCCTTTGAG GGGCAACACC CGCTCGTCTG GCGAACTCGG TACCGTATTC 660 

TCATACATAG CCAAGGCCAG ATCGGCCGGT GGCGGCCGTG AGATGTCGCT CTCGGCGAAT 72 0 

GGCATTCTGA TCTTCAGCGA TCCTTTTTCC ATGACATCGA ATGAAGTCTC CAATTCCTAT 78 0 

TTGGCCGGCA AGAAGCGTCG TCTCTATCAC AGTACGCCGA TGAACAGCTT GGTCAATGAG 84 0 

TTGCGCTTGG ACGCGAACTA TAGCATGACA GGAGATGCGG TCAATCTGGA TTTCATAGAG 900 

GTGGCTACAC AGAACGACCT CCGGTACGAT GGCGCACCCA TGCATATCAG GCGGTTTTCC 960 

AATTTGCCCG TTTTGGGGGG CGAGTCCTGC CGGTTCGTTA TCAGTGAGGT GCCGGAGTCT 102 0 

CTGGTGGTTT TGCAGGCCAA TTCTTCCCTG ACAGCATCGC TTGTTCCCGT TAAGACTGTC 1080 

GGGGATAAGA CCATTGAGTT CGTGGCTCCG CCGAAGGGTC AGGATCGTAG GACTATCAAT 114 0 
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ACGTTTTATG CCGTGGACTT 

AATC/iAAACC TGCATGGAGA 

CTCCTTGAGG CTGATCGACT 

GTCGTGTTGC AGGTXACAGGT 

TACCGCCTCT TTGCCAAAAT 



GTCACAGGCT TCTGCTCCGG AGATCCTCGG AGCGGTACCC 
GGAAATCCCT GATCTGATCA TTGTCTCTAC TCAGGCGCTC 
GGCCACCTAT CGTAGAGAGA AAAACGGGCT GAAGGTTTTG 
GTTCAACGAG TTTTCGGGTG GAACTCCCGA TGCTACAGCA 
GTTCTACGAC AGATGGAAGG CAAATGCACC TGTGGGAGAG 



1200 
12 60 
1320 
1380 
14^3 0 
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ACCTTCCCGA TGCAAATGCT TCTCTTCGGT GATGGGGCTC ATGACAACAG GAAGGTCTCC 1500 

GTAGCTTGGC AGAAACCGTA TCTCCAACAA ACGGAGTTCT TGCTGACATT CCPJKGCCGTC 1560 

AATTCGACGA ACGTAAACAG TTATGTGACG GATGATTACT TCGGCTTGCT GGATGATCAG 162 0 

CCGGCCTCGG TCAATATCGG TTGGCGCAAT TATAATATGG CTGTAGGGCG ATTCCCCGTA 1680 

CGTACTCCGG CCGAAGCTCG CATCGCAGTG GACAAGACCA TCCGATATGA GGAGGATCGA 17^0 

GAGAGTGGTG CCTGGCGTAT TCGTGCCTGT TTTGCGGCAG ACAACGGGGA CAAGCACGCA 1800 

ACCGAGACTT CCCGTTTGAT CGATACCGTC AAGCGTTATG CTCCTGCCAT CATGCCGGTA 18 60 

CGCGCCTTTC AGGACGTATA TCCGCATGTC ATCGAGAACG GGTTGCACAG CATTCCGGGT 192 0 

GCAAAGAAAA AGATGCTGGA AACCCTTCAG TCGGGTATTA TCCTGCTTAA TTATGCTGGT 198 0 

CATGGCGGTC CTGCCGGATG GTCGGACGAG CATTTGCTGA CGCTCAACGA TATACACAAA 204 0 

TTCAATTATA AGCATATGCC CATTTGGATT ACTGCCACGT GCGACTTTGC CAACTATGAC 2100 

AGTCAGACGA CCTCGGCAGG GGAGGAGGTT TTCCTCCATG AGAAGAGTGG CACTCCGATC 2160 

ATGTTCTCGA CTACGCGTGT CGTTTACAAT ACGCAGAATG AGAAGATCAA TGGTTTTATG 222 0 

CTTCGGCGTA TGTTCGAGAA AGCTAAGGAT GGGCGTTATC GTACGATGGG CGAGATTATC 228 0 

CGATCGGCCA AACAGGGGAT GCTCAGTACT GTTTTCCCCG ATTCGATCAA CCAGTTGAGT 2 34 0 

TTCTTTCTGA TGGGTGATCC GTCCGTGCGT ATGAATCTTC CTACCCACAA AGTGCAATTG 2 4 00 

ACCGCAATCA ACGGGCAGGA TCCCGAAGGG CAGTATGGAA CTATTATGCT CAAGTCTTTG 24 60 

GAACGGGTAG CTCTGAAGGG TAAGGTAACC GATGAAAAGG GGACATTCGA CGAGACATTC 2 52 0 

AGTGGCAAGG TTTTCCTGAC CGTCTTCGAT GGCAGAAAGA AAATGACAGC TTTGGAAGAG 2 58 0 

GAGGGAAT^.CG ATCTCTCTCT TGTATATTAT GACTATCCTA ACGTGATGTA TGCCGGTATT 2 64 0 

GCCGAGGTGA AAGACGGACT CTTCGAAACT TCGTTTATCG TACCCAAGGA TGTGAACTAT 27 00 

TCCGAGCACG AAGGCCGGAT CAATCTTTAT GCTTATAACG AGAGCACAAA GGCGGAAGCC 27 60 

ATGGGGGTAG ACTTCTCCAT CAGAGTCCAA CCGGGTATTC CTGATGAGGT AACGGAAGAT 2 820 

AATACACCGC CTGAAATCAT AAGCTGCTTC CTCAATGACA GTACATTCCG ATCGGGAGAT 2880 

GAGGTTAATC CTACTCCTCT GTTTATGGCC GAAGTATTCG ACTTGAATGG AATCAATATC 2 54 0 

ACGGGTAGCG GAGTAGGGCA TGATATTACG CTTTGTATCG ATGGCCGTGC CGACCTGACC 3000 

TACAACCTCA ATGCATATTT CACAAGTTCG GCTACGGATG CAGGTGTGGG CACTATTCTC 3060 

TTCATGATAC CGGCTTTGGC CGAAGGAGAT CATACTGCCC GACTGACGGT TTGGGACATT 3120 

TTCAATAATG CCGTCCATCA TGACTTTTCA TTCAGAGTGG TAGATGGCAT TGCTCCGGAT 3180 

GTGGCTGATG TGATTCTATT CCCGAATCCG GTACGCGAGA GTGCTACGTT CCGAATCTTC 324 0 

CACAATCGCC CCGGAAGCGA TTTGAACGTG GCCGTGGAGA TCTATGACTT CACCGGTCGT 3300 

CTTGTGAACA GTTTGCCAGT CAAGACCTAT TCGTCTTCCT ACGGAGAACC TATAGAGATC 3360 

AAGTGGGATC TGACCTCCAA ATACGGAGTG AAGATCGGAA ACGGATTCTA CCTCTATCGT 34 2 0 

TGTGTGGTGA ACTCTCCCGG AGGACAGACG GCCTCCATGG CCAAGAAAAT GATCGTGGTA 34 8 0 

GGACAA 34 8 6 



12) IHFORllATIOM FOR SEQ ID NO: 117 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 2919 base pairs 

(B) TYPE: nucleic acid 

(C) STRAtlDEDWESS: double 

(D) TOPOLaSV: circular 

(ii) IIOLECULE TYPE: DHA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AMTI-SENSE: MO 

(vi) ORIGINAL SOURCE: 

(A) ORGAIIISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) HAI'lE/KEY: mis c_£ea cure 

(B) LOCATION 1 . . . 2919 

<xi) SEQUENCE DESCRIPTION: SEQ ID MO: 117 

TTTTACACCG GAATTGACTT GCATATAGAA AGCCAAATGA AGAAACTTTT CCCATTATTA 60 

TTACTCATAT TGTCGATTTT GGTCGGATGT GGAAAAAAAG AAAAACACTC TGTAACTGAA 120 

ATCGCCCGAG AGAAAAAGCG TATTACTGCA TTGCTGTACG AAAAAGAACT CCCCACTGAT 180 

TCTGTTAAAC AGCTTTACGA GAACAGCGTA CAGAACAAGA ACCTTGTGGG ACAAATGTTA 24 0 

TTCGCGATCG AGGTCGGCAA ACGGATGCGT AATATGTCCC AATACACAGA TGCGATGCTA 300 

TATCACCAAA AAGGGTTGAA CGCTGCATTG AATCTAAGGG ACACCATCGT AGCCGCACAA 360 

GCATGGAATC ATCTCGGAAC GGATAGCCGT CGTATCGGTG CTTTGGCAGA AGCTTCGGAT 4 20 

TATCATTACA AGGCTCTTTC CTTGATAGAA TCTTTTAGCG GAAACCAGAA TAGGCCTGCG 4 80 

ATCAAAGCCA GATCGGCGGC CCTGAACGGC ATCGGCAACA TCAATCTTGA GTTAGGATAC 54 0 

CATGATGAGG CCGAAAAGAA TTTCCTGAAA GCACTGCAAG GTGAGAAAGA ACTCGACAGT 600 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 



PCT/AU98/01023 



81 / 490 



10 



15 



20 



25 



30 



35 



40 



CCTCTCGGGC 

TACGACAAGG 

CTGATGGGTA 

TATCAAAAGG 

CGATGGCACT 

GAAAGGCTCT 

ACTTCACATC 

TACAAACAAG 

GCGCACAAGG 

AAAGAGCTTG 

CTCTTGAGCA 

GCATACCGTC 

GGCTTTTTCA 

AATGAGAAAA 

ATAATAGACA 

AAGATCAGAA 

CAGATTCTCA 

CAACCCGAGA 

TCCAATCTTT 

CTGGCAAAAA 

ATAGATAAAA 

GAAAAGCATG 

GGTACGATCA 

ACACAAAACC 

GTCATGCCTG 

AATCATCGCT 

AACCTGCTCG 

AAAGAGGGTA 

ATGCCGATAA 

CACATTCCCA 

AAAAGCGGTG 

ATCGAGCAGC 

CTGGATCGGA 

CTTGCTGCCA 

GACTTGGCCG 

GTAGGTTGCT 

CTTGCCGATG 

GCTTACTTCT 

AAACTCCTTG 



AAGCTATCAA 

CTCGTACCTA 

TCGGACTCTG 

CTTTGCAAGA 

GGCTGAATTC 

ACCAGCATTT 

TGATAGAAAT 

CCCTCGAAGC 

TCAGCAGCAT 

AAGAAATACA 

CTCTTTTTGC 

AGGGCAAGAA 

CCGGTATTAC 

TGAGTTCAAG 

GACAGAGTAG 

GCGGAGTATC 

TCGATTCGTT 

GCAAACCTAT 

TGTCCAATGC 

CCAAGAATGA 

CTGATCAGGC 

GATCAGGCGT 

AAGTGGAAAG 

AGTCCTCTTC 

TCCACATCGC 

TCGAGGACGA 

TCAAACTACT 

TAGCCCTCGC 

TGGATGGGAT 

TTGTCGCTTT 

TAGTCTCTTA 

TTCTGAAAGA 

AGCCGGAGGA 

AAGATGCAGT 

AAAAAATGTG 

CCACCATCGG 

AGAGCAAAAA 

CTCGCACCTT 

CCATGCCGGG 



CTATGCGAAC 

CTTCCTCTTG 

TAGCATCAAT 

GTATGCCACG 

CTGTATCCCG 

CATTTCTTTG 

ATACAATCTT 

ATTCTGTCTG 

ACAAGAAACG 

GCAAGTAAGC 

CCTTTTCATC 

GCATAACAAG 

ACACGAATTT 

TCCTGATCTC 

CCATATGCTG 

CACGCCCGAA 

TGCACCATAC 

TGTCGTGGAC 

CATCAAGTAT 

AAAAAATGTG 

TCATATCTTC 

CGGACTCTCG 

CCAGCCGGGG 

GGCAGAGATT 

GCCCGATGAC 

ACGTCCGACC 

CCTTTGCGAT 

TACCGAGCAT 

AGAAATGACA 

GACGGCC7\AG 

TCTATGC7y\G 

CCGTGAGTTG 

GGAGCCTCAA 

GTCCGGTGGA 

CATGAGTCCA 

CTACATACAG 

CATCTCCGAC 

CAAACGCTAC 

GAGCGACAAG 



CTCGGACGTA 

TCTCTGGAGC 

CTCGGAGAAG 

GCATACAAAC 

ATGGCACGTA 

GCCGAAGGGA 

CAATACGAGA 

AGCAAGACGT 

CGATTCAACT 

AAGGCAAAAC 

TCGATTCTTT 

CTGATCAAAG 

CGTACGCCTA 

CAAGCATCGG 

AATTTGGTGA 

TGGCGCAATG 

GCACAGGCTC 

TTCGTCCCCT 

TCTTTAGCCG 

ATCATACGCG 

GACATCTTCT 

TTTACCAATA 

AAAGGAAGTG 

CTTCCTTGGC 

TCACCGACAT 

ATACTGCTCG 

CGCTACAATG 

ATTCCCGACA 

ATCCGGATGA 

AGTACCGAAC 

CCATTCTCTC 

CTCAAGAAGT 

CCGATAGATG 

ATCAAACAAA 

TCCCAACTCA 

CAGATCAAGA 

ATTAGCATTG 

ATGAACTGCT 

GAGACAGTT 



TCTATCGACA 

AGAACAATAT 

TAGACGAAGA 

TGATGGAACA 

TCAATCTCAA 

CTGCGAAAGA 

ATCTCGAGCG 

TGAGCGACAG 

ACGAACGAAA 

AAGAGAAATC 

TGATTTCTGT 

AGACGGATAA 

TCACCGTCAT 

ACAGAACCGA 

ACCAGCTGTT 

GCGACATCGT 

7VAGACATAAC 

CCTACTTGCA 

GAGGGAGAGT 

TTGCAGACAA 

ATCGAGGACA 

TACTGGTCGA 

CCTTCACCAT 

TACCCTCCTC 

CTCCGATGGT 

TCGAGGACAA 

TGCTATCCGC 

TT AT CAT TAG 

AGCAATCGCC 

AGGACAGATT 

CGGAGGAGCT 

TCTATATGCA 

ACAGCAGTAT 

ATCCGGATTT 

ACAGAAAGCT 

TAAAATTGGC 

AGGCAGGCTT 

CTCCCTCCCA 



ACGCAAAGAA 

GGCAGAGAAT 

AAAAGGGGAT 

GTTGTCCGAT 

ACAAGGTAAC 

AATTAATTCG 

TAAAAAAGAA 

CATGTCCATT 

CAAGTCCCAA 

GAAGTTTATC 

TCTGACATAT 

ACTTCGCTCC 

ACAAGGTTTG 

GCTGCACAAG 

GGATATTTGC 

CTCCTTCGTA 

CTTGGAGCTA 

AAAAATCATA 

GGTCATATCT 

TGGCATAGGA 

GTCCGCTACC 

AAACCTTCGA 

CAGTATTCCT 

CGATGACATT 

AGCAGCTCTG 

TAAGGATATC 

CGCAAACGGA 

GGATATTATG 

TCTGCTCTGT 

GGAAGGAATC 

TTTGATGCGG 

AAAACTCATG 

GCAGTTTCTC 

TTCCGCTCAA 

CACGAGTGTC 

CTGCAAGCTC 

TTCGGATCCG 

ATATCGGCAA 



660 
720 
780 
8^0 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
14-10 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2 04 0 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2919 



(2) IMFORIIATIOH FOR SEQ ID NO: 118 

Ci) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 168 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: circular 

50 (ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: 1(0 



55 



60 



(iv) ArJTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAIIISH: PORY PHYROMONAS GINGIVAL IS 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION I . , . 168 9 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118 



65 



70 



75 



CATCATAAAA 
TTGGGACTCT 
GGTCAATTTG 
CCTGTGCGTG 
GGTATCCCGA 
GCGAGTGAAA 
TCTAATTGCG 
TGGTTCGCAA 
CGTCCTAACG 
GGG ATGAAGC 
CAGTCACATA 



CATATCAAAC 
TCCAACTGCC 
CAACAGAAGA 
CTATCGCTGA 
TGGAATTGAT 
GCCAAAAAAA 
ATTTCATCAT 
TGTACGATAC 
ATGATGAATT 
TCAAGCAGAC 
TCGCATATAC 



AATGAAAAAG 
CGCAATCGCC 
GATGCAACGA 
GTACGAACGC 
CAAAGAGCTG 
CACCGTTATA 
TGCGAAAACT 
GAACAAAGTA 
CCCCAAATAC 
CGGTGGCAAC 
GGAGAACTCC 



CTTTTACAGG 
CAAACGCAAA 
GCATTCCAGG 
TCTGCAGCCG 
GCCAAGAACG 
ACCCAGTACA 
GACTCTTACT 
GGTCTCGTGG 
GAAGCACAAT 
TACATGACGG 
TCTCTGTCTC 



CTAAAGCCTT 
TGCAAGCAGA 
AAACGAATCC 
TTTTGGTACG 
ACAAGGTGAT 
CCCAAAGCGG 
GGACACGCGA 
ACTTTATTTA 
ATCTGGGCAT 
ACGGATATGG 
AAGCTCAAGT 



GATTCTGGCA 
CCGAACAAAC 
CCCTGCAGGT 
CTACCCGTTC 
TACCATTGTG 
TGTGAATCTC 
CTATACCGGT 
TAACCGCCCT 
CGAGATGTTC 
ATCCGCTGTG 
AAATCAAAAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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30 



ATGAAAGACT ATCTCGGCAT CACACATCAT GATGTGGTAC AAGATCCGAA CGGCGAATAT 7 20 

ATCAACCATG TGGACTGTTG GGGCAAGTAT TTGGCACCGA ACAAAATCCT CATCAGGAAA 7 80 

GTGCCTGACA ATCACCCTCA GCACCT^GCC CTGGAAGATA TGGCAGCCTA CTTCGCAGCA 84 0 

CAGACCTGCG CATGGGGAAC GA;i.GTACGAG GTATATCGCG CTTTGGCCAC CAATGAACAA 900 

5 CCGTACACGA ACTCTCTGAT TCTGAACAAC AGGGTATTTG TTCCTGTCAA TGGCCCCGCC 960 

TCCGTGGACA ACGATGCTCT GAACGTCTAT AAGACGGCAA TGCCCGGTTA CGAAATTATA 102 0 

GGTGTCAAAG GGGCTTCAGG AACACCTTGG TTAGGAACAG ATGCCCTGCA TTGTCGTACT 108 0 

CACGAGGTAG CGGATAAGGG CTATCTCTAT ATCAAGCACT ACCCGATACT GGGCGAACAG 114 0 

GCAGGCCCTG ATTATAAGAT CGAAGCAGAT GTCGTCTCAT GCGCCAATGC TACTATCTCG 1200 

lU CCGGTACAAT GTTACTATCG TATCAATGGT TCCGGTAGCT TTAAGGCTGC TGATATGACG 1260 

ATGGAATCAA CAGGTCACTA TACTTATAGC TTTACAGGTC TTAACAAGA.?\ TGATAAGGTA 1320 

GAATACTATA TCTCTGCCGC TGACAATAGT GGTCGCAAAG AGACTTATCC CTTTATCGGC 138 0 

GAACCTGATC CTTTCAAGTT TACGTGTATG AACGAAACCA ATACATGTAC TGTGACCGGA 14 4 0 

GCTGCCAAAG CTCTTCGTGC ATGGTTCAAC GCCGGTCGTT CAGAACTGGC TGTTTCGGTA 1500 

15 AGTTTGAATA TTGCCGGCAC ATATCGGATA AAGCTTTATA ACACCGCAGG AGAAGAAGTC 15 60 

GCTGCAATGA CCAAGGAATT AGTAGCAGGG ACGAGTGTCT TCAGTATGGA TGTGTATTCT 1620 

CAGGCTCCGG GCACATATGT TCTGGTTGTT GAAGGAAATG GAATCCGTGA GACAATGAAA 16 80 

ATTCTCAAA ^^^8 9 

20 

(2) IliFOHMATIOH FOR SEQ ID 110:119 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 1311 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS: double 

(D) TOPOLOGY: circular 

(ii) riOLECULE TYPE: DIJA (genomic) 

(iii) HYPOTHETICAL: MO 

(iv) ANTI-SEliSE: NO 

35 { V i ) OR I G I HAL SOU RCE : 

(A) ORGAHISII: PORY PHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) MAI1E/KEY: niisc_f eature 
40 (B) LOCATION 1...1311 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119 

ACCACAAATA GAAAACCAAA TACTAATATG AAACTTTCAT CTAAGAAAAT CTTAGCAATC 60 

45 ATTGCATTGC TGACGATGGG ACATGCTGTG CAGGCACAGT TTGTTCCGGC TCCCACCACA 120 

GGGATTCGCA TGTCTGTCAC TACAACCAAG GCCGTAGGCG AAAAAATCGA ATTGTTGGTT 180 

CATTCCATAG AGAAGAAAGG CATCTGGATC GATCTCAATG GGGATGCCAC TTACCAACAA 24 0 

GGAGAGGAAA TAACCGTATT CGATGAGGCA TACCACGAAT ACACGATCGG GACGCAAACC 300 

CTCACTATCT ATGGTAATAC GACCCGATTG GGCTGTCGAT CTACCGGTGC AACGGCTGTC 3 60 

OU GATGT7V\CGA AAAACCCTAA TCTGACCTAT CTCGCATGCC CGAAAAATAA TCTGAAATCA 4 20 

TTGGACTTGA CGCAAAACCC AAAGCTGCTG CGAGTTTGGT GCGACTCTAA CG7W\TAGAA 4 80 

AGTTTGGACC TGAGTGGCAA TCCGGCTTTG ATCATCCTCG GCTGTGACAG GAATAAGCTG 54 0 

ACTGAGCTGA AGACCGATAA CAACCCCAAG TTGGCCTCTC TTTGGTGTTC TGATAATAAC 600 

CTGACGGAGT TGGAACTCAG TGCC7VATCCT CGTCTCAATG ATCTTTGGTG CTTCGGTAAT 660 

DO CGGATCACGA AACTCGATCT GAGTGCCAAT CCTCTATTGG TAACACTTTG GTGCAGTGAC 7 20 

AATGAGCTTT CGACCTTGGA TCTTTCCAAG AATTCGGACG TTGCTTACCT TTGGTGTTCA 7 80 

TCGAACAAAC TTACATCCTT GAATCTGTCG GGGGTGAAGG GACTGAGTGT TTTGGTTTGT 840 

CATTCCAATC AGATCGCAGG TGAAGAAATG ACGAAAGTGG TGAATGCTTT GCCCACACTA 900 

TCTCCCGGCG CAGGCGCTCA GAGCAAGTTC GTCGTTGTAG ACCTCAAGGA CACTGATGAG 960 

bU AAGAATATCT GTACCGTAAA GGATGTGGAA AAAGCTAAAA GTAAGAACTG GCGAGTATTT 102 0 

GACTTCAACG GTGATTCTGA CAATATGCTT CCATACGAAG GAAGTCCGAC ATCGAACTTG 108 0 

GCAGTAGATG CTCCCACTGT CAGGATATAT CCCAATCCGG TAGGAAGATA TGCGCTCGTC 114 0 

GAGATCCCCG AGTCTCTTTT AGGGCAGGAA GCTGCTTTAT ACGATATG/U\ TGGGGTAAAA 1200 

GTCTATAGTT TCGCGGTAGA GTCTCTTCGT CAGAACATTG ACCTGACACA TCTTCCCGAC 1260 

Db GGCACTTATT TCTTCCGTCT CGATAACTAT ACCACTAAGC TCATCAAACA G 1311 



{ 2 ) III FORI lAT I ON FOR SEQ ID HO : 1 2 0 

70 (ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 954 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 
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(ii) tlOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL; NO 
(iv) Aim -SENSE: HO 



10 



15 



20 



25 



30 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROIIOIJAS GINGIVAL! S 

(ix) FEATURE: 

(A) HAI^E/KEY: mi sc_f ea t ure 

(B) LOCATION 1 . . . 954 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120 



AATCATTTGA 
CTATTTGGCT 
AGTATTTTTA 
GTATGGGGTA 
CTCACTCCCG 
GGAATAACTG 
TCTGTATTCA 
AATGCTCCCA 
CTCCCGGGGG 
CAATTAGGGG 
GCTTCCTATG 
GGCCATTGCT 
GCAGGAGACG 
AGCCCCATAG 
ATGACCGAAA 
ATGATCGAAA 



AGACAAATAT 
GTTCTTGGGC 
GGGGCATCCT 
TGGCTCGTGC 
TGCTTTTCTA 
AGTATTTCAA 
TGGCGAATCT 
TTGAAGGTGT 
TTTCTGATTG 
TTGTAAACAT 
GATCTTCCGT 
ATTTGGATAT 
ATGAGGTTTT 
TCCAAGTGGT 
ACGCTTATAC 
CACAAAATTC 



CAAGATGAGA 
ACAAGAAAGA 
TGAAAAGGTG 
GAGCGAGGAT 
TAACCGTCTT 
ATTCGCTCAA 
TTTGTACTAT 
TGTGAGCAAG 
CATAGAAATC 
CACTGACGGG 
CCGGGTGTAT 
CTACCCAACC 
TGTCAAACAA 
CGTATACGAT 
CCTATCCTTT 
GATTATCAAT 



AAAACAATAA 
GTCGATGAAA 
AAAGCACCGC 
TTCTTTTTTA 
ACAAACGAAC 
GAAGGTGATT 
CGTTTTTTCC 
ACGGGAAATC 
TCAAACAACC 
ATGGAACCTC 
GGTCATGTCT 
AATTGCTATC 
CAAGGCAGGC 
CTTGAGGGGA 
AGAGCACCCA 
AAAAAACTTA 



TTTTCTGCTT 
AAGTATTCTC 
TTATGTATGG 
TACTTCCCGT 
CCTGCTTTGT 
ACATTGAAGT 
CGACAAGAAT 
CTGCTTTTAC 
GCAAAGTCTT 
CGATTATTGC 
CACAGCGGTG 
CGCTCAGCAC 
AAATAGAGAT 
AAAGTGTTTT 
TGCTCGGCTT 
ATGTTACACA 



GTTGCTCGCC 
CGCAGGAACA 
AGATCGTGAG 
TACGGATGAC 
GTCAGACCAA 
CGAAGGAAGC 
TACCTCCTAT 
AATCCCGATG 
TCTGACCAAT 
CGGAGTCTCT 
GGACATCATA 
GAAACCCGTT 
CGATAGCAAC 
TCGCAAAAGA 
TATGACCATC 
GOT A 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
"780 
8-10 
900 
954 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID NO: 121 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTIl: 1383 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(DJ TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: HO 

(iv) AMTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPH YROHONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAJ IE/KEY: misc_feature 

(B) LOCATION 1 ... 1383 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121 



AGACGGGCAG 
CTTTTTTCAG 
CGATTGTCGA 
ACCATTATTT 
AATTCTTCTT 
ACTCAAAATC 
AACCAGTCTG 
ATAAAAGCCA 
ATATCGCATG 
TTCAAGTACA 
ATGGATCCGA 
ACCCACTGGA 
GAAAAAGGTC 
ACCGGCAAAA 
GTCGTCCAGC 
TVATATCTGCA 
ATCGAGAGCG 
ACCTGGCCTA 
GAAGGAGATG 



TTAAAATCCG 
ACTTTAAATA 
TCCGTCTTAA 
CTTTGATTGT 
ACAAACCTTT 
GTGCGCAGAA 
CAGATACTCA 
ATGGGGTAAG 
TGGATCTTTA 
CCTATGATGA 
ATACACCTAT 
TTTCTTTTGC 
TGTTGGTCAG 
CGGAATACAG 
AAGGAAAGGC 
TACAATATTT 
ATAAGTCCAT 
ATATGTACGG 
TGGCTACTCC 



ATCACCTCCC 
CTTATATTTA 
GTCATACAAT 
CTTCGGTGCT 
TTCGAAAGAA 
GGACAATGCC 
CTTTATCTTC 
AAAATCCACG 
TATCTCGTTC 
CGAGGGAAAG 
CTCACGCTTG 
TTTCGGGGCA 
CGAAGTGCTG 
CTATGACGAT 
ATGGCAAGTA 
GGCTATTAAC 
CTCCGCAAAT 
ATTCAACGCA 
TATATTCGAC 



CATATCCACT 
TCCCGAAAAA 
CCAATAATCA 
TTCTTTGCAG 
GATATTGCCG 
GAGTGGCTTC 
GATGAGAACA 
GACTCCATTT 
AGTGGCGGAG 
ATGACCGTGA 
GAATATCATT 
GAATCCCAAA 
AGCAATGCAA 
GCAGATAATA 
CTCAAA/i.GAG 
GGTACCGACA 
GTCATTGACA 
AAGCGACTGA 
TATATCTATA 



CATTATTTGT 
TTACCCAAGA 
TTATTGAAAT 
CCGTGGGCCA 
GAGGAGTTTA 
TTACAGCGAC 
ACCGCTATAT 
ACTACGATGC 
AGCCTGCACT 
GGGAAGTATT 
ATGATGCACA 
AGAATACGTA 
TGGGGACAAC 
TGGTGAAGGC 
AGGAATACAC 
CAAAGGTGTA 
TTCCGTCAAT 
AAGAGACTTA 
CGTACAAGGC 



ACGTAAATGT 
GAGATTGGGT 
GAAAAAAACA 
AACCAAGGAC 
CTCTCTCCCG 
CGTCTCCACA 
CGCTCGTGAC 
CAACGGGCGA 
CGACACCCGA 
CATGCTGGTA 
GGGCAGACTG 
TCACTATAAT 
CTATTCAGAC 
CGAGTACTTC 
CTATGAGGAC 
CAAGCGAGAC 
GCCGGAACAG 
TTCCTCCTAC 
TCTTACCTCA 



60 
120 
180 
240 
300 
360 
420 
4 80 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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ATGGCAACAC CTTCGACAGA AGCTCAGGTA GCAGTCTATC TCAATCCGTC AACGGACCGG 1200 

TTAGTGATTC TGGCCAACGG CATCACACAT CTGAGCATGT ACGACTTGCA GGGTAAGCTT 12 60 

ATCCGTGATT GTGCCTTGAG CGGCGATAAG GTGGAAATGG GTGTCGGATC TTTGACCAAA 132 0 

GGGACATACC TGCTTAAAGT GAATACGGAT CAGGGAGCCT TTGTGAGAAA AGTCGTGATT 1380 
CGA 



1383 



(2) II I FORMATION FOR SEQ ID IJO:122 

10 (i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 1353 base oairs 
(E) TYPE: nucleic acid" 
iC) STRANDEDNESS: double 
(D) TOPOLOGY: circular 

15 

Ui) IIOLECULE TYPE: DHA (genomic) 

(iii) HYPOTHETICAL: MO 

20 (iv) AlITI-SEMSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPH Y ROMONAS GIHGIVALIS 

25 (i::) FEATURE: 

(A) NAIIE/KEY: mis cofeature 

(B) LOCATION 1. . . 1353 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 



(2) INFORMATION FOR SEQ ID NO: 123 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: B85 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I -SENSE: IJO 



70 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROI lONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAI-IE/KEY: misc_feature 
75 (B) LOCATION 1...585 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



ATGGCAAT^G TTATAAAAAC T^AAAAAAGGC CTTGCACTTA ATCTGAAAGG AAAACCGCTG 60 

CCCGAGATGC TGGCCGAACC GGCCCAAAGT CCTACTTACG CGGTCGTGCC CGACGATTTT 12 0 

GAAGGTGTTA TCCCCAAGGT GACGGCTCGT CCGGGGGATA AGGTGCGTGC CGGCTCAGCA 180 

CTGATGCACC ACAAGGCATA TCCGGAGATG AAGTTTACAA GTCCGGTTAG CGGCGAAGTG 24 0 

Ctb ATCGCGGTGA ATCGCGGTGC CAAGCGCAAG GTGTTGAGCA TCGAGGTGAA ACCGGACGGA 300 

CTGAACGAAT ACGAGTCATT CCCTGTCGGG GATCCGTCTG CCCTCTCTGC CGAACAGATC 360 

AAGGAGCTTT TACTGTCGAG CGGTATGTGG GGTTTTATTA AGC7y\CGTCC TTACGACATA 4 20 

GTGGCTACAC CGGATATAGC TCCACGCGAC ATTTATATTA CTGCCAACTT TACTGCACCA 4 SO 

TTGGCTCCGG ACTTCGATTT CATCGTTCGA GGAGAAGAAC GCGCCCTGCA GACTGCCATC 54 0 

GATGCCTTGG CCAAACTCAC GACAGGAAAG GTGTATGTGG GCCTGAAGCC GGGTTCATCT 60O 

CTGGGCTTGC ACAATGCAGA AATCGTAGAA GTACACGGAC CTCATCCGGC AGGTAACGTG 660 

GGCGTGCTGA TCAATCATAC GT^GCCAATC /\ATCGGGGCG AAACGGTGTG GACGCTCAAG 7 20 

GCTACCGACC TGATCGTGAT CGGACGTTTC CTGCTTACGG GCAAAGCCGA TTTTACCAGA 7 8"^ 

ATGATTGCCA TGACCGGCTC AGACGCTGCA GCTCACGGAT ACGTCCGTAT TATGCCGGGT 84 n 

4b TGCAATGTCT TTGCTTCCTT CCCCGGCCGA CTGACAATAA AGGAATCTCA CGAGCGTGTG 900 

ATCGATGGCA ATGTGCTGAC CGGTAAGAAG CTCTGCGAGA AGGAGCCTTT CCTGTCAGCC 960 

CGGTGTGACC AGATCACGGT GATCCCCGAA GGCGACGATG TGGACGAACT CTTCGGGTGG 102^^ 

GCTGCACCCC GTCTCGATCA GTACAGCATG AGCAGAGCTT ATTTCTCTTG GTTGCAGGGG 108n 

AAAAACAAAG AGTACGTACT CGATGCCCGG ATCAAGGGTG GCGAACGTGC TATGATCATG 114 0 

OU AGCAACGAGT ATGACCGCGT TTTCCCGATG GACATCTATC CGGAGTATTT GCTCAAGGCT 12 00 

ATTATAGCAT TCGACATCGA CAAGATGGAG GACTTAGGCA TATATGAAGT GGCTCCGGAG 12 60 

GACTTTGCCA CTTGCGAATT TGTGGATACA TCCAAGATCG AGCTGCAGCG TATCGTTCGC 132 0 

GAGGGCTTGG ATATGCTCTA TAAGGAAATG AAT 1353 



BNSDOCID: <WO 9929870A1_I > 
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(xi) SEQUENCE DESCRIPTION: SEQ ID nO:123 

ATGGATAAAG TGAGCTATGC TCTGGGATTG AGCATCGGTA ATAATTTCAA GTCTTCGGGC 60 

ATCGACAGCG TCGTTATGGA TGATTTCATG CAAGGTCTGT CTGATGTACT GGAAGAAAAA 120 

GCCCCTCAGC TCTCGTATGA CGAGGCCAAG CGCGAAATAG AGGCGTATTT CATGGATTTG 180 

CAGCAGAAGG CTGTCAAACT GAACAAAGAG GCCGGAGAAG AATTCCTCAA GATAAATGCA 2^10 

CAC/^AGGAAG GTGTGACGAC CTTACCGAGC GGCTTGCAAT ACGAAGTCAT TAAGATGGGA 300 

GAGGGCCCGA AACCCACCCT TTCGGACACG GTAACCTGTC ATTATCACGG TACGCTCATC 3 60 

AACGGTATCG TTTTCGATAG CTCTATGGAC AGGGGAGAAC CGGCCAGTTT CCCTCTAAGA 4 20 

GGAGTTATAG CCGGCTGGAC GGAGATTCTT CAATTAATGC CTGTAGGATC CAAGTGGAAA 4 80 

GTAACTATAC CGAGCGATCT GGCGTATGGA GATCGTGGTG CCGGCGAACA TATCAAACCG 54 0 

GGTAGTACGC TCATTTTTAT AATCGAATTA TTGAGTATCA ACAAA 585 



(2) IJIFORHATION FOR SEQ ID IJO:124 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



25 



(ii) IIOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: NO 



30 



(iv) AMTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROHONAS GINGIVALIS 



35 



(ix) FEATURE: 

(A} NAIIE/KEY: misc_featur- 
(B) LOCATION 1 ... 819 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12^ 

ATGAAAAAAG CATTACTTAT TGGTGCTGCT CTTTTGGGAG CAGTCAGTTT TGCAAGTGCT 60 

40 CAGTCTTTGA GCACAATCAA AGTACAGAAC AATTCAGTAC AGCAACCTCG TGAGGAAGCC 120 

ACTATTCAGG TTTGTGGAGA ATTGGCAGAG CAAGTTGACT GCATTGGGAC AGGTAATTCT 180 

GCAATCATAG CCGCTGCAGC GAAATTTGAA AGCGATGATC TCGAAAGCTA TGTTGGCTGG 24 0 

GAGATCATGA GTGTTGATTT CTTCCCTGGA TATAAAGCGT GCAAGTACAC ATCTGCAGTC 300 

TGGGCTGATG ATATGACCAT TTTGGGCCAA TCAGAAGATA GTGATCCCGA AATGCAGACT 360 

45 ATCAACAATC TTGCTCTCAA GACTAGTGTC AAGATTGAAG CCGGCAAGAA TTACATAGTT 4 20 

GGTTATATTG CTAATACCGC AGGTGGACAT CCTATCGGAT GTGATCAGGG CCCTGCCGTT 4 80 

GATGGTTATG GAGATTTGGT TTCTATATCA GAAGATGGTG GTGCTACTTT CCCTCCGTTC 5 4 0 

GAATCTCTTC ATCAAGCAGT TCCTACCTTA AATTACAACA TCTATGTCGT TGTTCATTTG 600 

AAGAAGGGTG AAGGTGTTGA GGCTGTTCTT ACCAACGACA AGGCTAATGC TTATGTTCAG 660 

50 AATGGCGTTA TCTATGTAGC CGGAGCTAAT GGTCGTCAGG TATCTCTGTT CGACATGAAC 7 20 

GGTAAGGTTG TTTATACCGG CGTTAGCGAA ACGATTGCAG CTCCTCAGAA GGGCATGTAT 7 80 

ATCCTCCGTG TAGGTGCTAA GAGCATCAAG CTGGCTATC 819 

55 (2) IIJFORIIATION FOR SEQ ID NO: 125 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1662 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) riOLECULE TYPE: DNA (genomic) 
65 (iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 



70 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROI lONAS GINGIVALIS 



75 



(ix) FEATURE: 

(A) NAilE/KEY: misc_feature 

(B) LOCATION 1 . . .1662 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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(xi) SLQUEtJCE DESCRIPTION: SEQ ID MO: 125 

ATGCCAAGAA TTATGAAATT AAAAATTGCA CTCAGACTGC TGCTGGCGAC TTTTGCCATA 60 

GTTTTATTTA GCCCTCTGGC CAAGGCCCAG ATGGATATTG GTGGAGACGA TGTATTGATC 12 0 

5 GAGACGATGT CCACCCTATC AGGATATTCA GAGGATTTTT ATTACAAGAT GGCTGTGGCA 180 

GACAATGGAT GGATCTATGT GATGTTGGAT TTCTCTCGTA TTTATTTTGA TGATGTCAGG 24 0 

CTGTATCGTT CCAAAGACGG TGGTGCTACT TACCAAAAGT TAGGGTCTTT GGGGTCTTTG 300 

GTGCCTTATG ACTTCGATGT CTCGCATTGC GATTTTATTG TAACGGGAAA GGATGAAGAT 360 

GATATCAATG TTTGGACAGT CATGACAGCA TTCGAATATG TAGGTGGTAC TATTGGCAAT 420 

10 GGCGTTTTGC TGATGCATCG CCATGATGCA GATATCAATA A^ACAGAGTG TGTGTACAAG 4 80 

AAGGATTTCC CTAAT7\ATAG ACTGATGGGT GTAGCCATCG CCTCCAACTA CCGTGCGCCC 54 0 

TCTCCTTACG GTTTGGGGGG CGATCCTTTT GCTCTCGCTG TCGCCGTTAG TGGCTCCGGA 600 

AGCGATCACA GCTTCTTGGA CTATATTTTT TCGTTAGATG GTGGAGTACA CTTTGAGCAA 660 

AAGCGTATTT ACACAAGACC CCAAAAACTG ACTATCAATA GAGTAGACCT TTCATTAGGC 72 0 

15 AGTACATCTC CTTCTCTTGG ATTTAATACT TGGCCACTAA TGGGAGTCGT ATTCGAAATG 780 

AATAAGAACC TTGATGGCTT CGACATTGGT TTCATTTCCA ACTTTGTGGA CTATGATCCC 84 0 

CGCTATGCGT GGTCTGAACC GATAATAATA GAAGAAGACT GTGGATGGAC TGATTTTAAT 900 

CCTTTGGGAG CACTAAGTAT AGAGATCCAA ATGATGTTGG ATGACAATTC GGATAATACC 960 

GTGGGTGGAG AACGCTCCCA TAACTTCCTG ATCACTTACC CGGGCCATTA CGTATATCCG 102 0 

20 AAGCAATCTT TCAATTATTC TCCCGGACAT ACACCGACAA AGAAAGATCT GGTCTTTAAA 1080 

CACTGTATAG GTATTCCGGC TTTGGCATAC GATAAGGAAG GCGATCGTTA TCTGACTACT 1140 

TTTCAAGATC ACAATCTAAT GAGATACAGA TGGATCAAAT ACGATGACAT TAACTCTTTT 1200 

TATGGTTGGA GTTGGCCATA TGTATATGCA AAAGAAGCTA AAGATAAAAA GAGGCGCCGT 1260 

CCGC7KAGTAG CACTCAATCC TACCAATGGA AAGGCTTGTT GGGTATGGCA TACTCGCAAG 132 0 

25 AGCCCATATG ATGAAACCAA ACCACATCCT ACTCCTGTAA TTATTAAACA TTTCCTATGG 1380 

TCCGATACGG AGTGGGTACA TGCTCTGGAC GTGGGGGACG TATTGCAGAA GGAGGGTAGC 14 40 

ATGAAGCTCT ACCCCAATCC TGCCAAAGAA TATGTTCTGA TCAACCTACC CAAAGAAGGG 1500 

GGGCACGAGG CAGTCGTATA CGACATGCAG GGCCGAATCG TGGAGAAAGT TTCATTTTCA 1560 

GGGAAAGAAT ATAAGCTGAA TGTGCAGTAT CTGTCCAAAG GTACGTACAT GCTGAAAGTT 162 0 

30 GTAGCGGATA CGGAGTATTT CGTGGAAAAA ATCATTGTAG AG 1662 

(2) IMFORI^IATIOH FOR SEQ ID NO: 126 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1650 base pair^ 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

40 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

45 (iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROI 10NAS GIIJGIVALIS 

50 (ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION 1.,.1650 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 126 



ATGAAATTAA AAATTGCACT CAGACTGCTG CTGGCGACTT TTGCCATAGT TTTATTTAGC 60 

CCTCTGGCCA AGGCCCAGAT GGATATTGGT GGAGACGATG TATTGATCGA GACGATGTCC 120 

ACCCTATCAG GATATTCAGA GGATTTTTAT TACAAGATGG CTGTGGCAGA CAATGGATGG 180 

ATCTATGTGA TGTTGGATTT CTCTCGTATT TATTTTGATG ATGTCAGGCT GTATCGTTCC 24 0 

60 AAAGACGGTG GTGCTACTTA CCAAAAGTTA GGGTCTTTGG GGTCTTTGGT GCCTTATGAC 300 

TTCGATGTCT CGCATTGCGA TTTTATTGTA ACGGGAAAGG ATGAAGATGA TATCAATGTT 360 

TGGACAGTCA TGACAGCATT CGAATATGTA GGTGGTACTA TTGGC7VATGG CGTTTTGCTG 4 20 

ATGCATCGCC ATGATGCAGA TATCAATAAT ACAGAGTGTG TGTACAAGAA GGATTTCCCT 4 80 

AATAATAGAC TGATGGGTGT AGCCATCGCC TCCAACTACC GTGCGCCCTC TCCTTACGGT 54 0 

65 TTGGGGGGCG ATCCTTTTGC TCTCGCTGTC GCCGTTAGTG GCTCCGGAAG CGATCACAGC 600 

TTCTTGGACT ATATTTTTTC GTTAGATGGT GGAGTACACT TTGAGCAAAA GCGTATTTAC 660 

ACAAGACCCC AAAAACTGAC TATCAATAGA GTAGACCTTT CATTAGGCAG TACATCTCCT 720 

TCTCTTGGAT TTAATACTTG GCCACTAATG GGAGTCGTAT TCGAAATGAA TAAGAACCTT 7 80 

GATGGCTTCG ACATTGGTTT CATTTCCAAC TTTGTGGACT ATGATCCCCG CTATGCGTGG 84 0 

70 TCTGAACCGA TAATAATAGA AGAAGACTGT GGATGGACTG ATTTTAATCC TTTGGGAGCA 900 

CTAAGTATAG AGATCCAAAT GATGTTGGAT GACAATTCGG ATAATACCGT GGGTGGAGAA 960 

CGCTCCCATA ACTTCCTGAT CACTTACCCG GGCCATTACG TATATCCGAA GCT^lATCTTTC 1020 

AATTATTCTC CCGGACATAC ACCGACAAAG AAAGATCTGG TCTTTAAACA CTGTATAGGT 1080 

ATTCCGGCTT TGGCATACGA TAAGGAAGGC GATCGTTATC TGACTACTTT TCAAGATCAC 114 0 

75 AATCTAATGA GATACAGATG GATCAAATAC GATGACATTA ACTCTTTTTA TGGTTGGAGT 12 00 
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TGGCCATATG TATATGCAAA AGAAGCTAAA GATAAAAAGA GGCGCCGTCC GCAAGTAGCA 1260 

CTCAATCCTA CCAATGGAAA GGCTTGTTGG GTATGGCATA CTCGCAAGAG CCCATATGAT 1320 

GAAACCAAAC CACATCCTAC TCCTGTAATT ATTAAACATT TCCTATGGTC CGATACGGAG 1380 

TGGGTACATG CTCTGGACGT GGGGGACGTA TTGCAGAAGG AGGGTAGCAT GAAGCTCTAC 14 4 0 

5 CCCAATCCTG CCAAAGAATA TGTTCTGATC AACCTACCCA AAGAAGGGGG GCACGAGGCA 1500 

GTCGTATACG ACATGCAGGG CCGAATCGTG GAGAAAGTTT CATTTTCAGG GAAAGAATAT 1560 

AAGCTGAATG TGCAGTATCT GTCCAAAGGT ACGTACATGC TGAAAGTTGT AGCGGATACG 1620 

GAGTATTTCG TGGAAAAAAT CATTGTAGAG 1650 



10 



25 



70 



(2) IMFORIIATION FOR SEQ ID HO; 127 



( i ) SEQUEHCE CHARACTERISTICS : 
15 (A) LENGTH: 1170 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIIDEDIIESS: double 

(D) TOPOLOGY: circular 

20 (ii) IIOLECULE TYPE: DHA (genomic) 

(iii) HYPOTHETICAL: IIO 
(iv) AIJTI -SENSE: HO 



(vi) ORIGTIIAL SOURCE: 

(A) ORGAIIISM: PORYPHYROMOIJAS GIHGIVALIS 



(ix) FEATURE: 
30 (A) NAIME/KEY: misc_feature 

(E) LOCATION 1...1170 

(xi) SEQUENCE DESCRIPTION: SEQ ID WO: 127 

35 ATGAAACGAT TACTCCCCTT TCTCCTTTTA GCAGGACTCG TAGCCGTAGG AAACGTGTCT 60 

GCTCAGTCAC CCCGAATCCC TCAAGTGGAT GTACACACTC GCATCGCAAG AAATGCCCGT 12 0 

TATCGACTGG ACAAGATCAG TGTCCCGGAT TCTCGTCAGA TATTCGATTA CTTCTATAAA 18 0 

GAAGAAACGA TACCCACTAA 7VATACAAACG ACCACAGGAG GTGCAATTAC AAGCATCGAT 24 0 

TCGCTTTTCT ATGAAGACGA CAGGTTGGTT CAGGTGCGCT ATTTTGACAA TAACCTTGAA 300 

40 TTAAAACAAG CGGAGAAGTA TGTATACGAC GGTTCTAAGC TGGTCCTTCG AGAAATTCGC 360 

AAGTCGCCGA CAGACGAAAC GCCAATAAAG AAAGTTAGCT ATCACTATCT CTGTGGCAGC 420 

GATATGCCTT TTGAGATTAC GACAGAGATG AGCGATGGCT ATTTTGAAAG CCATACGCTT 4 80 

AACTATCTGA ATGGAAAGAT TGCCCGAATA GAT AT CAT GA CTCAACAGAA CCCATCGGCC 54 0 

GAATTGATCG AAACGGGTAG AATGGTATAT GAGTTTGATG CCAATAATGA TGCTGTACTG 600 

45 CTTCGTGACA GTGTATTTCT TCCTCTTCAA AACAAGTGGG TAGAAATGTT TACTCACCGT 660 

TATACATACG ACAATAAGCA TAATTGTATT CGTTGGGAAC AAGACGAATT CGGCACCCTC 72 0 

ACCCTTGCCA AC.^CTTCGA ATACGACACC ACTATCCCTC TGTCGTCTGT ATTGTTCCCC 7 80 

ACGCATGAGG AGTTCTTCCG TCCTCTTCTT CCCAATTTTA TGAAGCATAT GCGTACGAAG 84 0 

CAAACGTATT TCAATAACTC CGGAGAAGGC TTGTCAGAGG TATGCGATTA CAACTACTTC 900 

50 TATACCGATA TGCAGGGTAA TGCACTGACC GATGTTGCCG TGAACGAATC GATCAAGATT 960 

TATCC'I'CGTC CTGCCACGGA TTTTCTGCGT ATAGAAGGTT CGCAACTGCT TCGCCTTTCG 102 0 

CTATTCGACA TGAACGGGAA GCTCATCAGA GCTACCGAAT TGACAGGCGA TTTGGCCATT 108 0 

ATCGGAGTTG CATCTCTTCC GAGAGGCACT TACATCGCAG AAATAACTGC TGCAAACAGC 114 0 

AAAACCATAC GTGCAAAAGT ATCGCTCAGA 117 0 

55 

(2) INFORMATION FOR SEQ ID NO: 128 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1233 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

65 Ui) MOLECULE TYPE: DHA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 



(ix) E'EATURE: 
75 (A) NAI-IE/KEY: misc feature 
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BNSDOCID: <WO 9929870A1J_> 



wo 99/29870 PCT/AU98/01023 



88 / 490 



(B) LOCATIOM 1...1233 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:128 

5 ATGAGACAGC ATTTATCTCT ATTTCCTTTT ATCTTGTTTC TGCTTCTTGC CTTCTCTTAT 60 

GTCGGTTGCA GAACAGTCCG ACAAACACCT AAGCAGTCGG AACGGTACGT CGTAGTCCTG 120 
TCTTTGGACG GCTTCCGACC GGACTATACC GATCGGGCAC GTACACCGGC GTTGGATCGG 180 
ATGGCACAGG AGGGATTGAG CGGGTCGCTC CAACCATGCT TCCCCTCGCT TACATTTCCC 2^0 
AATCATTACA GCATGGCTAC GGGGCTTTAC CCCGATGATC ACGGTATCGT AGCCAATGAG 300 

10 TTTGTGGATT CGCTACTGGG CATCTTTCGT ATATCCGACC GAAAAGCCGT GGAGACCCCC 360 
GGATTTTGGG GCGGCGAGCC GGTTTGGAAT ACGGCCGCAC GCCAAGGCAT CCGTACCGGT 4 20 

GTCTACTTTT GGGTAGGATC CGAAACGGCT GTGAACGGAA ATCGGCCGTG GCGGTGGAAA 4 80 

AAATTCTCCT CCACCGTTCC GTTTCGTGAC CGTGCCGACT CCGTCATCGC GTGGCTCGGA 54 0 

CTGCCCGA.AA AGGAGCGACC GCGCTTGCTC ATGTGGTACA TCGAGGAGCC GGATATGATC 600 

15 GGACACAGCC AAACGCCCGA AAGCCCGCTG ACACTGGCAA TGGTAGAGCG GTTGGACAGT 660 

GTGGTCGGCT ATTTCCGCAA GCGGTTGGAC TCTCTGCCCA TAGCCGCACA GACCGACTTC 7 20 

ATCATAGTAT CCGATCACGG TATGGCCACG TACGAAAATG AGAAATGTGT CAATCTGTCG 180 
CATTATCTGC CTGCGGACAG TTTCCTCTAC ATGGCCACCG GGGCCTTCAC CCACTTGTAC 84 0 

CCGAAGCCCT CCTATACCGA GCGAGCCTAT GAGATCCTGC GGGCCATTCC ACATATATCG 900 

20 GTTTACCGCA AGGGGGAGGT GCCCAAGCGT TTGCGCTGTG GCACCAATCC TCGTTTGGGC 960 

GAACTGGTCG TGATTCCGGA CATAGGCTCC ACCGTCTTTT TCGCAATAAA TGAAGACGTT 102 0 

CGTCCGGGAG CGGCACATGG CTATGACAAC CAAGCACCGG AAATGCGGGC TTTACTCCGG 1080 

GCTGTCGGAC CCGATTTCCG TCCGGGCAGT AGGGTGGAAA ACCTGCCGAA TATCACCATC 114 0 

TATCCGCTCA TATGCAGGCT GTTGGGTATA GAGCCTGCAC CCAACGATGC GGACGAAACG 12 CO 

25 TTGCTGAACG GCCTGATCCG AGACAAACGA CCA 1233 

(2) INFORMATION FOR SEQ ID NO: 129 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

t C ) STRANDEDHESS : doubl e 
(D) TOPOLOGY: circular 

35 

(ii) MOLECULE TYPE: DMA (genomic) 

{iii ) HYPOTHETICAL: NO 

40 (iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

45 (ix) FEATURE; 

(A) NAME/ KEY: mis cofeature 

(B) LOCATIOl] 1 . , .738 



50 



65 



SEQUENCE DESCRIPTION: SEQ ID MO; 129 



ATGAAAGTAG GTTTGTTCAT CCCCTGTTAT GTCAATGCAG TGTATCCGGA AGTGGGTATC 60 

GCCACGTACA AACTGCTGAA GAGTTTGGAC ATAGATGTCG ACTACCCGAT GGATCAGACA 120 

TGTTGCGGCC AGCCTATGGC CAATGCCGGA TTCGAACAGA AAGCTCAAAA GCTGGCTTTG 180 

CGATTCGAAG AGCTGTTCGA GTCGTATGAT GTAGTCGTAG GGCCATCGGC CAGTTGCGTT 24 0 

55 GCTTTCGTGA AAGAAAACTA TGATCATATC CTCAGACCGA CAGGACATGT CTGCAAGTCG 300 

GCAGCCAAGG TTCGGGATAT ATGCGAGTTC TTGCACGATG ACCTGAAGAT CACCAGCCTC 360 

CCCTCCCGAT TCGCCCATAA GGTGAGCCTG CACAACAGTT GCCACGGTGT GCGCGAACTG 4 20 

CATCTGTCCA CCCCCAGTGA AGTGCACCGA CCGTACCACA ACAAGGTGCG CCGGCTATTG 4 80 

GAGATGGTGC AGGGCATAGA GGTATTCGAG CCGAAGCG;^ TAGACGAATG CTGCGGTTTC 54 0 

60 GGCGGTATGT ACTCGGTGGA GGAGCCGGAG GTATCCACCT GTATGGGGCA TGACAAGGTG 600 

CTGGATCACA TATCCACAGG TGCGGAGTAC ATCACAGGGC CGGACAGCTC GTGCCTCATG 660 

CATATGCAGG GAGTGATAGA CAGAGAGAAA TTGCCGATCA AGACAATTCA TGCAGTAGAA 7 20 

ATTTTAGCAG CAAACTTA 7 38 



(2) INFORMATION FOR SEQ ID NO: 130 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 738 base pairs 

70 (B) TYPE: nucleic acid 

(C) STRAIIDEDMESS: double 

(D) TOPOLOGY: circular 



75 



(ii) MOLECULE TYPE: DNA {genomic} 
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(iii) HYPOTHETICAL: UO 

(iv) AMTI-SEMSE; NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAHISIl: PORY PHYROMONAS GINGIVALIS 



(ix) FEATURE: 

(A) WAI-IE/KEY: misc_feature 
10 (B) LOCATION 1...738 



(xi) SEQUENCE DESCRIPTION: SEQ ID IIO:130 



ATGGATATTG TAAGTATGGC CGAT7W\GCT 
15 TGTCAGGAGG AAAACGTCAT TTTTCAAAAT 
GTCTATCTGA TAGGCTCAGT GGGATCGGGG 
GAGG'J'GCCTA TCTCTGCCGG TTATGCCCGC 
CGGAAGCAGT TGCCCTATCT GCGCAGGAAT 
CTGAACGGAC GTACTGTTGC GGAGAATTTG 
20 AACCGAGCCG ATCGCGAGCA GCGTATCGAG 
AAGGCTTATA AGAGACCGCA CGAACTGTCC 
AGAGCTTTGC TGGCGAAGCC TGCGTTGATC 
TCGGTGACCG GATTGCAGAT CGCTTCTCTG 
GTACTTATGA GCACGCACAA CAGCAGCCTG 
25 GTTCGTAAGA ATGGCGATGC CTCCTCTTTG 
AAAAATACGG AAATAGAT 



CTTGTAGTGG AGATGAGAGA TGTGACGCTC 60 

TTGAATCTGA CCCTTTCCGC CGGAGACTTC 120 

AAGAGCACTT TGCTGAAGGC TTTGTATGCT 180 

GTGATAGATT ATGATCTGGC AAAGTTGAAA 2^0 

TTGGGCATTG TGTTTCAGGA TTTCCAGTTG 300 

GATTTCGTTT TGCGAGCTAC GGACTGGAAA 360 

GAGGTTTTGA CCCGTGTGGG AATGTCTCGG 4 20 

GGAGGGGAGC AACAACGTGT GGGTATAGCC 4 80 

CTGGCCGACG AACCCACAGG CAACCTCGAT 54 0 

CTCTACGAAA TCAGTAAGCA GGGCACTGCA 600 

CTGTCGCATC TGCCGGCACG GACATTGGCC 6 60 

GTCGAGCTGA GTGCAGATGC TGTTTCAAGA 72 0 

738 



30 



40 



(2) INFORMATION FOR SEQ ID MO: 131 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 723 base pairs 

(B) TYPE: nucleic acid 
(C} STRAHDEDNESS : double 

35 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 



45 

(ix) FEATURE: 



(A) NAME/ KEY: misc^feature 

(B) LOCATION 1 ... 723 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID HO: 131 



ATGGCCGATA AAGCTCTTGT AGTGGAGATG 
GTCATTTTTC AAAATTTGAA TCTGACCCTT 
TCAGTGGGAT CGGGGAAGAG CACTTTGCTG 

55 GCCGGTTATG CCCGCGTGAT AGATTATGAT 
TATCTGCGCA GGAATTTGGG CATTGTGTTT 
GTTGCGGAGA ATTTGGATTT CGTTTTGCGA 
GAGCAGCGTA TCGAGGAGGT TTTGACCCGT 
CCGCACG AAC TGTCCGGAGG GGAGCAACAA 

60 AAGCCTGCGT TGATCCTGGC CGACG7\ACCC 
CAGATCGCTT CTCTGCTCTA CGAAATCAGT 
CACAACAGCA GCCTGCTGTC GCATCTGCCG 
GATGCCTCCT CTTTGGTCGA GCTGAGTGCA 
GAT 

65 



AGAGATGTGA CGCTCTGTCA GGAGGAAAAC 60 

TCCGCCGGAG ACTTCGTCTA TCTGATAGGC 120 

AAGGCTTTGT ATGCTGAGGT GCCTATCTCT 180 

CTGGCAAAGT TGAAACGGAA GCAGTTGCCC 24 0 

CAGGATTTCC AGTTGCTGAA CGGACGTACT 300 

GCTACGGACT GGAAAAACCG AGCCGATCGC 360 

GTGGGAATGT CTCGGAAGGC TTATAAGAGA 4 20 

CGTGTGGGTA TAGCCAGAGC TTTGCTGGCG 4 80 

ACAGGCAACC TCGATTCGGT GACCGGATTG 54 0 

AAGCAGGGCA CTGCAGTACT TATGAGCACG 600 

GCACGGACAT TGGCCGTTCG TAAGAATGGC 660 

GATGCTGTTT CAAGAAAAAA TACGGAAATA 7 20 

723 



(2) INFORMATION FOR SEQ ID NO: 132 



(i) SEQUENCE CHARACTERISTICS: 
70 (A) LEIIGTH: 696 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 

(D) TOPOLOGY: circular 

75 (ii) IIOLECULE TYPE: DHA (genomic) 
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(iii) HYPOTHETICAL: NO 

(iv) AIJTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GIMGIVALIS 



10 



15 



20 



25 



lix) FEATURE: 

(A) NAI'lE/KEY: misc_feature 

(B) LOCATION 1 . , .696 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132 



ATGAGAGATG 
CTTTCCGCCG 
CTGAAGGCTT 
GATCTGGCAA 
TTTCAGGATT 
CGAGCTACGG 
CGTGTGGGAA 
CAACGTGTGG 
CCCACAGGCA 
AGTAAGCAGG 
CCGGCACGGA 
GCAGATGCTG 



TGACGCTCTG 
GAGACTTCGT 
TGTATGCTGA 
AGTTGAAACG 
TCCAGTTGCT 
ACTGGAAAAA 
TGTCTCGGAA 
GTATAGCCAG 
ACCTCGATTC 
GCACTGCAGT 
CATTGGCCGT 
TTTCAAGAAA 



TCAGGAGGAA 
CTATCTGATA 
GGTGCCTATC 
GAAGCAGTTG 
GAACGGACGT 
CCGAGCCGAT 
GGCTTATAAG 
AGCTTTGCTG 
GGTGACCGGA 
ACTTATGAGC 
TCGTAAGAAT 
AAATACGGAA 



AACGTCATTT 
GGCTCAGTGG 
TCTGCCGGTT 
CCCTATCTGC 
ACTGTTGCGG 
CGCGAGCAGC 
AGACCGCACG 
GCGAAGCCTG 
TTGCAGATCG 
ACGCACAACA 
GGCGATGCCT 
ATAGAT 



TTCAAAATTT 
GATCGGGGAA 
ATGCCCGCGT 
GCAGGAATTT 
AGAATTTGGA 
GTATCGAGGA 
AACTGTCCGG 
CGTTGATCCT 
CTTCTCTGCT 
GCAGCCTGCT 
CCTCTTTGGT 



GAATCTGACC 
GAGCACTTTG 
GATAGATTAT 
GGGCATTGTG 
TTTCGTTTTG 
GGTTTTGACC 
AGGGGAGCAA 
GGCCGACGAA 
CTACGAAATC 
GTCGCATCTG 
CGAGCTGAGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
696 



30 



35 



40 



45 



50 



55 



60 



(2) IHFORi^lATION FOR SEQ ID HO:133 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

( i I i ) HY POTHETICAL : NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY: misc_f ea t ur e 

(B) LOCATION 1 . . . 557 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133 



ATGATTGAAA 
CTCGATGGCG 
GGATGCGGTA 
ATCTACAAGC 
CGTAAGGGCA 
AGCGAGAACG 
CGAGTGGAGG 
CAGCTCTCCG 
AAGCTCATCC 
ATGGAACTGC 
GAGCACGATG 



TCAGCAACCT 
TATCGCTCAA 
AGTCCACTCT 
TCGATGGGGC 
ATATCGGCTT 
TGGAGTTGCC 
AGGCACTGCG 
GAGGACAACA 
TCGCCGATGA 
TCAGAGGTCT 
CACGTAGTGC 



CACCAAGGTT 
AGTGGACAAA 
GCTCAATATC 
AGAAGTGGGC 
CGTATTCCAG 
GCTCGTCTAT 
CAAGATGAGC 
GCAGCGCGTG 
ACCCACGGGT 
CAATCGCGAA 
CGGCCGCATC 



TTCAGAACAG 
GGCGAATTTA 
CTCGGCCTTC 
AACCTCCGGG 
AGCTTCAACC 
CTGGGTGTGA 
ATCAGCCACC 
GCTATCGCCC 
AACCTCGACT 
GGTGCAACCA 
ATCAATCTGT 



AAGAAATAGA 
TCGCCATAAT 
TCGACAATCC 
AAAAAGACAG 
TCATCGAAGA 
AGGCCTCCGA 
GGGCCGGCCA 
GTGCCGTGGT 
CCAAAAACGG 
TCGTCATGGT 
TCGACGGTAA 



GACGGTAGCC 
GGGGCCTTCG 
CACTTCCGGT 
GACTGCCGTC 
GATGACGGTA 
GCGGAAAGAG 
CTTCCCCAAT 
GGCCAATCCG 
AGCCGATGTC 
GACGCACTCC 
GATTCGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
657 



65 (2) INFORMATION FOR SEQ ID NO: 134 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1785 base pairs 

(B) TYPE: nucleic acid 
70 (C) STRANDEDNESS: double 

(0) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
75 (iii) HYPOTHETICAL: NO 
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10 



65 



(iv) AIJTI-SEMSE: MO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMOMAS GIUGIVALIS 

(ix) FEATURE: 

(A) HAT 'IE/ KEY: mi sc_ feature 

(B) LOCATIOIJ 1 ... 1785 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 134 



ATGAAAGAAT TTTTCAAAAT GTTTTTCGCC TCGATCCTCG GGGTTATAAC GGCAGGAATC 60 

ATCTTGTTCT GTATCTTTCT ATTTATCTTT TTCGGCATCG TAGCCGGTAT TGCCTCCAAG 120 

15 GCAACGGGAG GAACCATTCC GAAGATCGAA GCAAACTCCA TCCTACATAT ANACAATTCT 180 

TCTTTCCCTG AGATCGTATC GGCCAATCCC TGGAGCATGC TCACAGGCAA AGACGAGTCC 24 0 

GTATCGCTCT CACAGGCAGT CGAAGCCATC GGCCAAGCCA AAAATAATCC CAACATAACC 300 

GGTATCTTCC TCGATCTGGA CAACCTTTCC GTCGGTATGG CATCGGCAGA GGAATTGCGT 360 

CGCGCGTTGC AGGATTTCAA GATGTCGGGC AAGTTCGTCG TATCCTATGC CGACAGATAC 4 20 

20 ACCCAAAAGG GTTACTACCT CTCCAGTATT GCAGACAAAC TCTACCTCAA TCCGAAAGGA 4 80 

ATGTTGGGGC TTATCGGGAT TGCGACCCAA ACAATGTTCT ACAAAGATGC CCTCGACAAA 54 0 

TTCGGCGTGA AGATGGAGAT CTTCAAGGTA GGCACCTACA AGGCAGCCGT AGAGCCATTC 600 

ATGCTCAACA GGATGAGCGA TGCCAATCGC GAACAAATCA CCACATACAT AAACGGGCTT 660 

TGGGACAAGA TCACATCCGA TATTGCAGAG TCGCGCAAGA CGGCAATGGA TTCCGTGAAA 7 2C 

25 ATGTTTGCCG ACAAAGGCGA AATGTTCGGT CTTGCCGAGA AAGCGGTGGA GATGAAGCTC 7 SO 

GTGGATGAGC TGGCTTACCG TACCGATGTG GAGAAAGAAC TCAAAAAGAT GTCCCAACGC 84 0 

GGAGAGAAAG ATGAACTTCG GTTCGTATCG CTTTCTCAGG TTCTGGCCAA TGGCCCGATG 900 

AAC/yVAACGA AAGGCAGTCG GATCGCCGTT CTCTTTGCCG AAGGTGAAAT AACGGAAGAA 960 

ATAATAAAGA AGCCGTTCGA CACTGACGGT AGCTCCATCA CACAAGAACT CGCCAAAGAA 102 0 

30 ATCAAGGCAG CAGCCGATGA CGATGATATC AAAGCCGTAG TACTTCGTGT CAATTCTCCG 1080 

GGAGGTAGTG CTTTCACTTC CGAACAGATA TGGAAGCAGG TAGCCGATCT CAAGGCCAAA 114 0 

AAGCCTATCG TGGTCTCCAT GGGCGACGTA GCAGCCTCGG GCGGATACTA CATAGCCTGC 12 00 

GCAGCCAACA GTATCGTGGC AGAGCATACG ACTCTGACCG GCTCCATCGG CATATTCGGC 12 60 

ATGTTCCCGA ACTTCGCGGG CGTAGCCAAG AAGATAGGAG TGAATATGGA CGTCGTACAG 132 0 

35 ACATCCAAGT ATGCAGACTT- GGGCAACACC TTCGCTCCGA TGACGGTCGA AGATCGTGCC 1380 

CTCATCCAAC GCTACATAGA GCAGGGCTAC GACCTCTTCC TCACTCGCGT ATCGGAAGGC 14 4 0 

CGCAACCGCA CCAAGGCACA GATCGACAGC ATCGCTCAAG GCCGTGTATG GCTCGGCGAC 1500 

AAAGCTCTTG CACTCGGTTT GGTGGATGAG CTTGGAGGTT TGGACACAGC TATCAAACGG 1560 

GCCGCGAAGC TGGCTCAGCT CGGTGGCAAC TACAGCATAG AGTATGGCAA GACCAAGCGC 1620 

40 AACTTCTTCG AAGAGTTGCT CTCCTCATCA GCAGCGGATA TGAAGTCTGC CATCCTGAGT 168 0 

ACCATTCTCT CCGATCCGGA AATAGAAGTT CTGCGCGAAC TCCGCTCCAT GCCGCCCCGT 17 4 0 

CCTTCGGGCA TACAGGCACG TCTCCCCTAT TACTTCATGC CGTAC 17 8 5 

45 (2) IMFORMATIOM FOR SEQ ID NO: 135 

(i) SEQUEKCE CHARACTERISTICS: 

(A) LENGTH: 1767 baee pairs 

(B) TYPE : nucleic acid 
50 (C) STRAIIDEDHESS : double 

{D) TOPOLOGY: circular 

(ii ) MOLECULE TYPE: DNA {genomic) 

55 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 
60 {A) ORGANISM: PORY PHYR0I40r IAS GINGIVAL IS 

(ix) FEATURE: 

(A) NA14E/KEY: misc_feature 

(B) LOCATION 1 ... 1767 



(xi) SEQUENCE DESCRIPTIOIJ : SEQ ID NO: 135 



ATGTTTTTCG CCTCGATCCT CGGGGTTATA ACGGCAGGAA TCATCTTGTT CTGTATCTTT 60 

CTATTTATCT TTTTCGGCAT CGTAGCCGGT ATTGCCTCCA AGGCAACGGG AGGAACCATT 120 

70 CCGAAGATCG AAGCAAACTC CATCCTACAT ATAMACAATT CTTCTTTCCC TGAGATCGTA 180 

TCGGCCAATC CCTGGAGCAT GCTCACAGGC AAAGACGAGT CCGTATCGCT CTCACAGGCA 24 0 

GTCGAAGCCA TCGGCCAAGC CAAAAATAAT CCCAACATAA CCGGTATCTT CCTCGATCTG 300 

GACAACCTTT CCGTCGGTAT GGCATCGGCA GAGGAATTGC GTCGCGCGTT GCAGGATTTC 360 

AAGATGTCGG GCAAGTTCGT CGTATCCTAT GCCGACAGAT ACACCCAAAA GGGTTACTAC 4 20 

75 CTCTCCAGTA TTGCAGACAA ACTCTACCTC AATCCGAAAG GAATGTTGGG GCTTATCGGG 4 80 
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ATTGCGACCC AA.ACAATGTT CTACAAAGAT 

ATCTTCAAGG TAGGCACCTA CAAGGCAGCC 

GATGCCAATC GCGAACAAAT CACCACATAC 

GATATTGCAG AGTCGCGCAA GACGGCAATG 

5 GAAATGTTCG GTCTTGCCGA GAAAGCGGTG 

CGTACCGATG TGGAGAAAGA ACTCAAAAAG 

CGGTTCGTAT CGCTTTCTCA GGTTCTGGCC 

CGGATCGCCG TTCTCTTTGC CGAAGGTGAA 

GACACTGACG GTAGCTCCAT CACACAAGAA 

10 GACGATGATA TCAAAGCCGT AGTACTTCGT 

TCCGAACAGA TATGGAAGCA GGTAGCCGAT 

ATGGGCGACG TAGCAGCCTC GGGCGGATAC 

GCAGAGCATA CGACTCTGAC CGGCTCCATC 

GGCGTAGCCA AGAAGATAGG AGTGAATATG 

15 TTGGGCAACA CCTTCGCTCC GATGACGGTC 

GAGCAGGGCT ACGACCTCTT CCTCACTCGC 

CAGATCGACA GCATCGCTCA AGGCCGTGTA 

TTGGTGGATG AGCTTGGAGG TTTGGACACA 

CTCGGTGGCA ACTACAGCAT AGAGTATGGC 

20 CTCTCCTCAT CAGCAGCGGA TATGAAGTCT 

GAAATAGA.AG TTCTGCGCGA ACTCCGCTCC 

CGTCTCCCCT ATTACTTCAT GCCGTAC 



GCCCTCGACA AATTCGGCGT GAAGATGGAG 5^0 

GTAGAGCCAT TCATGCTCAA CAGGATGAGC 600 

ATAAACGGGC TTTGGGACAJ\ GATCACATCC 660 

GATTCCGTGA AAATGTTTGC CGACAAAGGC 7 20 

GAGATGAAGC TCGTGGATGA GCTGGCTTAC 7 80 

ATGTCCCAAC GCGGAGAGAA AGATGAACTT 84 0 

AATGGCCCGA TGAACAAAAC GAAAGGCAGT 900 

ATAACGGAAG AAATAATAAA GAAGCCGTTC 960 

CTCGCCAAAG AAATCAAGGC AGCAGCCGAT 1020 

GTCAATTCTC CGGGAGGTAG TGCTTTCACT 108 0 

CTCAAGGCCA A-AAAGCCTAT CGTGGTCTCC 114 0 

TACATAGCCT GCGCAGCCAA CAGTATCGTG 1200 

GGCATATTCG GCATGTTCCC GAACTTCGCG 1260 

GACGTCGTAC AGACATCCAA GTATGCAGAC 1320 

GAAGATCGTG CCCTCATCCA ACGCTACATA 1380 

GTATCGGAAG GCCGCAACCG CACCAAGGCA 144 0 

TGGCTCGGCG ACAAAGCTCT TGCACTCGGT 1500 

GCTATCAAAC GGGCCGCGAA GCTGGCTCAG 1560 

AAGACCAAGC GCAACTTCTT CGAAGAGTTG 1620 

GCCATCCTGA GTACCATTCT CTCCGATCCG 1680 

ATGCCGCCCC GTCCTTCGGG CATACAGGCA 17 4 0 

1767 



25 



(2) ItiFORIlATIOU FOR SEQ ID MO: 136 



30 



(i) SEQUEIICE CHARACTERISTICS: 

(A) LEMGTH: 939 base pair^ 

(B) TYPE: nucleic acid 

(C) STRAMDEDIJESS: double 

(D) TOPOLOGY: circular 



35 



(ii) IIOLECULE TYPE: DIJA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) AHTI-SEUSE: NO 



40 



(vi) ORIGIIJAL SOURCE: 

(A) ORGAiasM: PORY PH YROMOMAS GIMGIVALIS 



45 



(ix) FEATURE: 

(A) MAI1E/KEY: misc_f ea ture 

(B) LOCATION 1 . . ,9 39 

(Ki) SEQUENCE DESCRIPTION: SEQ ID NO: 136 



50 



55 



60 



65 



ATGAGAGCAA 
ATCGGACAGG 
GAAGCTATCC 
TTGGTGGAGA 
ATCAAATGCC 
GAATGCTTCC 
CTCCAACCAC 
ACGGGGTTGC 
ATGGTGGAGC 
AAGTCTTACC 
CTCAAGCACG 
ACCTTCGATT 
GATGCTCCCG 
GCATGCTCCA 
ATGTCCCAAC 
GAATATATAC 



ACATTTGGCA 
CTCAGAGTCG 
GACAGATGAG 
CAGGAGCCGG 
ACAAATCGTG 
GCAGCTATTC 
GCTATCGTCC 
AACGCTGTGG 
TGTATGAGCT 
CCGGGTCCAA 
AAGCTTACTT 
CTTTGGCCGA 
TGGATTTCCC 
TCTCCAAACA 
GCTATGGCAT 
CCCAAGAGGG 



GATACTTTCC 
AAACCGTACA 
CCGCTACAAT 
AGCCAGTACA 
GACGGGCAAG 
GGCCGCTCGC 
CCTGTTCAAA 
CTATGCCACC 
ATATGCTTTG 
CAAAAAATCC 
CAGCTACGGA 
AGAGTTCGAC 
GATCGAAAAG 
CACACAGGAC 
CCGGATGAAG 
CGATATACTG 



GTTTCGGTTC 
TACGAGGCTT 
ATACCGGCAA 
CTGGCCAGCG 
CGCACCTATC 
GAATCGTATG 
CTCGACAGAG 
AATCGGGGCT 
GATCGCGAGA 
CATCAAACGA 
CTGCTCTACA 
ATGAGAGCCT 
GGCGATGTGA 
GTAGTGCGTG 
AACCTCTACA 
CGCTTGCGC 



ATGTGAAACA 
GCATCACCAT 
TACACAACAA 
GTACCGACGA 
AAGATCATTC 
AAGACTATCG 
ATGCCAATCT 
AGTACCCCTC 
CCAAGCAGAA 
TCATAGCCAA 
CCAAACTGGC 
TCTATCTGGA 
TGGGCGATTC 
AGCTCAACGA 



CGGGACAGCG 
GTACGCCGAC 
AGCACAGGCT 
TCACTTCGGG 
TGCGCCGAAC 
CCGATTTCTG 
GGGCTGGGCT 
GCTGATCAAG 
ATGGTTCCAC 
GCAGAGCGGA 
GCAAGGCGAT 
CAAATACAAC 
GAAAAAGCAC 
GATGCACAGT 
CAAGGATGGC 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
939 



(2) INFORMATION FOR SEQ ID NO: 137 



70 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1569 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



75 



(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 

(iv) anti-sense:: no 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMOUAS GINGIVALIS 

(ix) FEATURE: 

(A) NAI^IE/KEY: misc_f eature 
10 (B) LOCATION 1...1569 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137 

ATGGACGGAC GTCGATATTC GGATGGCCTC CATCAGGCTA TCGAAGCCAA AGAGCATGTG 60 

15 AAAGTAGAGG CTGCGACACA GACATTTGCA ACTATCACTT TGCAGAACTA TTTCCGCATG 120 

TATCATAAGC TGGCAGGGAT GACCGGTACT GCTGAAACTG AAGCGGGAGA GCTTTGGGAC 180 

ATCTACAAAC TGGACGTTGT AGTTATTCCG ACAAACAAGC CTATCGCCCG TAAGGATATG 24 0 

T^TGATCGTA TCTATAAGAC GGCACGTGAA AAATATGCAG CAGTTATCGA AGAGATTGTA 300 

CGTCTTGTCG AAGAGGGCAG ACCTGTACTT GTCGGTACTA CTTCGGTGGA AATATCCGTVA 3 60 

20 TTGTTGAGCC GTATGTTACG CTTGCGTGGC ATCCAACACA ATGTACTCA.A TGCCA7\ATTG A20 

CATCAGAAGG AGGCCGAGAT TGTAGCTCAG GCCGGTCAGA AAGGAACTGT TACCATCGCA 4 80 

ACGAACATGG CCGGTCGTGG TACCGACATC AAGCTCTCTG CCGAGGTTAA GAAAGCCGGG 5^0 

GGTTTGGCTA TCATTGGTAC GGAAAGGCAC GAATCCAGAC GAGTGGACAG ACAGCTTCGT 600 

GGTCGTTCCG GCCGTCAGGG TGATCCCGGT TCGTCCATAT TCTATGTTTC CCTTGAAGAT 660 

25 CATCTGATGC GCCTCTTTGC CACAGAAAAG ATTGCATCAT TGATGGATCG TTTAGGTTTC 720 

AAGGAAGGAG AAGTGCTCGA AAACAACATG CTGAGTAAGT CCGTGGAGCG TGCTCAAAAG 780 

AAGGTGGAAG AGAACAACTT CGGTATCCGT AAACATCTGC TTGAGTACGA TGATGTAATG 84 0 

AATTCGCAGC GTGAAGTCAT TTATACCCGT CGCCGTCATG CTTTGATGGG AGAGCGTATC 9 00 

GGTATGGATG TACTCAATAC CATATACGAC GTATGTAAGG CTCTGATTGA CAATTATGCA 960 

30 GAAGCCAATG ATTTCGAAGG CTTCAAGGAA CATCTGATGC GTGCACTCGC GATAGAATCT 1020 

CCTATCACGC AAGAAATATT CAGAGGTAAG AAAGCAGAAG AGCTGACCGA TATGCTTTTC 108 0 

GATGAAGCTT ACAAGTCTTT CCAACGTAAG ATGGATCTGA TCGCAGAAGT GGCCCACCCT 114 0 

GTGGTTCATC AGGTATTCGA GACCCAAGCC GCCGTGTACG AGCGCATTCT AATCCCCATT 1200 

ACGGATGGTA AACGTGTCTA TAACATAGGA TGCAATTTGC GTGAAGCGGA TGAMCTCTVA 12 60 

35 GGGAAAAGCA TCATCAAAGA ATTTGAGAAA GCTATCGTAC TGCATACTAT CGATGAGTCT 132 0 

TGGAAAGAAC ATCTGCGTGA GATGGACGAG CTTCGTAATT CCGTTCAGAA TGCCAGCTAC 1380 

GAAAACAAAG ATCCACTACT TATCTATAAA CTCGAATCTT ACGAACTGTT CCGCAAGATG 14 4 0 

GTAGAAGCCA TGAACCGTAA GACCGTAGCG ATCCTT^ATGC GTGCTCGGAT ACCGGTACCG 1500 

GAGGCTCCTT CCCAAGAAGA GCTGGAACAC AGGCGGCAAA TAGAAATCCG ACATGCAACC 1560 

40 CAACAACGT 156 9 

(2) INFORMATION FOR SEQ ID NO: 138 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1125 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIiDEDNESS: double 

(D) TOPOLOGY: circular 

50 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

55 (iv) ANTI- SENSE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMOUAS GINGIVALIS 

60 (ix) FEATURE: 

(A) NAI'lE/KEY: mis cofeature 

(B) LOCATION 1 . . . 1125 



PCT/AU98/01023 



65 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 138 



ATGAATTTCT TAAAAAAAGA ACCGTTTAAA ATATTCTCTA TGATTTATCT GCTGTTAGAT 60 

ACAATAACAA ACCGTGCCGG TACAGAACGC GCCGTGATCA ACTTGGCTAA CAACCTGCAT 120 

GCCAATGGTC ATCGCGTATC ATTAGTCAGC GTTTGTACAA AAGAAGGAGA GCCTTCCTTC 180 

CAAGTAGAAA AAGGAATAGA AGTACACCAT CTCGGAATTA GGCTTTATGG CAATGCATTA 24 0 

70 GCCCGCAAAA CAGTATATTT CAAGGCTTAT CGAAGGATAA AAGCCCTATA CAAGAAGCGT 300 

GAACCGGTTT TATTGATAGG GACTAATATT TTTATCAATA CAATTTTGTC TCAGATCAGT 360 

AACAGAGGCA GAATATTTAC GATCGGATGC GAACATATCT CTTATGATAT TGCCCGCCCT 4 20 

ATTACAAAAC GCATAAGGGG GTTTCTGTAT TCAGGGCTTG ATGCCGTTGT AGCACTGACA 4 80 

AAAAGAGATC AGCAATCGTT CGAGGCAATC TTACGTGGAC GCTCTAAAGC ATATGTCATA 54 0 

75 CCCAATCAAG TTTCATTTAC TACAGTCCAA AGAGATGCTA CTACTCACAA ACAAATGTTG 600 
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10 



25 



55 



65 



GCGATTGGCA GGCTTACCTA CCAGAAGGGT TTTGAATTCA TGATAGAAGA TGCATCACGA 660 

GTGCTGCGAG AAAGGCCTGA TTGGAAGCTT ATCATAGTCG GAGATGGCGA AAATGAATCG 7 20 

ATGCTACGTA AAGAAATTGC ATCTCGCAAT ATGGAGTCGC AAATAGAAAT ACATCCATCT 7 80 

ACACCGGAAA TTCGCAAATA CTACGAATCA TCTGCTATTT ATCTAATGAC GTCCCGTTTC 84 0 

GAAGGACTAC CAATGGTACT TCTCGAAGCA GAAGCATATG CACTACCTAT T^TCTCATAC 900 

OATTGTCCGA CCGGCCCGAG GGAACTGATC GAAAACGGTC GCAATGGTTT CCTTGTGCCA 960 

ATGGAAGCAC ATGAAGACTT CGCGGATAAG TTACGCTTAT TGATGGATGA TGAAACTCTT 102 0 

CGTAAGAAAA TGGGACAAGA ATCAGAGTTG ATGGTCAAAT CCTACTCTCC GGCAAATATC 1080 

TATGAATGTT GGAAGAAACT ATTCGTCGAA ATCGGCTACA TGAAT 1125 

(2) Hi FORMATION FOR SEQ ID 110:13 9 



(i) SEQUEtJCE CHARACTERISTICS: 
15 (A) LENGTH: 1086 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

20 (ii) IIOLECULE TYPE: DtlA (genomic) 

(iii) HYPOTHETICAL: HO 
(iv) Al ITI- SENSE : NO 



(vi) ORIGIIJAL SOURCE: 

(A) ORGANISM: PORYPHYROMOMAS GINGIVALIS 



(ix) FEATURE: 
30 (A) NAIIE/KEY: misc_feature 

{Q) LOCATION 1 ... 1086 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139 

35 ATGATTTATC TGCTGTTAGA TACAATAACA AACCGTGCCG GTACAGAACG CGCCGTGATC 60 

AACTTGGCTA ACAACCTGCA TGCCAATGGT CATCGCGTAT CATTAGTCAG CGTTTGTACA 120 

AAAGAAGGAG AGCCTTCCTT CCAAGTAGAA AAAGGAATAG AAGTACACCA TCTCGGAATT 180 

AGGCTTTATG GCAATGCATT AGCCCGCAAA ACAGTATATT TCAAGGCTTA TCGAAGGATA 24 0 

AAAGCCCTAT ACAAG7V\GCG TGMCCGGTT TTATTGATAG GGACTAATAT TTTTATC7U\T 300 

40 ACAATTTTGT CTCAGATCAG TAACAGAGGC AGAATATTTA CGATCGGATG CGAACATATC 360 

TCTTATGATA TTGCCCGCCC TATTACAAAA CGCATAAGGG GGTTTCTGTA TTCAGGGCTT 4 20 

GATGCCGTTG TAGCACTGAC AAAAAGAGAT CAGCAATCGT TCGAGGCAAT CTTACGTGGA 4 80 

CGCTCTAAAG CATATGTCAT ACCCAATCAA GTTTCATTTA CTACAGTCCA AAGAGATGCT 54 0 

ACTACTCACA AACAAATGTT GGCGATTGGC AGGCTTACCT AGCAGAAGGG TTTTGAATTC 600 

45 ATGATAGAAG ATGCATCACG AGTGCTGCGA GAAAGGCCTG ATTGGAAGCT TATCATAGTC 660 

GGAGATGGCG AAAATGAATC GATGCTACGT AAAGAAATTG CATCTCGCAA TATGGAGTCG 720 

CAAATAGAAA TACATCCATC TACACCGGAA ATTCGCA/iJ^T ACTACGAATC ATCTGCTATT 7 80 

TATCTAATGA CGTCCCGTTT CGAAGGACTA CCAATGGTAC TTCTCGAAGC AGAAGCATAT 84 0 

GC ACT ACCT A TAATCTCATA CG ATTGTCCG ACCGGCCCGA GGGAACTGAT CGAAAACGGT 900 

50 CGCAATGGTT TCCTTGTGCC AATGGAAGCA CATGAAGACT TCGCGGATAA GTTACGCTTA 960 

TTGATGGATG ATGAAACTCT TCGTAAG7VAA ATGGGACAAG AATCAGAGTT GATGGTCAAA 102 0 

TCCTACTCTC CGGCAAATAT CTATGAATGT TGGAAGAAAC TATTCGTCGA AATCGGCTAC 1080 

ATGAAT 1086 



(2) INFORMATION FOR SEQ ID NO: 140 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1920 bass pairs 
60 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
ID) TOPOLOGY: circular 



(ii) tlOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) AMTI- SENSE: NO 



70 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHONAS GINGIVALIS 

(ix) FEATURE: 

(A) NA1-1E/KEY: misc_f eature 
75 (E) LOCATION 1...1920 
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(Mi) SEQUEMCK DESCRIPTION: SEQ ID MO: 14 0 

ATGGGAAAAA TCATTGGAAT TGACTTAGGC ACAACGAACT CTTGTGTCTC TGTATTGGAA 60 

5 GGT7\ACGAAC CTATCGTTAT TACAAACAGT GAGGGCAAGC GCACAACGCC CTCGGTAGTG . 120 

GCTTTTGTGG ATGGTGGCGA GCGTAAGGTG GGCGATCCGG CCAAGCGTCA GGCCATCACC 180 

AATCCGACCA AGACGATATA CTCTATCAAA CGCTTCATGG GCGAAACTTA CGATCAGGTT 24 0 

TCCAGAGAAG TGGAGAGAGT GCCATTCAAG GTAGTACGTG GGGACAATAA TACTCCGCGC 300 

GTAGATATAG ACGGTCGTCT CTATACGCCG CAGGAAATTT CGGCCATGAT CCTTCAGAAG 360 

10 ATGAAGAAGA CGGCCGAAGA CTACCTCGGT CAGGAAGTAA CGGAGGCCGT GATCACTGTG 4 20 

CCCGCATACT TCAACGACGC TCAACGTCAG GCAACGAAAG AAGCAGGAGA GATCGCCGGC 4 80 

CTGAAAGTTC GCCGTATTGT GAACGAGCCT ACGGCAGCTT CTCTGGCCTA CGGTCTGGAC 54 0 

AAGTCCAATA AGGATATGAA GATCGCTGTC TTCGACTTGG GTGGCGGTAC CTTCGATATC 600 

TCTATCTTGG AATTGGGCGA CGGCGTTTTC GAAGTGAAAT CGACCAACGG TGATACGCAC 660 

15 CTCGGAGGAG ACGACTTCGA CCACGTGATC ATTGACTGGC TGGCAGAAGA GTTCAAGTCT 7 20 

CAGGAiAGGTG TGGATCTTCG CCAGGATCCT ATGGCTATGC AGCGTCTGAA AGAAGCTGCC 7 80 

GAAAAAGCCA AGATAGAGCT CTCCAGCACT TCATCTACGG AGATCAACCT CCCCTATATC 84 0 

ATGCCGGTGA ACGGCATCCC CAAGCACTTG GTGATGACGC TTACAAGGGC TAAGTTCGAG 900 

CAGTTGGCCG ATCGTCTGAT TCAGGCATGT GTGGCACCCT GCGAAACGGC CTTGAAAGAT 960 

20 GCCGGTATGT CACGTGGCGA TATCGATGAA GTGATTCTCG TAGGTGGTTC CACACGTATT 1020 

CCTGCTATTC AGGAGATTGT GGAGAAGATC TTCGGTAAGG CTCCGTCCAA GGGTGTGAAT 1080 

CCCGACGAAG TGGTAGCTGT GGGTGCCGCT ATTCAAGGCG GTGTTCTGAC CGGTGAGGTA 114 0 

AAGGATGTCT TGCTGTTGGA CGTTACCCCC TTGTCGCTCG GTATCGAGAC TATGGGAGGC 1200 

GTGATGACTC GCTTGATCGA TGCCAATACC ACTATCCCGA CGAAGAJ\GAG CGAAuATCTTT 1260 

25 ACCACAGCAG TGGACAATCA ACCTTCGGTA GAGATTCATG TACTTCAGGG TGAGCGTTCT 1320 

TTGGCTAAGG ACAATAAGAG CATCGGCCGT TTCAACTTGG ACGGTATTGC TCCGGCGCCC 1380 

CGTCAGACAC CGCAGATCGA AGTAACGTTT GACATCGATG CCAACGGTAT CCTGAATGTA 14 4 0 

ACGGCTCATG ACAAAGCTAC CGGCAAGAAG CAGAATATCC GCATCGAAGC CTCCAGCGGT 1500 

TTGTCCGATG ATGAGATCAA GCGCATGAAG GAAGAGGCGC AGGCCAATGC CGAAGCAGAT 1560 

30 AAGAAAGAGA AAGAACGTAT GGAGAAGATC AATCAGGCCG ACAGCATGAT CTTCCAGACG 1620 

GAAAAGCAGT TGAAGGAGTT GGGAGACAAA TTCCCGGCCG ACAAGAAGGC TCCGATCGAT 1680 

ACCGCTCTCG ACAAACTGAA AGAAGCACAC AAAGCACAGG ATGTAGCTGC TATCGATACA 17 4 0 

GCCATGGCCG AACTGCAAA.C CGCTCTTTCC GCAGCGGGCG AAGAGCTTTA CAAGAATGCC 1800 

GGAGCAGCCC AAGGTGGCGC ACAACCCGGT CCGGACTTCG GCGGTGCTCA AGGTCCCTCT 1860 

35 GCCGGTGATC AGCCCTCTGA CGACAAGAAC GTCACAGACG TAGACTTCGA GGAAGTGAAG 192 0 

(2) IIJ FORMAT I ON FOR SEQ ID NO: 141 

40 {i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1347 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDHESS: double 

(D) TOPOLOGY: circular 

45 

(ii) ]-10LECULE TYPE: DNA (genomic) 

( i i i ) M Y POT H ET I CAL : MO 

50 (iv) ANTI-SENSE: tiO 

(vi) ORIGINAL SOURCE: 

(A) ORGAl-IISIl: PORY PHYROMOMAS GINGIVAL IS 

55 (ix) FEATURE: 

(A) NAl'lE/KEY: misc_feature 

(B) LOCATION 1. . .1347 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141 



ATGCGCTACG ACTTAGCTAT CATCGGTGGA GGGCCGGCCG GTTATACGGC TGCCGAACGT 60 

GCTGCCAAAG GTGGCCTGAA AACCCTCCTA ATTGAGAAGA ATGCTCTCGG TGGTGTATGC 120 

CTCAACGAAG GATGTATACC GACCAAGACG CTACTCTACT CGGCCAAAGT GCTACATCAA 180 

ATTGCTACGG CATCTAAATA TGCAGTAAGT GGAACGGCCG ATGGACTTGA CCTCGGCAAG 24 0 

65 GTGATTGCCA GAAAAGGTAA AATCATTCGC AAGCTGACTG CAGGCATCCG TTCACGCCTG 300 

ACAGAGGCCG GAGTAGAGAT GGTGACGGCA GAAGCTACCG TAiACGGGATG CGATGCAGAC 360 

GGCATCATCG GCATTACTGC GGGCGAAGCA CAGTACAAAG CTGCCAACCT GCTACTATGT 4 20 

ACCGGTTCGG AGACGTTTAT TCCACCCATC CCCGGAGTGG AGCAGACAGA GTATTGGACA 4 80 

AACCGTGAAG CTCTACAGAA CAAAGAGATT CCGACCTCTC TCGTCATCAT CGGTGGTGGA 54 0 

70 GTGATCGGAA TGGAGTTCGC TTCTTTCTTC AACGGTATCG GTACGCAAGT GCACGTGGTG 600 

GAGATGCTGC CGGAAATACT CAACGGTATC GATCCCGAAC ATGCAGCTAT GCTACGCGCT 660 

CACTATG7W\ AAGAAGGA^.T CAAATTCTAC CTCGGGCACA AAGTAACATC GGTTCGCAAC 7 20 

GGAGCTGTTA CGGTAGAATA CGAAGGAGAA AGCAAAGAGA TCGAAGGAGA ACGTATCCTG 7 80 

ATGAGTGTGG GACGTCGCCC CGTGCTGCAA GGATTCGAGT CGCTCGGATT GGTGCTTGCC 84 0 

75 GGCAAAGGTG TAAAGACTAA TGAGAGGATG CAAACTTCCC TGCCCAATGT CTATGCTGCA 900 
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GGTGATATTA CAGGCTTCTC GCTTTTGGCA CATACGGCTG TACGGGAAGC AGAGGTAGCA 960 

GTAGATCAGA TTTTGGGCAA AACAGACGAA ACGATGAGCT ACCGTGCCGT ACCAGGTGTG 102 0 

GTGTACACCA ATCCCGAGGT CGCCGGTGTG GGAGAGACGG AAGAATCGCT TCGCAAAGCA 108 0 

GGACGTGCCT ACACTGTTCG TCGCCTTCCT ATGGCCTTCT CCGGTCGATT TGTAGCAGAA 114 0 

5 AACGAACAAG GCAATGGAGA GTGCAAACTA CTACTTGATG AAGAGAACCG CTTGATCGGA 1200 

GCACACCTCA TTGGCAATCC GGCCGGCGAA CTCATCGTAA CCGCTGCCAT GGCCATCGAG 1260 

ACCGGCATGA CGGATCGACA AATCGAACGA ATCATATTCC CTCATCCGAC TGTAGGCGAA 132 0 

ATCCTAA7y\G AAACTCTCGC CGGAGGT 134 7 

10 

{2} IMFORHATIOII FOR SEQ ID NO: 142 



(i) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 2 82 3 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRAMDEDMESS: double 

(D) TOPOLOGY: circular 



20 



(ii) MOLECULE TYPE: DMA {genomic) 
(iii) HYPOTHETICAL: HO 



(iv) A11TI-SE1-3SE: IIO 

25 (vli ORIGIIIAL SOURCE: 

(A) ORGAHISll: PORYPHYROHOMAS GIIIGIVALIS 



{i>;) FEATURE: 

(A) HAJIE/PCEY: mi ^c_f ea ture 
30 (B) LOCATION l..,2 823 



(>:i) SEQUEIICE DESCRIPTION; SEQ ID HO; 142 



ATGGAATTGA AAAGATTTTT ATCACTTGGT 

35 CTTTCTGCCC AACAGGCTCA GCCACTCCCT 
GACAACGGAT TGACTTATTT CATCCGTCAC 
TTTATCGCAC AAAAGGTAGG TTCTATTCTT 
TTCTTGGAAC ACATGGCTTT CAACGGTACG 
TATCTCGAAA CGATCGGTGT ACGTTTCGGT 

40 AAGACGGAAT ATACGATAAT GGATGTGCCG 
TTGCTTATCC TGCATGATTG GAGTAACAAT 
GAGCGCGGTG TGATCCAGGA AGAGTGGCGT 
GAGGCTATAC TTGCCAAGGC TATGGCGGGT 
CTGATGGACG TCGTGCTCAA CTTCAAGCAT 

45 TATCGTCCCG ACCTGCAAGG TCTGGTGATC 

AACAAGATCA AAGAACTCTT CAAGGACGTT 
TATACG CCGG TAGAGGACAA CGATGAGCCT 
ACTACCACGC AGCTCTCCAT CAGCTTCAAG 
TCGATATTCG GACTTGTGGA AGACTATATG 

50 CGTCTGTCCG AGATTACTCA CAAGCCTAAC 
TCTAACTTCA TGTACATCAC CCAGACTAAG 
GAGGGTGAAG CGGAG7U\AGC GATGAACGCA 
TTCGGTATCA CCAAAGGCGA ATACGATCGT 
AATCAATACA ACGAAAGAGA CAAGCGTAAG 

55 TACTTCACCG ATGGCGGCTA TATCCCGGGT 
TTTGCTCCTC AGGTTCCTCT GGAAGCATTC 
GTGAAGAATG CTGTCGTTAC CCTCACCGGT 
GAAGCAGACT TCCTCGCTGC TTTCAAAGCT 
GACGAAGTCT CCGACCAAAA ATTGATGGAG 

60 GAGAAGAAAG ATCAGAAGTT CGGTACCACG 
TACCTCAAGA AGACCGATTT CAAATCAAAC 
GGTATCCTCT CCGGAAAGCA TGCTCCCAAC 
GGTGGCTTGG GCAACTTCGA TGCTATCCAG 
TCCGTATCTC CCTCTTTGTC TCTGCTCAGT 

65 GATATGGAAA CTTTCTTCCA GTTGATCTAT 
GAAGCGTTCA AGGCCACACA GGAAAAGTTG 
CCGATGGCTG CGCTTATGGA CTCTATCCGT 
AAACCCATGA AAGCTGCTGA CGTGGAGAAA 
AATGAGCGAT TCGCTGATGC CGGCGACTTT 

70 GCCAAGATGA AGCCATTGAT CGAAACTTAT 
GATAAGATGA ATAAGGCTCA GGTACCGGCT 
GAGAAGGAAA TGGATACTCC TTCGACTACT 
TATACGCTCA AGAACAGTCT CCTGCTGGAA 
ACGGCTACCG TTCGCGAGAA GGAAGGCGGT 

75 GAGCAATATC CTCAGCCCAA GGCTCTGATG 



CTTCTGCTTG TGGGATTCAT TCCGATGAAG 60 

ACAGATCCGG CTGTTCGTGT CGGTAAGTTG 120 

AACGAGAACC CGAAAGATCG TGCGGATTTC 180 

GAAGAAGATA GCCAGTCCGG TTTGGCTCAC 24 0 

AAGAACTTCC CCGGTAAGAA CTTGATCAAC 300 

CAGAACCTGA ACGCTTCTAC CGGATTCGAC 360 

ACTACACGTC AGGGAATCAT CGACTCCTGC 4 20 

ATTACCCTCG ACGGGCATGA GATCGACGAG 4 80 

GCTCGTCGCG ATGCCAACCT TCGTATGTTC 54 0 

AATAAATATG CAGAACGCAT GCCCATCGGT 600 

GATGAGCTGC GCAACTATTA TAAGAAATGG 660 

GTGGGAGATA TCGATGTGGA CTATGTGGAG 7 20 

CCTGCTCCCG TGAATCCAGC AGAGCGTATC 7 80 

ATCGTAGCCA TTGCTACCGA TGCTGAGGCT 84 0 

AGCGACCCCA CTCCTCAAGA AGTGCGAGGA 900 

AAACAGGTGA TCACTACAGC CGTGAATGAG 960 

GCTCCTTTCC TCAGTGCAGG AGCTTTCTTC 1020 

GACGCATTCA ATTTTGTTGC CACGGTTCGT 108 0 

TTGGTGGCAG AGATAGAAAG CCTCCGTCAG 114 0 

GCACGCACGA ATGTGCTCAA GCGATACGAG 1200 

AACAATGCTT ATGCCAATGA ATACTCCACC 1260 

ATTGAGGTGG AATATCAGAC GGTGAATGCT 132 0 

AATCAGGCTA TTGCCCAAAT GATCGATCCG 1380 

CCTTCAAAGG CTGAAGCCAA GATTCCGAGC 14 4 0 

GCTCGTCAGC AGAAAGTAGA AGCCAAGAAA 1500 

AAAGCTCCTA AGGCCGGAAA GATCGTTTCC 1560 

GAACTTACCC TTAGCAATGG CATCAAAGTA 162 0 

GAAATCCTGA TGAGTGCTCT CAGCCCGGGT 168 0 

CAATCTGTGA TGAATTCGTT CATGAACGTG 174 0 

CTGGATAAGG TGCTGACAGG TCGCTCTGCT 1800 

GAAGGTCTTT CGGGCAAAAC GACTGTAGAA 1860 

CTCCAAATGA CTGCTAACCG CAAGGATCCC 192 0 

TACAATAACT TGAAAAATCA GGAAGCCAAC 1980 

CATACCATGT ACGGCGATAA TCCGATGATG 204 0 

GTAAATTACG ATCAGGTAAT GGCTTTCTAC 2100 

ATGTTCTTCT TTATCGGTAA TCTGGATGAA 2160 

CTTGCTTCAT TGCCCAACCT CAAGCGTGGC 222 0 

GCCCGTTCGG GAAAGATCGA TTGCAAGTTC 2280 

ATATTCGATG TCGTGTCCGG AAATGTGGAA 234 0 

GTCTTCTCAG CCGTAATGGA TCAGGTGTAC 2 4 00 

GCATACAGTG TGGCTGCATT CGGCGGTCTC 2 4 60 

CAGATCTATT TCCCCACGGA TCCTGCTCGT 2520 
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GCCGAGGAAA TGAATGCTAT CGTTTTTGCT GAGTTGGAGA AGCTTGCCAA GGAGGGCCCC 258 0 

AATGTGGAAT ACTTTAAGAA GACTATCGAA AACCTGAATA AGCAGCACAA AGAAAGTCTG 2 64 0 

CGTGAGAATC GTTTCTGGCT CGAAGCCATG AAGGCGTCTT TCTTCGAAGG AAATGACTTC 27 00 

ATCACAGACT ACGAATCCGT ACTGAACGGT CTTACTCCTG CTGAATTGCA AAAGTTTGCG 27 60 

GCAGACCTCT TGAAGCAGCA GAATCGGGTT GTTGTCATGA TGGCTCCTGT TGCAAAGGCT 2 82 0 

CAA 2823 



10 



20 



(2) III FORMAT ION FOR SEQ ID HO: 1^3 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 2052 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS: double 
15 ( D) TOPOLOGY : circular 



(ii) MOLECULE TYPE: DIIA (genomic) 
(iii) HYPOTHETICAL: HO 
(iv) At IT I -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGAN ISM: POKY PH YROMOMAS GINGIVALIS 



25 

vix) FEATURE: 



(A) NAIIE/h:EY: misc__f eature 

(B) LOCATION 1 . , . 2052 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143 



ATGAGTAAGA AAGGAACAAT CGGGGTAACG 
TTCCTGTACA GCGACCATGA GATATTCCTG 
ACGCAGAAGC TGAAAACGCT TACATCCGTC 

35 CGCGTAACGG TCAGCGTGGA TGAAGTGGCA 
GGGATGACCG AAGAGGAGGT GGAGAAGTAC 
GAGTTTCTTG AAAAGTACAA AGACGACAAG 
TTTTACTCGG CTTTCATGGT GTCCGAGCGA 
GATGCTACGG CGGTGAAATG GAGCTGCGAC 

40 GACAAGGCTG ACCGTGGCAC CGACATCGTG 
CTCAAAAAAG AAAAGATAGA GGGGCTCCTC 
ATCATTTTCG GCAAGAAGCA GGAATGGAAA 
AATCAGATCA ACGACACACA TCCTGCCTGG 
GACTATAAGG AATTTTACCG TTCGCTCTAT 

45 CACCTCAATG TGGACTATCC GTTCAATCTG 
AACAACTTGG ATCTGCAGCG CAACAAGATT 
GATGAAGTAC AGGGTATCGT GCCGGACTTC 
CCGGATATTC CCCTCAACGT ATCGCGCTCC 
ATCTCGTCTC ATATCACCAA GAAGGTGGCA 

50 CGCCCCACAT TCGAGGAGAA ATGGGATAGT 
ACGGATGAGA AGTTCTATGA GCGTGCAGCC 
CACAAGTACA CGTTCGACGA ATACCGAACG 
GGACAGGTAG TGTATCTCTA TGCTACGGAC 
GCATCCGACA AAGGCTACAG CGTGATGCTG 

55 AGCCTGCTGG AGCAAAAGTT GGAGAAGACA 
AACAATCTGA TCCGC7\AGGA GGAAAGAGCC 
ACTCTCGTGA AGCTGTTCGA AGCACGCCTG 
GCTTTCGAAT CGCTCGGAGC CGAAGGTGAA 
CGCCGTATGC GCGATATGGC ACAGCTGCAG 

60 GATTCGTACA ATCTGGTACT TAATACCGAT 
GAGAAAGAAT CGGTAGAGCC TTCGCTCACA 
GCGGAAGAGG CCAAGCTGCT CGATGAGGAA 
GCCACGAAGG AAGCCAAGGA GAACAACGCC 
AACGATCAAC TGACCAAATA TGCTCAGGAC 

65 GCTCTGCTCG GAAGCGGATT GCTGACGGGA 
CAGCGTCTTC TC 



AGCGACAATA TATTCCCCGT CATCAAAAAA 60 

CGTGAGATCG TCTCCAATGC CGTGGATGCT 12 0 

GGCGAATTCA AAGGCGAGAC GGGTGACCTC 180 

CGCACGATCA CGGTCAGCGA CCGCGGCGTA 2^10 

ATCAATCAGA TTGCTTTCTC CAGTGCGGAA 300 

GCCGCCATTA TCGGCCACTT CGGACTCGGA 36 0 

GTGGACGTGA TCACGCGCTC TTTCCGAGAA 420 

GGATCGCCCG AATACACGCT CGAACCTGCG 4 80 

ATGCACATCG ATGAGGAGAA TAGCGAGTTC 54 0 

GGCAAATACT GTAAGTTCCT TACCGTGCCG 600 

GACGGCAAGA TGCAAGATAC GGACGAGGAC 660 

ACCAAAAAGC CTGCCGACCT CAAGGACGAA 720 

CCCATGTCCG AAGAGCCTCT CTTCTGGATC 7 80 

ACAGGTATCC TCTATTTCCC GAAGATCAAA 84 0 

CAGCTCTACT GCAATCAGGT TTACGTCACC 900 

CTCACCCTCC TGCACGGGGT CATCGATTCG 960 

TATCTGCAGA GCGATGCCAA TGTGAAGAAG 102 0 

GACCGTCTGG AAGAAATTTT CAAAAACGAC 108 0 

CTGAAGCTCT TCGTCGAATA CGGTATGCTG 114 0 

AAATTCTTCC TTTTCACCGA TATGGACGGA 12 00 

CTCGTCGAAG GTGTACAGAC GGATAAGGAC 1260 

AAGCATGGAC AGTACAGCCA CGTGAAACGT 132 0 

TTGGATGGTC AGTTGGATCC GCATATCGTG 1380 

CACTTTGTCC GTGTCGATAG CGATACGATC 14 4 0 

GAAGTGAAAC TGTCCGATAC GGAGCGCGCC 1500 

CCACGGGACG AGAAGAAGCA CTTCAATGTA 1560 

GCCATCCTTA TCACACAAGC CGAATTCATG 162 0 

CCGGGAATGA GCTTCTACGG CGAACTCCCC 168 0 

CATCCGCTCA TCGACAGGGT ACTCTCCGGT 17 4 0 

GAGCTTAGAG CGAAAATCGC CGAGCTGAAA 18 00 

AAAGGGAAGA AACCGGAGGA AATCCCTGTT 1860 

GTCGAACAGG CCAAAACCGA AGGCAGTATC 192 0 

AACGAGCTGA TAGGTCAGCT CATCGACTTG 198 0 

GAGGCTTTGG CCGAATTCAT TCGTCGCAGC 204 0 

2052 



(2) INFORMATION FOR SEQ ID NO: 14 4 

70 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1392 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
75 (D) TOPOLOGY: circular 
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60 



65 



(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) At IT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) OR GAI II SH: PORYPHYROMOWAS GINGIVALIS 

(ix) FEATURE: 

(A) NAl-IE/KEY: misc_f eature 

(B) LOCATION 1...1392 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14-3 

ATGGAAAAAC TGATCGATAT TTTGGTCGTA GACGATGATG TGGCAGTCTG TGCCGCACTG 60 

CGTCTGGTGC TCAAGCGAGC GGGCTAT7\AT CCCGTTATAG CCAACAGTCC CGACGAAGCT 120 

TTGTCCATAA TGCGGAATCC TGATGGCGGC TGTAAGCCGG CTGTGATTCT GATGGATATG 180 

20 AATTTCTCCC TTTCGACCTC CGGCAGGGAA GGATTGGAAC TACTGGAGAA GATGCAGATA 24 0 

TTCACTTCCT GCCCTGTCAT ACTGATGACG GCTTGGGCTT CGATTCCACT GGCAGTGGAG 300 

GGAATGAGGC TTGGAGCTTT CGACTTCATA GGCAAGCCAT GGGACAACGA TCGGCTCCTT 360 

CGTACCATAG ATACGGCCTT GCATCTGGCT GCTCCCTCAG CTGTGGCGAA TCCATCGGAA 4 20 

CAGTCTGACA GAGATACAGC CCGTCAGCCG AAAGCTACAG TCCAAGAGAA TGACCCCTGT 4 80 

25 GCCCATATCA TAGGCCGGAG CGATGCCATC TGTAAGATCA AGGT^ACGGAT ACGCCGCATA 54 0 

GCTCCCACCC ATGCCTCTGT GCTGATCACG GGCGAGAGCG GTACGGGCAA AGAGTTGATA 600 

GCCGAAGCTC TGCACCGTGG GAGCAAACGA GCCTCAGCCC CATTCGTCAA GGTCAATTTG 6 60 

GGTGGGATTC CCGAAAGTTT GTTCGAAAGT GAGCTGTTCG GACATAAGAA AGGAGCTTTT 7 20 

ACCAATGCTT TTTCCGACAG GAAAGGACGG TTCGAGCTGG CTGATGGCGG CACGATCTTT 7 80 

30 CTGGACGAAA TAGGCGAACT ACCGGTCGGC MCCAAGTAA AACTGCTGCG AGTGCTACAG 84 0 

GAACAGACAT TCGAGCCGTT GGGCGAGAGC GTCTCCCACC GAGTGGACAT CCGTGTGGTA 900 

TCGGCTACGA ATGCTTCCTT GGAGCGAATG GTAGCCGAAG GACGTTTCAG AGAGGACCTC 960 

TACTATCGAA TC7\ACCTGAT ACATCTGCAT CTGCCTCCGC TGCGTGAGCG TCAGGAGGAT 1020 

ATACAGCTGC TGGTGGAAGC CTTCAGTGAA GCCTTTGCCC AATCGAACGG ATTGCCCCAT 1080 

35 GCCGTTTGGA GTGCGGAAGC TATGCGACGT ATCTGTGCCA TGCCCCTACC GGGCAATGTA 114 0 

CGCGAACTGA AAAACGTAGT GGAGCGTACG CTATTGCTCT CGGGATCGAG AGAAATCAGT 12 00 

GCCCGGGATG TGGCTGACTT CGGTTCGCAG GTGACGGCAG CAGACCACTC CGACGAACGG 1260 

GCTTTGACCG ACATGGAGGA AGCTGCTATC CGAGAGACGC TGACTAAATA CAACGGCAAC 1320 

GTTAGTCGTG CTGCACGAGC CTTGGGATTG AGCCGGGCAG CTCTTTACCG GCGAATGGAG 13 80 

40 AAATACGGAC TG 13 92 

(2) INFORMATION FOR SEQ ID NO: 14 5 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

50 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

55 (iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAI-aSII: PORYPHYROHONAS GINGIVALIS 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1 ... 750 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 5 



ATGCTTAAGA TAAAGAACCT CCACGCCACA GTACAGGGCA AAGAGATATT GAAAGGAATC 60 

AATCTGGAGA TCAATGCCGG AGAGATTCAT GCTATCATGG GGCCGAACGG ATCGGGGAAA 120 

AGTACGCTCT CTTCCGTTTT GGTGGGACAT CCCTCCTTTG AAGTCACGGA AGGAGAGGTG 180 

ACATTCAATG GAATCGACCT GCTCGAACTC GAACCGGAAG AACGTGCACA CCTCGGACTC 24 0 

70 TTTCTCAGTT TCCAATATCC GGTCGAGATC CCGGGCGTCA GCATGGTGAA TTTCATGAGG 300 

GCAGCTGTCA ATGAACATAG GAAAGCGATC GGAGCAGAAC CCGTATCGGC AAGCGACTTC 360 

CTCAAGATGA TGCGAGAGAA GCGTGCCATT GTGGAGCTGG ACAACAAATT GGCCAGCCGT 4 20 

TCTGTGAACG AAGGCTTCTC CGGTGGAGAA AAAAAGAGGA ACGAAATCTT CCAAATGGCT 4 80 

ATGCTCGAAC CCAAGCTGGC TATTTTGGAC GAAACCGATA GCGGGCTCGA TATCGACGCT 54 0 

75 CTCCGCATCG TAGCAGGCGG GGTAAACCGA CTCCGCTCTC CGGAGAATGC TGCTATTGTG 600 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929S70A1J_> 



wo 99/29870 



PCT/AU98/01023 



5 



15 



99 / 490 

ATCACACACT ATCAGCGTTT GCTCGAGTAC ATCAAGCCGG ACTTCGTACA CGTCCTTTAC 660 

AAGGGGCGCA TCGTCAAGTC GGGAGGAGCC GAGCTGGCTC TCACGCTCGA AGAAAAAGGC 7 20 

TACGACTGGA TCAAGGAAGA GATAGGAGAA 7 50 

(2) INFORMATION FOR SEQ ID NO: 146 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: HO 

( iv) ANTI-SEIISE: MO 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROHONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAI-IE/KEY: mi £c_ feat are 
25 (B) LOCATION 1...1383 

{xi} SEQUENCE DESCRIPTION: SEQ ID NO: 1^6 

ATGGCTAAGG AGAAAACGAT CTACGTCTGC CGTTCGTGCG GAACCAAATA CGCCAAATGG 60 

30 CAAGGCAACT GCAATGCCTG TGGAGAGTGG AACTGCATTG ATGAGGAGAA GGTGCCGGCA 120 

CCGGCATCGG GCAAGCATGC AGCCAAGAGT TTTATGCCTC GGGAGCAGGA -CAACCGGCCA 180 

AGACTCTTAC AGGATGTGGA GTCCGGCGAT GAAGAGCGTA TTCGCCTCGG CGATGAAGAG 24 0 

TTCGACCGCG TACTGGGTGG AGGAATTGTC AAAGGAGCAT TTGTCCTGCT TGGCGGCGAG 300 

CCGGGAATCG GTAAGTCCAC GCTTATCCTC CAGACGGTGC TGCGTCTGCC GCAGTTGCGC 360 

35 ACGCTCTATG TGTCGGGCGA AGAAAGTGCC CGACAACTGA AGATGCGCGC CGAACGACTG 4 20 

GGGCAAGCCA TGAATGGGTG CTACGTATAC TGCGAAACGA ATATAGAGAG GATACTCTCC 4 80 

CGTGCAGAJXG AACTCACACC CGATCTCCTC GTGATAGACT CTATACAGAC GGTCTATACC 54 0 

GAGGAAATGG AAAGCTCGGC CGGCAGCGTG GGGCAGATCC GCGAATGTGC CGCCTTACTG 600 

CTCAAATACT GCAAGACTAC GGGTATCCCC GTCATCGTCA TCGGACACAT CACCAAAGAA 660 

40 GGTAGCATAG CCGGACCGAA GGTGCTGGAG CATATAGTGG ATACGGTGCT TCTCTTCGAC 7 20 

GGGGATAAGC ATCATCTCTA CCGGATACTC CGAGGACAGA AGAACCGCTA TGGCAGTACT 7 80 

TCCGAGCTGG GGATATACGA GATGCGGCAG GACGGTCTGC GTGGCGTGGA GAATCCGAGC 84 0 

G7V\CATCTCA TCACACGCAA TAGGGAAGAC CTCAGTGGCA TAGCCATAGC CGTAGCGATG 900 

GAGGGCATTC GCCCGATACT CATCGAAGCG CAGGCTTTGG TCAGCTCGGC CATTTATGCC 960 

45 AATCCGCAGC GTTCGGCCAC GGGCTTCGAT ATTCGGCGGA TGAACATGCT CTTAGCCGTA 102 0 

CTGGAGAAAC GTGCCGGCTT CAAGCTCATA CAGAAGGATG TGTTTCTGAA CATTGCCGGA 1080 

GGTATCA/\AA TAGCCGATCC GGCTACGGAT CTGGCCGTTA TCTCGGCAGT GCTGGCGTCG 114 0 

AGTCTGGACA TCGTTATCCC GCCGGCCGTA TGCATGACGG GCGAGGTCGG ACTCTCCGGA 1200 

GAGATACGTC CCGTGAGCCG CATCGAGCAG CGCATAACGG AAGCGCGTCG CATAGGGTTC 12 60 

50 AAAGAGATAT TGGTACCGGC CGATAATTTC CGGCAGGAGG ATGCCGGCCG CTTCGGTATT 1320 

CGGCTCGTGC CGGTCAGAAA GGTGGAGGAA GCCTTCCGCC ATCTGTTCTC GAAAGGAAGA 1380 

GAA 13 8 3 

55 (2) INFORMATION FOR SEQ ID NO: 147 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

65 (iii) HYPOTHETICAL: MO 

(iv) ANTI-SENSE: NO 

( vi ) ORIGINAL SOURCE : 
70 (A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAI-IE/KEY: Tnisc_f ea t ure 

(B) LOCATION 1 ... 813 

75 
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(xi) SEQUENCE DESCRIPTION: SEQ ID 110:141 

ATGAATAGCA GACATCTGAC AATCACAATC ATTGCCGGCC TCTCCCTCTT TGTACTGACA 60 

TTGGGCGGCT GCTCCGTAGC CCAACAAGAT ACGCAGTGGA CTCTCGGCGG AAAGCTCTTT 120 

5 ACTTCGGCGT GGATACAACG TTCGGCCGAA TATCAAGCGC TTTGCATTCA GGCATACAAC 180 

ATCGCTACGG AAAGAGTGGA CGCTCTACCG GCAGAACGTA AACAAGGAGA TAGGCCTTAT 24 0 

GCCATCGTAA CGGACATAGA CG AAACCATT TTGGACAATA CGCCTAACTC CGTGTATCAG 300 

GCTCTCAGGG GCAAGGATTA TGATGAAGAG ACTTGGGGGA AATGGTGTGC ACAGGCCGAT 360 

GCCGACACAC TGGCAGGAGC TTTGTCTTTC TTCCTCCATG CAGCGAACAA GGGGATCGAG 4 20 

10 GTCTTTTACG TCACCAACCG CAGAGACAAT CTGCGCGAAG CAACTCTTCA GAACCTTCAG 4 80 

CGTTACGGAT TCCCCTTTGC CGATGAAGAA CATTTGCTTA CGACCCATGG GCCATCCGAC 5^0 

AAAGAACCCC GTCGGCTCAA AATACAAGAA CAGTATGAAA TAGTATTGCT CATAGGAGAC 600 

AACTTGGGCG ACTTCCACCA CTTCTTCAAT ACGAAAGAAG AGTCCGGACG CAAACAGGCT 660 

CTGGGCCTGA CAGCCGGGGA GTTTGGCCGG CACTTCATCA TGCTGCCCAA TCCCAACTAC 720 

15 GGATCTTGGG AACCGGCATG GTACGGCGGG AAGTATCCGC CACTGCCCGA AAGAGACAAA 180 

GCACTTAAAC AACTGCACTC ACAGAACAGC AGA 813 



20 



30 



35 



(2) iriFORMATION FOR SEQ ID NO: 149 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 bass pair^ 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 
25 <D1 TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

ilii) HYPOTHETICAL: NO 

(iv) AHTl -SENSE: MO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: f ORYPHYRa-IONAS GINGIVALIS 



(ix) FEATURE: 

(A) NAl-lE/KEY: misc_feature 

(B) LOCATION 1 . . .1251 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID MO: 148 

ATGAGCACCA ATATAGATGT ACAACAGATC AAACAGCGTT TCGGCATCAT CGGTAGCAGT 60 

CCGCTGATGG AACATGCCAT ACGAGTGGCA GCACA.GGTGG CTCCTACCGA CATGTCCGTC 12 0 

CTCGTGACGG GGGAGAGCGG TTCGGGGAAA GAGTTCTTCC CACAGATAAT CCACTACTAC 180 

45 AGCGCCCGGA AACATCATAG CTACATTGCA GTCAATTGCG GAGCCATCCC CGAAGGAACC 24 0 

ATCGATTCCG AGCTGTTCGG ACACCGCAAA GGTTCCTTTA CCGGAGCCGT ATCGGATCGC 300 

AAGGGGTACT TCGAAGAAGC ATCCGGCGGC ACGATCTTTC TGGACGAAGT GGGCGAACTG 360 

CCTTTGCCCA CGCAGGCGAG GCTGCTGAGG GTGCTGGAGA CGGGCGAGTT CATCCCCGTA 4 20 

GGAGCCAGCC AGTCGCAGAA GACGGATGTC CGTATCGTAG CGGCGACGAA TGTGAACCTC 4 80 

50 AAGGAGGCGG TAGCGAACGG GAAGTTCCGG GAAGACCTCT TCTTCCGGCT C7VATACGGTA 54 0 

CCGATCGAGG TGCCTGCGCT GCGTATGCGA CCGGACGACG TGCCCTTGCT TTTTCGCCGA 600 

TTCGCCGCCG ACAGCGCCGA GAAGTATCGG ATGCCTCCGC TGCGCCTATC GGACGAAGCC 660 

CGTACCATAT TAATGCGTTA CCGCTGGCCC GGCAATGTGC GAGAGCTGCG CAATATAACC 120 

GACAGGCTGA GCATCCTGGA GGAGGAGCGG ACGGTATCGG CAGAGACCAT CACTCGCTAC 7 80 

55 CTGGACGCTG AGGGGATGCA AGACCTCCAC CCCGTCGTGA TCCGACGGAA CGAAACGACC 84 0 

G7KAGCGGACA AACAAATCCC CCATTACGAG CGCGAAATCA TCTACCAGGT GCTATACGAT 900 

ATGAAGAAAG AGATAGCCGA TTTGAAGGGG ATGATGAACC GCCTGGCGCA CCACGAACAG 960 

CCCTCATGGC CTGTAGGGTC GGACGTCTGG GGCAACGACG ACAAGCGCAC CGCAGATCCG 1020 

AAGTGGGGCG TCAGCACGCA CAAGGCCCCC ATCGCGAACG CGGCAGAACC CGTGGAGCCG 1080 

60 ATACAGGAAG CCAGCGAATA CACCGAGGAT CCGGTTTCGC TGGAGGAGGT AGAGAAGAAA 114 0 

ATGATTTCCC TTGCATTGGA ACGCCACGGC GGAAGGCGCA AGCAGACAGC CGAGGAACTG 1200 

AAGATTTCGG AGCGGACACT ATACCGTAAA ATCAAGGAGT ATGGACTGGA A 1251 

65 (2) INFORMATION FOR SEQ ID NO: 14 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1806 base pairs 

(B) TYPE: nucleic acid 
70 (C) STRAI7DEDNESS: double 

(D) TOPOLOGY: circular 

(ill ]IOLECULE TYPE: DNA (genomic) 
75 (iii) HYPOTHETICAL: NO 
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10 



45 



55 



(iv) AIJTI-SENSE: HO 

(vi) ORIGinAL SOURCE: 

(A) ORGAI-IISM: PORY PHYROMOHAS GIHGIVALIS 

(ix) FEATURE: 

{A} IJAI-IE/KEY: misc_feature 

(B) LOCATION 1 . . .1806 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOil-ig 



ATGATCCTAC TCTTCGGCGG TACTACGGAA GGCCGTGCCG CAGCTCGCGT GCTGGATGAA 60 

GCGGGAAGTC CGTTTTTCTA CTCCACCAAA GGCAA.TCTGC AAGAGATCCA GAGTAGCCAC 120 

15 GGCCATCGTC TGACAGGAGC CATGACGGTT GCCGACATGG TTTCGTTTTG TCGGAAAGAA 180 

GAGATCCGAC TGATCGTGGA CGCCGCTCAT CCTTTCGCCG AAGAATTGCA CGCTTCAGTG 24 0 

GCAGAAGCCT CTGAACAAAC AGGTATCCCC GTAGTAAGAT ACGAGAGACA ATACCCTCCA 300 

CGCGAAGAAG GTATCGTCTG GTGTGCAAAC TACGATACGG CTGCCGAGCG GATGCTTGGC 360 

GATGGCGTGC AGCGTCTGCT GATGCTCACA GGAGTGAATA CGATCCCCAA GCTGGCTGCT 4 20 

20 TTCTGGAAAG AGCGCACCAC CTTTTGCCGC ATATTGAAGC GAGACGAATC GGTTGCTTTG 4 0O 

GCAGAGAAGA ACGGCTTTCC TGCGGAGCGC ATCGTTTTCT TCGAACCGCA TGCGGACGAG SAO 

GAGCTGATGC AAGCCGTTCG CCCCGATGCC ATTATCACAA AAGAAAGCGG AGAGAGCGGT 600 

TACTTCCGAG AAAAGATAGA AGCTGCCCGA CGGATGGGCA TCCGTATATA TGCCGTCGTA 660 

CGTCCCCCTT TGCCTCCTTC ATTCATTCCC GTAGGCGGGC CTGTCGGTTT GAGACGGGCG 7 20 

25 GTAGAACGCC TCGTGCCGGG ATTCTTTTCA CTCCGA.AGCG GATTCACTAC CGGCACCACA 780 

GCTACCGCTG CAGTAGTAGC AGCCATGTAC CGATTGATGG GGCTTGGCTC TCTCGCCGAA S40 

GCTCCCGTAG AATTGCCTTC GGGCGAAATA GTCAGTCTGC CCATAGCGGA AATTCGAGAG 900 

GAAGAAGATG CTGTCGTATC CGCAGTCCTG AAAGATGCAG GTGATGATCC GGATGTGACC 9 60 

AATGGCATGG CGGTATGCGC TACGATCAGG CTCAATCCCG AACATGAGGA AGTCCGCTTC 102 0 

30 CTGCAGGGTG AAGGGGTGGG GGTAGTGACG CTCCCCGGCC TCGGTCTGGA GGTCGGAGGT 1080 

CCGGCTATCA ACCTCGTACC TCGACGAATG ATGACAGCAG AGGTACGCCG ACTCTATGCG 11-10 

CAGGGAGGTG TGGATATTAC GATTAGCGTA CCCGAAGGCC GAGAGGCTGC TACCCAGACA 1200 

TTCAATCCCC GACTCGGCAT ACGGGACGGC ATCTCTATTA TCGGAACATC GGGAGTCGTG 12 60 

AAACCTTTTT CGGCCGAAGC GTTCGTTGGT GCCATCCGTA AGCAAGTGGG TATTGCCACC 1320 

35 GCCTTGGGAG CC/UXTCATAT CGTCCTCAAT TCGGGAGCCA AGAGTGAGCG TTATGTAAAA 1380 

GGAGCCTATC CGGCACTCAT TCCACAGGCC TTTGTGCAGT ATGGCAATTT CGTCGGCGAA 144 0 

TCACTCAGTT GTGTAGCTTC CTTCCCTTCT GTCCGTTCGG TAACGGTAGG AATCATGCTC 1500 

GGCAAAGCAG TGAAACTCGC CGAAGGCTAT CTGGATACGC ACAGTAAAAA GGTAGTGATG 1560 

AATCGGGATT TCCTGCACGA ACTGGCTCGT CAGGCAGGTT GTTCGGAAGA CATCCATGCC 1620 

40 ATAATAGACA GCCTGAATTT GGCTCGTGAG CTATGGACTA TGCCGAGTGC GGAGGACAGC 1680 

GATCGACTGC TACGAAAGAT TGCCGAACGA TCTTGGGAAA CTTGCCGCCC ATCGGTACCA 174 0 

TCGGCCGAAT TAGAACTCCT GCTGATCGAT GAGTCCGGAG CGATTCGTTT TCGTATCGGT 1800 

GGAGAA 1806 



(2) IMFORMATION FOR SEQ ID NO: 150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1329 base pairs 
50 (D) TYPE: nucleic acid 

(C) STRAIIDEDHESS: double 

(D) TOPOLOGY: circular 



(ii) nOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: WO 
(iv) ANTI-SENSE: MO 



60 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVAL IS 

(ix) FEATURE: 

(A) NAI IE/KEY: misc_feature 
65 (B) LOCATION 1...132 9 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:150 

ATGTTGAGGA CTTTCCGAAT CGGTGGTATT CACCCCCCCG AAAACAAGTT GTCGGCAGGC 60 

70 AAGCCCGTAG AGGTGTTGCC TATCCCCTCA CAGGTAGTCA TCCCTCTTGG TCAGCACATC 120 

GGTGCACCGG CAACTGCCAC GGTCAAGAAA GGGGATGAAG TTAAGGTCGG GACTATCATT 180 

GCTCAGGCCG GAGGATTCGT ATCAGCTAAT ATCCACTCAT CTGTGTCGGG TAAGGTGCTG 24 0 

AAGATCGATA ACGTATACGA CTCAAGCGGC TATCCCAAGC CCGCAGTCTT CATTAGCGTA 300 

GAAGGTGACG AATGGGAAGA GGGCATCGAT CGCTCACCAG CCATCGTCAJX AGAATGCAAT 360 

75 CTGGATGCAA AAGAAATCGT AGCCAAAATT TCTGCAGCCG GTATTGTGGG TCTTGGCGGT 4 20 
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GCTACCTTCC CTACCCATGT GAAGCTGTCC CCTCCTCCGG GCAACAAAGC TGAGATCCTG ^80 

ATCATCAACG CCGTAGAGTG CGAGCCTTAT CTGACGAGCG ACCATGTCCT TATGCTGGAG 5^0 

CACGGCGAAG AGATCATGAT CGGCGTGAGT ATCCTGATGA AAGCCATTCA GGTAAACAAG 600 

GCCGTCATCG GAGTTGAGAA TAATAAGAAA GATGCTATTG CTCACCTCAC CAAACTGGCC 660 

5 ACTGCATATC CGGGCATAGA GGTAATGCCG TTGAAGGTGC AATATCCTCA AGGCGGTGAG 7 20 

AAGCAGCTGA TCGATGCAGT GATCCGCAAG CAGGTAAAAA GCGGTGCCTT GCCTATCAGC 7 80 

ACAGGTGCCG TAGTACAAAA CGTGGGTACG GTATTCGCCG TGTACGAAGC AGTACAGAAG 84 0 

AACAAGCCTC TGGTCGAGCG CATCGTGACG GTTACAGGAA AAAAACTGTC TCGTCCGTCT 900 

AACCTCCTCG TTCGTATAGG TACTCCTATT GCGGCTTTGA TCGAAGCAGC AGGTGGCTTG 960 

10 CCGGAGAATA CGGGCAAGAT CATCGGCGGA GGTCCGATGA TGGGACGCGC TCTGCTGTCA 102 0 

CCGGATGTGC CTGTGACCAA AGGCAGCTCC GGAGTATTGA TTCTCGATAG AGAAGAGGCA 1080 

GTTCGCAAGC CTATGCGCGA CTGTATCCGA TGCGCCAAGT GCGTCGGAGT GTGTCCGATG 11 'JO 

GGACTCAATC CGGCTTTCCT TATGCGCGAC ACCTTATATA AGAGCTGGGA AACAGCGGAA 1200 

AAAGGCAACG TGGTTGACTG TATCGAATGC GGTTCGTGCA GCTTCACCTG TCCGGCCAAC 1260 

15 CGTCCTCTGC TGGATTATAT CCGCCAAGCC AAGAAGACTG TGATGGGTAT CCAAAGAGCA 132 0 

CGTAAGCAA 132 9 



20 



30 



35 



( 2 ) I r>f PORM AT I Ol J FOR SEQ ID MO:151 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1437 base pairs 

(B) TYPE: nucleic acid 

(C) STRAUDEDIJESS: double 
25 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: D!IA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : PORY PHYROMONAS GINGIVALIS 



(ix) FEATURE: 

(A) NAME/ KEY: mis cofeature 

(B) LOCATION 1. . .1437 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151 

ATGAAAAGAA TACAACTAAC TCTTATCGCT CTCTTCGCCG CTGTTGCCGG TTTGGTCGCT 60 

CAAAATGCTT ACGAGGGAGT AATTTCATAT AAAATTTCGT TGGACAAAAC CGGAAACAAG 120 

GTTGTACTGA ATGGTGCGGC AGATATGAGT AATTTAAAGC TCAAGAGCAC TCAGATGATC 180 

45 ATTGTTACGC CTATTCTTCG TTCAGAAGAT GGTACCAGCC GGGTGG7\ATT TCCTTCGGTA 24 0 

GTCATTACAG GCCGCAATAG AACAAAAGCT CTCAAGCGTG AAATCGCATT TAGTTCGGCT 300 

TTGCCCCAAG CAAAACATGC AGCTCAATAC ATTCGCCGTC ATAATGGGAA GAGCGAGCAG 360 

TTTGCTTTTA CAGGAGAACA TGCTTATGCA TCATGGATGA TGGATGCCAA GTTTGTGGTT 4 20 

CGTGAGGAGG TACGAGGTTG TGCTAAATGC CCTGTAGGTC TCTCGAGTAA TATTGTTCCT 4 80 

50 TTTGATCCAC TCTTCAATCC GGCAGAGGCT CCTTATTTGT TGGCACACAT TACTCCGGCA 54 0 

GAAGAAGTGG AAAAACAGCG AGAGTCCAGC TTCGATGCTT ATATCAACTT CAAAGTCAAT 600 

AAGGCAGATG TCCTTCCTGA GTATCGCAAC AATAAGGCGG AGTTAGAGAA AATCAAAGAA 660 

TTTGTAAGCA CCGTTAAGGC TAATCCAAAC TATTCGGTCA ATAAAATGAT CATCGAAGGG 7 20 

TTTGCTTCTC CCGAGGCTTC AATAGCCCAC AATAAGGCTT TGTCGGAGCG CCGTGCTAAA 78 0 

55 AGACTCGCGG AAGAATTGGT GCGTAAGTAT GGCAAAACAT TGCCGAATAT AACCACTGAA 84 0 

TTCGGCGGTG AAGATTGGAA GGGGCTGAAA CTGGCTATCG AAAAGAGTGA TATAGCCGAT 900 

CGTGACCGCG TATTGGAGAT AATCAACTCC GATAAATATG CCGATGATGA TGCACGTGAA 960 

CAGGCTCTGA AGCAACTTTC GTCTTATCGT TATATCTTGG ATCAGATCTA TCCGAATTTG 102 0 

CGTCGCAATA CGATAACCAT GGGGTATATC GTTCGTGATT ATACCCTCGA AGAAGCTCGT 1080 

60 G7WVTCATTA AGACTGCTCC GAAAGAACTT AGTGAGGCCG AAATGTACCG TGTGGCAATG 114 0 

TCTTATCCTG AGGGGCACCA AGAGCGTTTG TTTGCTCTGA ATACGACCCT TAAGTATTTC 1200 

CCTGAAAGTG TAACGGGCCG AATCAATTTG GCTGTAGCCG CTTTT7\ATGG TGGAGACGTT 1260 

CAACAGGCAA TTGCTCTGTT GAGTCCGATT CAGACAGAAA AGGGTGTAAG CAATATCCTT 1320 

GGAGCTGCTT ATGCTCGTAC GGGAGATTTT GCTCGTGCCG AAACCTTCTT CCGTAAGGCC 138 0 

65 GTTGCAGAAG GAGATGCAAA TGCGCAGCGC AACCTCGATA TGCTGCTTGG CAAAAAG 14 37 



(2) INFORMATION FOR SEQ ID NO: 152 

70 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 
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(ix) tlOLECULE TYPE: DMA {genomic) 

(iii) HYPOTHETICAL: MO 

5 (iv) AHTI-SEMSE: NO 

(vi) ORIGll-JAL SOURCE: 

(A) ORGAI^IISM: PORY PHYROMONAS GIIJGIVALIS 

10 (ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1. . .1149 



(xi) SEQUENCE DESCRIPTION: Si 

15 

ATGGCAGAAA AAAGAGACTA TTACGAAGTC 
GAACTGAAA7\ AAGCATATCG CAAGAAGGCT 
GACAAGGAGG CCGAAGAGCA CTTCAAAGAG 
CCGCAGAAGC GCAGTCAATA TGACCAGTTC 

20 GGAGGTTTCA GCGGAGGCGG TATGTCCATG 
TTCGGTGGGT TCGGCGGTTT CGGCGGATTC 
GTTCGCAGAG GGTCTGACCT GCGAGTACGA 
GGTGTGGAGA AGAAAGTGAA GGTAAAAAAG 
GGCACGGAAG AAGCCAATGG CAAGACTACC 

25 ACACGTGTGA GCAACACTTT CCTTGGGGCC 
CACGGAGAG3 GTGAGATCAT CACGAAGCCA 
ATCGGCGAAG AGGTGATCTC ATTCCACATC 
TCCGTGAACG GCAAGGGAAJ^ TGCCGCGCCC 
GTGATCGCCG AGGAACCGGA TCCGAATCTG 

30 CTGCTTATAT CCGTTCCGTT GGCTATAAAA 
GGACGAGCCA AGATCCGCAT CGAGGCGGGG 
AATAAGGGGT TGCCCAGCGT AAACGGCTAT 
GTCTATATCC CCGAATCGAT CGATGCCTW^ 
TCGGACAGCT TCAAACCTAC CGATGCTGCT 

35 ATGCTGGAT 



:Q ID HO: 152 

CTCGGTGTAT CGAAGAATGC CACCGACGAT 60 

ATCCAATACC ATCCTGATAA GAACCCCGGT 120 

GTAGCTG/^G CCTACGACGT ATTGAGCGAT 180 

GGCCATGCCG GATTGGGCGG AGCTGCCGGT 24 0 

GAGGATATTT TCAGTCGCTT CGGTGATCTA 300 

TCCGATATGG GCGGTGGCAG TCGCAGACGT 360 

GTGAAGCTTT CTTTGGCCGA TATAAGTAAA 4 20 

CAGGTAGTGT GCAGCAAATG TCGTGGCGAT 4 80 

TGCCAGACCT GCCATGGAAC CGGCGTGGTT 54 0 

ATGCAGACCC AGAGCACTTG TCCCACTTGC 600 

TGCTCCAAGT GTAAGGGCGA AGGTGTGGAG 660 

CCTGCCGGTG TAGCCGAAGG AATGCAAATG 7 20 

CGAGGAGGCG TGAATGGCGA CTTGATAGTC 7 80 

ATCCGCAATG GCAACGATCT GATATACAAT 84 0 

GGAGGTAGTG TGGAAGTGCC GACGATAGAC 900 

ACACAACCCG GCAAGATGCT GCGTTTGCGC 960 

GGCATGGGAG ACCAACTGGT GAATGTCAAT 1020 

GATGAGCAGG CTATCGCAGC GATGGAAAAC 1080 

CGTAAGGATA TAGACAAGAA ATACAGAGAG 114 0 

114 9 



(2) IMFORMATIOH FOR SEQ ID NO: 153 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 879 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDWESS: double 

(D) TOPOLOGY: circular 

45 

( i i ) nOLECULE TYPE : DMA ( genomic ) 
(iii) HYPOTHETICAL: MO 
50 (iv) AMTI-SEMSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

55 (ix) FEATURE: 

(A) NA?^E/KEY: mi sc_ feature 

(B) LOCATION 1 ... 879 



(xi) SEQUENCE DESCRIPTION: Si 

60 

ATGAAAAAAC TGATTTTAGC GACTTTGGGA 

AGCAACAACA AGGATTTGGA GAACAAAGGG 

TCCTATAAAG CTCCACGCGA AACCTATGCG 

CCCGATCAAA GGATAAGCTG GACATACACG 

65 CAGGGTATTT ATATCGATGC TCCGGATGAG 

AAGAAGAl'CA ATGTACAGAG TCTTCATGTG 

GACTTTGTCA ATAAGTTTAA GGCAGCACAT 

CTTTTCGATA CCGATGAAGA TATGCGCGAG 

CAAACGATAG AGAAAGGTGA AGCTATTGTA 

70 AATGACAGGT ATGCCCGTAT CAATAAGATC 

GGAACCGTCG AGTCCGATCC CTCTATCAAT 

GCCACGGCCG TAACAATGAT GCCGCTGATG 

ATGGCCGGAG ATGAGGACGA TAGCTGGAAG 

TCTATAGACA AGCTGGACAA TGGCAATTTC 

75 AATATCTGGC TCAAGCATAT GAAAGCCACC 



;Q ID MO: 153 

CTTATGGCCA TTGCCATGCT CTCATGTTCA 60 

GAGGCTACTC TTTTGGTAAC GTTTGGTAGC 120 

AAGATTGAGA AGACTTTTGC CGCAGCTTAT 180 

TCTTCTATTA TCCGAAAGAA ACTGGCTCAG 24 0 

GCTTTGGAGA AATTGGCTCG TCTGGGTTAT 300 

ATTCCCGGCC GAGAATATGA TGAGATGATC 360 

AGTGATATTA CTGTGAAGGT AGGGGCTCCG 4 20 

GTGGCAGAGA TCTTGCACAA GCGTTTTCAG 4 80 

TTCATGGGAC ACGGCACCGA GCATGCTGCC 54 0 

ATGAAG7\ACT ATAGCAAGTT CATGATCGTC 600 

GATGTTATTG CCG AACTGAA AGAAACCGGT 660 

AGTGTGGCAG GCGACCATGC TACGAATGAT 7 20 

ACGTTGCTGA CCAATGCCGG CTACACAGTT 7 80 

TCAGCTCTTG GAGATATAGA AGAGATCCGG 84 0 

TCTGCTCGC 87 9 
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(2) IMFORIlATIOli FOR SEQ ID 110:154 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 1068 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DllA (genomic) 

(iii) HYPOTHETICAL: MO 

15 (iv) A1ITI-SE13SE: UO 

(vi) ORIGIMAL SOURCE; 

(A) ORGANISM: PORYPHYROMOMAS GUiGIVALIS 

20 (ix) FEATURE: 

(A) NAI'lE/KEY: misc_feature 

(B) LOCATION 1 . , .1068 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154 



ATGACATCCG TCAGCCACTT ACGTACAATT TCTGTCGCAG GTATCCTGGC TGCGCTGGGA 60 

GGGGCTGTAC TCATTCTCTT CGGGGTTAAT CTCTTCCTCG GCTCGGTGGC TATTCCGATG 120 

AGCGAGATCT TCCGACATCT TTTTTCAGAT CGTCCCGAAG GAGGAGAAGC ACTCGTGCAC 180 

TACAATATCC TATGGAAATC CCGCCTGCCC GAAGCCCTCA CGGCTGCTTT TGCCGGCGCA 24 0 

30 GGTTTATCCG TTAGTGGCTT GCAGATGCAG ACCGTCTTTC GCAATCCTTT GGCCGGTCCG 3 00 

TCCGTTCTCG GCATCAGCTC CGGTGCCAGT TTGGGTGTTG CTTTGGTCGT TCTGCTGAGC 360 

GGCTCGCTGG GAGGAGTGGC ATTGAGTAGC CTGGGTTATA TGGGCGAGGT GGCCATGAAT 4 20 

ATAGCCGCTG CCGTAGGCTC GCTGGCAGTA ATGGGGCTGA TCGTTTTTGT CAGCACCAAG 4 80 

GTGCGCAGCC ACGTTACGCT GCTCATTATC GGCGTTATGA TCGGATATGT AGCCACTGCC 54 0 

35 GTCATCGGGG TATTCAAGTT TTTCAGTATC GAAGAAGATA TTCGGGCATA CGTAATTTGG 600 

GGGTTGGGCA GCTTTTCCCG TGCCACGGAT TCGCAACTGA GTTTCTTTGC CATTCTGATG 660 

TTGATCTTTA TTCCGGCCGG TATGCTCCTT GTCAAGCAGT TGAATCTCTT ATTGCTGGGA 7 20 

GAAAGCTACG CACGTAATCT GGGACTG7\AT ACTCGTCGGG CACGGCTGCT CGTGATCTCT 7 80 

TCCGCCGGTT TGCTCATCGC TACCGTCACG GCCTATTGCG GTCCCATCGG CTTTTTGGGG 84 0 

40 ATGGCTGTGC CACACTTGGC ACGGGTTATC TTTCACACAT CGGATCATCG GATCCTGATG 900 

CCTGCTACCT GTTTGATTGG AAGTGCTCTG GCTCTTTTCT GCAATATCAT TGCTCGTATG 960 

CCGGGGTTTG AGGGGGCTTT GCCCGTCAAT TCCGTAACGG CTTTGGTGGG AGCACCTATT 102 0 

ATCGTCACCG TTTTGTTCCG GCGCAGACGC TTC/VAGGAAG PJ\ACCGAC 1068 



( 2 ) I N FORr lAT I ON FOR SEQ ID M 0 : 1 5 5 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2271 base pairs 
50 (B) TYPE: nucleic acid 

( C ) STRANDEDNESS : doubl e 

( D) TOPOLOGY : circular 



(ii) MOLECULE TYPE: DUA (genomic) 

(iii) HYPOTHETICAL: MO 

(iv) ANTI- SENSE: UO 

60 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAllE/KEY: nusc_feature 
65 (B) LOCATION 1...2271 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 155 

ATGCGGACAA AAACTATCTT TTTTGCGATT ATCTCTTTTA TTGCTCTATT GTCGTCTTCT 60 

70 CTGTCGGCTC AGAGCAAAGC CGTTTTAACC GGTAGTGTGT CGGATGCCGA AACCGGAGAG 120 

CCTCTTGCCG GTGCTCGAAT CGAAGTCAAA CACACCAACA TAGTAGCCGG TGCCGATGCC 180 

GGCGGACATT TCGAGATCAA GAACCTGCCG GCAGGGCAGC ATACTATTAT ATGTTCGTTG 24 0 

GGGGGGTATG GACAGAAAGA GGAGGTGGTT GCCATCGAAG CCGGACAGAC CAAAACGATC 300 

TCTTTTGCAT TGCGACTGCG AACGAACAAC TTGGAGGAAG TCGTCGTTAC CGGTACCGGT 360 

75 ACACGTTACC GCTTGGTCGA TGCTCCTGTG GCAACGGAAG TCCTTACCGC TAAGGACATA 4 20 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1J_> 



wo 99/29870 



PCT/AU98/01023 



105/490 



GCCTCTTTCT CGGCTCCTAC TTCCGAGGCC 
TTCGGCCCCA ATCTGATGGG CTCTTTCATG 
CTCATCCTTA TCGATGGTAA GCGTGTGTAC 
CGTATTTCTC CTGATCAGAT CGAACGGATC 
5 TACGGATCCG ATGCCATCGC CGGGGTAATC 
CTGAGTGCAT ATACGTCACA TCGCATATCG 
CTCGATATAA ACATCGGTAA GTTCAGTAGC 
GGCTGGCAGA ATAGTCCGTT CGAAATAAAA 
GAGGAAACGT ATAAGAAAAC TTTTCGTGCA 

10 TCCTATTATG CAACTAACAA TCTTAGCTTC 
ATCTTCACTC CGACTTTTTC CGAAAAGAAG 
GCTTCACTCG GTACGAACTA TCTTTTCCCC 
GTCTACGATC GCTTCCGTTT CGGATATTTG 
ATCAACAACC AAGGTCAGAC CGAGCAACCC 

15 AACGATCAGA TCCGATACAC GGCAGAGGCT 

AAACTGACCG GCGGTTTGGA GTATTTCCGT 
ACCGACAAGG CAGATGCTTC CACGCTCTCT 
GATTGGTTCA ATATGACAGC CGGTTTCCGT 
ATGACGCCTA AGGTATCCAT ACTCGCCAAG 

20 GCTAACGGCT ATAAGACTCC CACGCTGAAA 
ATGGGTTCGC ACAATCTCTA TCTCGGCAAT 
TATGCTTTGG GCTTGGAGTA CAATCAAGGC 
AATGAACTTC GCAATCTGAT CTCCTTTATG 
CAGGGAATCA AGAAAACCAA GCAGTATGCC 

25 GATGTCCTAT GTGATGCCTC TATCGGTTGG 

GTGGAAGCTA AGAATCTCCA GACGGATGAG 
AATGTGCACG CCGATTGGGT TCACTACTGG 
GGCCGTATTC AGAGCGAGCG TTACTACAAA 
CGACTCGCCA CATCGCATCG TTTCGCTCAT 

30 GGTATAGACA ACCTGTTTGA CTACGTGGAT 

GTAACGCCGG GACGTACTTT CTTTGCTCAA 



TTATTGCAGG GGCTGAGTCC GTCTTTTGAC 4 80 

CAGCTGAACG GCCTTAGCAG TAAGTATATC 5^0 

GGCGATGTAG GCGGTCAGGC CGATTTGAGT 600 

GAACTGGTGA AAGGTGCTTC GAGTTCGCTC 660 

AATGTGATCA CAAAAAAGAA TACGAATCGA 720 

AAGTACAACG ATCGGCAAAC CAATACTTCG 7 80 

AATACCAACT ATTTCTTCTA CCATACGGAT 84 0 

AAGAAAAAAG GATCCGGCGA ACCGGTCTTG 900 

CAGGAAAATC AGGGTGTAAG CCAATCGCTT 960 

AGCGGAAATG TGCAGTACAA TAAACGTCAG 102 0 

GCCTATGACA TGGATTATCG TGCTTTGACG 1080 

AATGGTCTGC ATACGCTTTC TTTCGATGCC 114 0 

TATCATGACA AGGACAGCAG TGAGAGCCTG 1200 

ACATTCTTTC CGGGTCAGCT ACGCAATAAA 1260 

CGCGGTGTAT TTACACTGCC TTATGCGCAG 132 0 

GAGGAATTGA TCTCTCCCTA TAATTTGATT 138 0 

GCTTATGTAC AAGATGAATG GA7\ACCGCTC 1^40 

CTGGTACACC ATCAGGAGTT CGGTACACGA 1500 

TATGGGCCGC TGAACTTCCG CGCTACGTAT 1560 

GAGCTTTTTG CACGGAACGA ACTCACCACT 162 0 

GCGGATCTTA AGCCACAGAT GTCGGATTAT 168 0 

CCTATCTCGT TCAGTGCAAC GGTTTATGAC 17 4 0 

GATATACCGA CCTCACCCGA GCACGAAGCT 18 00 

AACATAGGAA AAGCTCGCAG CCGCGGCCTT 1860 

GGTATCAAGT TAGGAGCCGG ATACAGCCTC 192 0 

TGGCTGGAAG GAGCTGCACG TCATCGTGCC 1980 

GGTCAGTATA GACTTGGCGT GAGCCTTTTC 2 04 0 

GACGGCAATG CTCCGGACTA TACCTTGTGG 2100 

TTCCGCCACA TCATCCTGGA TGGAACGCTC 2160 

GATCGTCCTA TGGGTGTCAA TTATGCTACC 2 220 

ATAGCGATTC GATTCAACAA C 2271 



35 



45 



I H FORMATION FOR SEQ ID HO: 156 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 993 base pairs 

( B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 
40 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HVFOTHETICAL: NO 
(iv) AIITI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMOHAS GINGIVALIS 

50 

(ix) FEATURE: 

(A) NAME/KEY: niisc_f eature 

(B) LOCATION 1 . . . 993 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156 



ATGACGGACA ACAAACAACG TAATATCGTA 
ATCGCAGTGG TGACGATCGT TGGTTTTTTC 
GGACAGATAG AAGTGACCGA ATACCGAGTG 

60 CTTAGGGTAT CCGAGGGACA GCAGGTGCAG 

CCCGACGTAG CGGCTAAGAT GGAGCAGGCA 
AACGCCAAGG CTCTCAAAGG AGCACGCAGC 
CAGAAAGCTC AGGCCGGCGT AGCCATAGCG 
TATGACCAGG GAGTGGTACC GGCTCAGAAG 

65 GCCATCGCTA CGCAAAAAGC GGCCGAAGCC 

CGCGAAGACA AGCTGGCAGC TTCTGCCCTC 
GTGGAGTCGT ACATCAACGA AACCTACCTC 
ATATTCCCCA AAGCCGGCGA ACTCGTAGGT 
ATGGGCGATA TGTGGGCCAG CTTTGCCGTT 

70 GGAGCCGTTC TGGAGACTGT GGTGCCGGCT 
ACATTCATCA AGAACATGGG TACCTATGCT 
TACGACCTGA AGACCTTCGA GGTAAAGGCC 
CTACGCCCGG GTATGTCCGT GATCATACGC 

75 



TTCCCGGCGT TTCTCCTCTT GCTGGGAGTC 60 

ATGCTCAGAC CGGCCGAGGA GATTATCCAA 120 

TCCAGCAAAG TGCCCGGGCG CATCAAGGAA 180 

GCCGGCGATA CCCTCGCTGT CATCGAAGCC 24 0 

AAGGCTGCCG AAGCAGCTGC ACAGGCTCAG 300 

GAACAGATAC AGGCAGCCTA TGAGATGTGG 360 

ACCAAGACAC ACCAGCGCGT GCAGAACCTC 4 20 

TTGGACGA.^\G CCACTGCCCA GCGCGATGCG 4 80 

CAGTACAATA TGGCTCGCA.^ CGGTGCCGAA 54 0 

GTCGATAGAG CGAGAGGAGC CGTCGCCGAG 600 

ATCGCCCCAC GGGCAGGCGA AGTGTCGGAG 660 

ACCGGCGCAC CTATCATGAA TATCGCCGAG 72 0 

CGTGAGGATT TCCTCAGCAG CATGACCATG 780 

CTGAATGAAG AAA7\AGTACG CTTCAAGATC 84 0 

GCCTGGAAAG CGACCAAGAC AACAGGGCAG 900 

ACCCTTGCGG ATAAAGACAA GGCACAAAAG 960 

AAG 993 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929a70A1_l_> 



wo 99/29870 



PCT/AU98/01023 



106/490 



(2) INFORMATION FOR SEQ ID NO:157 



10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 base pairs 
{B) TYPE: nucleic acid 

(C) STRAUDEDNESS: double 

(D) TOPOLOGY; circular 

( i i ) MOLECULE TY PE : DNA ( genomi c ) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

{vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



20 



(ix) FEATURE: 

(A) HAIIE/KEY: misc_feature 

(B) LOCATION 1 . . . 801 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157 



25 



30 



35 



ATGCGTATTG 
CGTTCCCAGC 
ATCGCTAAAC 
AGTAGAGACC 
CGCAACGGCT 
AATTATCCTA 
GAGATACAGA 
CTTCAAAATT 
AAAGGAGAGC 
GGTCGCCGCG 
ACGGATCTCA 
GTCATCTATG 
AACGTAAACG 
ACGATAATAG 



TCAGTAATTT 
GAGAAAAGGT 
CATATGACGT 
CGGAGCTTTC 
ATGGAACGAA 
TTTTAGGCCA 
AGAGGATTAT 
TCAAGGTGTC 
GAAT7VACTCT 
ATCGGGTTTT 
GAAAGGCCGA 
TGGAGCCGAA 
TATGGCTGAG 
ATAAGACCAA 



TTTGTTCGTC 
CGTTTACCTG 
AAAAATTGAG 
AACGCCCTAC 
CTCGAACGAA 
GATCTATGTA 
TTCGAGTGGA 
GGTTTTGGGA 
TTTGGAAGCG 
TGTGATTAGA 
CTTGCTCGCA 
CGACAAGAAA 
TGTTACCTCC 
A 



TCTTTTTCGG 
CAAGATATCC 
AAGGACGATG 
AACCAAGTGT 
GGCTTCCTGG 
GAGGGCCTTA 
TTTATCAAGG 
GAGGTGAATC 
ATCGGAATGG 
GAAACCGATG 
AGCCCCGTGT 
ACACAGATGA 
ACTTTGGTAT 



TTTTGCTTTT 
AAACTTTTAA 
TGCTGAACAT 
TGACCACTCG 
TCGATTCGAA 
CTCGTACCGA 
ATCCTACGGT 
ATCCGGGTTC 
CCGGAGACCT 
GGCATCGCGA 
ACTATCTGCA 
GCGAGATCAA 
CCATTTCCAC 



TGCATCATGC 
TCGGGAGATT 
CCTTGTCAGC 
TGCACTGGCC 
AGGGTACATC 
ACTGGAGAAG 
AACGGTGCAG 
GATGTCGGTA 
GACAATCTAT 
GGTTTTCCAG 
TCAGAACGAC 
CCAGAATAAT 
GCTGACGATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
801 



40 (2) INFORMATION FOR SEQ ID NO: 158 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1707 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

{ii) MOLECULE TYPE: DNA (genomic) 

50 (iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



60 



( ix ) FEATURE : 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 . . .1707 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158 



ATGAAA7\AGA CCAATCTGTT TTTATCTCTG CTGGTGATCT TTATCACCGG TAGTTTTATG 60 

ACTGCCTGTG CACAGAAGTC CAAGACGAAC AAACTCACCG AAGAAGATCG GAGCCGCAAT 120 

65 GAGTATGTAC AGTCGATGGA TGTGCTTAGC AATATTATCG GTAACGTCAG GCTGTATTTC 180 

GTCGATACCA TAAGTATCAA ACATATGACT CGGCGTGGTA TAGATGCGAT GTTGGGCGGG 24 0 

CTTGACCCCT ATACCGAATA CATTCCTTAC GAGGAAATGG ATGAACTGAA ATTGATGACT 300 

ACGGGAGAGT ATGCCGGAGT CGGAGCTATC ATATCGCAGC GCCCGGATAG TGCTGTGATT 360 

ATCCAGAGAC CTATGGAAGG TATGCCCGCA GACGAAGCAG GATTGATAGC AGGCGACCGC 4 20 

70 ATCCTGACTA TCGATGGGAA AGACTTCCGC AAATCCACCA CACCGAAAGT AAGCCAAGCA 4 80 

CTGAAAGGGA TAGCCGGTAC TGTTGCAAAG GTGACAGTAA TGCGCTATGG CGAAACCAAA 54 0 

CCTCGTACTT TTTCCGTG/iJX ACGTC7WVAA GTGATTATGA ATTCCGTCAC TTACAGCGGA 600 

ATGCTCGATG GCTCGATAGG ATATATCCGC TTGAACAACT TTACGGACAA AAGTGCAGAA 660 

GAGGTGCGCA CGGCCTTGTT GGATCTTCGT GACAAACAAG GAGCGAAAGG TCTCATTTTG 7 20 

75 GATTTAAGAG GCAATGGTGG CGGACTGATG CAGGCTGCTA TCGAGATAGT CAATCTGTTC 7 80 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1J_> 



wo 99/29870 

107/ 490 

GTCCCTAAGG GCAAAGAGGT GGTAACGACC AAAGGTCGCA TTGCAGAGTC GGCGTCCGTA 84 0 

TTTCGCACAT TGACTGAACC GATCGACACG AAACTCCCGA TAGTAGTCCT GATCGATGGA 900 

CAATCGGCAT CTTCCTCGGA GATTGTAGCC GGAGCACTGC AGGATATGGA CAGGGCTGTA 960 

CTGATGGGAC 7U\AAGAGCTA TGGCAAAGGG CTTGTACAAA CGACTCGTCA GCTACCATAC 1020 

5 AACGGCGTGA TCAAATTGAC TACGGCCAAG TACTACATCC CAAGCGGACG TTGTATTCAG 1080 

CGTTTGGACT ACAGCCGCAC CAATCGGACA GGTATGGCAA CGGCCATTCC TGACAGTCTG 1140 

CACAAAATCT TTTACACTGC TGCCGGAAGA CGTGTAGAAG ATGCAGGAGG AATCCTGCCT 1200 

GACATCGAGG TCAAACAAGA TACAGCTGCG ACATTACTTT ATTATATGGC CATCAATAAT 12 60 

GACGTTTTCG ATTTCGTCAC AGGTTATGTG CTCAAGCATA AAACGATTGC CAAGCCGGAG 132 0 

10 GATTTTTCCA TAACGAACGA GGACTATGCA GCTTTCTGCA AGATGATGGA AGAAAAGAAA 1380 

TTTGACTATG ATCGCCAGAG TGGCAAGATG CTTGACAAAC TGGAGGAACT GGCTAAGATA 1440 

GAAGGCTACC TGCCGGAAGC CAACTCGGAG CTTAAAGCAC TACGCGAAAA GCTAAAACCC 1500 

AACCTGTCGC GTGATCTGCT ACGATTCAAA AAGGAGATAA CAAACTATCT CAACAATGAG 1560 

ATTGTCACTC GCTATTATTA TGAGCGAGGC AGTATCCGCC AGAGTTTGCC GGAAGATAAG 1620 

15 GTAGTCAAAG AAGCTATTAA GCTGCTGAAG GACCATCCGG AACAAATTCG ACAGATCCTT 168 0 

GCAGCTCCGA AAGCAGAGAA TAAAGGG 17 07 



PCT/AU98/01023 



20 



30 



35 



(2) in FORMATION FOR SEQ ID t JO: 169 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2943 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 
25 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



(ix) FEATURE: 

(A) NAI4E/KEY: misc_feature 

(B) LOCATION 1 . . .2943 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 9 

ATGCAAAACA AAGGATTTGT GATTGTTATC ACATCGGCTC TGGCCATCAT CTGTGCGTTT 60 

TACCTGTCAT TCTCTTTCGT TACGAACCGT TACGAAAAGA AGGCTAAGGC GATGGGCGAT 120 

GTTGCCGGAA TGGCCTATCT TGATTCCATG TCG/^TGAGA AGGTCTGGTT CGGCTACACG 180 

45 CTGAAAGAAG CTCAAGCCCA GCAAATTGGT CTTGGCCTTG ACTTAAAGGG GGGTATGAAC 24 0 

GTTATCTTGA AACTTAACGC AAGCGATCTG CTTCGTAACC TCTCTAACAA AAGTTTGGAT 3 00 

CCCAACTTCA ACAAAGCTCT GGAGAATGCT GCCAAGAGCA CGGAGCAATC CGACTTCATC 360 

GATATTTTCG TGAAGGAATA TCGCAAGCTC GATCCCAACG GTCGCTTGGC CGTTATCTTC 4 20 

GGTTCGGGTG ACCTTCGCGA CCAGATTACC GCAAAGTCTA CGGATGCAGA CGTAGTGCGT 4 80 

50 CTGCTCAAAG AAAAATATAA TAGTGCTGTA GAAGCTTCGT TCAATGTGCT CCGTGCTCGT 54 0 

ATCGATGCTT TCGGTGTGGT TGCACCTAAT TTGCAGCGAT TGGAAGGGCA AGGGCGTATC 600 

CTTGTCG7V\C TCCCCGGAGT GAAAGACCCT GAGCGTGTTC GTACCCTTTT GCAACGCAGT 660 

GCCAACCTAC AGTTCTGGCG TACATACAAA TTCGAAGAGG TCAGCGGAGA CTTGATCGCT 7 20 

GCCAATGATC GTCTGAGCGA ATTGGCTATG AACAACACGG ATGCTACCCC GGAAACAGAG 7 80 

55 CCTGCAACTA CTGACTCTGT AGCTGCAACA GCCGATTCTG CTGCTGTACA AGCTGTAGCT 84 0 

GATTCTGCTA CTGTAGCACA AAAAGAGGCC AAGGATGCTA CTCGTAAAGA CGCACTCTTC 900 

TCTCTGCTTA CTCCCGTGAA TCGTGGCGGT GCAGTAGTGG GTGTGGCTCG TCGTGCTAAT 960 

ATGGCTCAGA TATCTGAAAT GCTCCAGCAA GCTCACGATC TGAAGGTTAC ACGTGAAGAT 1020 

GTGCTTTTCC TCTGGGGTGC TAAAGCAATC GAAGACCCCG 7\AACC7WW\ GGAGACCGAC 1080 

60 CTCTACGAAC TCTATGCTAT TCGTACCAAT CGTACGGGAG ATCCTGATTT GGGAGGTGAT 1140 

GTAGTGACTT CCGCCAAGAG TGATATCCAA AATGACTTCG GTCGTTCCGA ACCGATCGTT 1200 

TCGATGACGA TGAATGAAGA AGGTGCTCGT AA.ATGGGCGC GTATCACAAA GGATAACGTG 1260 

GGACGGGCAA TCGCTATCGT TTTGGATGGT GTGGTTTATT CTGCTCCGAA CGTGAATGAT 132 0 

GAGATCACGG GCGGTCGCTC TCAGATCTCC GGGCACTTCA CCGTGGAGGA GGCCGGTGAC 1380 

65 CTTGCCAACG TACTCAACTC CGGTAAAATG GATGCTACGG TAAGCATCGA ACAGGAA.AAC 14 4 0 

GTGATTGGTC CTACGCTGGG TGCCGAGTCC ATTAAAGCCG GATTCTTGTC GTTCCTGCTC 1500 

GCTTTGGTTA TCCTGATGTG TTACATGTGT CTGGCTTACG GTTTCTTGCC GGGTCTTATC 1560 

GCAAACGGCG CATTGATTGT AAACAGCTTC TTCACATTGG GCGTATTGGC TTCTTTCCAT 1620 

GCCGTGCTGA CCCTCTCGGG TATCGCAGGT TTGGTGCTGA CGCTGGGTAT GGCTGTGGAT 1680 

70 GCCAACGTAC TTATCTTCGA GCGTATCAAA GAAGAGCTTC GTGCCGGTAA GACTCCGATT 174 0 

CGTGCCGTTA CGGATGGTTA TGGCAACGCT TTCTCTGCCA TCTTCGACTC GAACGTTACG 1800 

ACTATTATTA CCGGTATCAT CCTATTCCTC TACGGGACGG GGCCGATTCG CGGTTTTGCC 1860 

ACTACGTTGA TTATCGGTCT TATCGCTTCT TTCATTACGG CTGTCTTCTT GACTCGTATC 1920 

GTCTTCGAGA AACTGGCGAA AAAAGGTCGT TTGGATAAGA TTACATTCAC TACGAGCATT 198 0 

75 ACTCGCAATC TCCTTGTCAA TCCCTCATAC AACATCTTGG GTAAGCGCAA GACCGGCTTT 204 0 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929e70A1_L> 



wo 99/29870 



PCT/AU98/01023 



108/490 



ATCATTCCGG TGATTATCAT CGTTTTGGGA 

AGGGGTATTG AATTCTCCGG AGGACGTAAC 

TCCGAAGCCG TTCGTTCGGC CTTGTCTTCT 

ATCGGTACTG AAGGGACAGA GGTGCGTATA 

5 GAAGAAACTG AAGCAGAGAT TACTGACAAA 

CAGCAGCCTA CTGCTGATCA GTTCTTGGAC 

AGTATGTCGA GTGACATCAC GAGAGGTGCT 

ATGGCCATTT ATATTCTGAT TCGCTTCCGT 

TCTGTGGCCG CTACTACATT CTGCATTATT 

10 CCCTTCACCA TGGAGATCGA TCAGAACTTC 

TCGCTCAATG ACACCGTGGT TGTATTTGAC 

AACAGAGATC GCTATCAGGT GATCAACGAT 

AATACGTCTT TGACTACGTT TATCGTTATG 

ATGCGTAGTT TCACGTTCTC GATCCTGCTC 

15 TTTGTTGCTA CACCCCTTGC CTACGAGATC 
AAA 



CTTATAGCTT CATTTACAAT CGGTCTCAAT 2100 

TACGTAGTTA AATTCGACCA GCCTGTATCT 2160 

CCCCTGCAGG AAAAGGTATT GGTTACCTCC 222 0 

TCTACGAACT ATAAGATCCA GGAGGAAAGC 22 8 0 

TTGTATCAGA GCCTGAAAGG TTTCTACACC 234 0 

AATATCATTA GCTCTCAGAA AGTAAGTCCC 24 00 

ATTTGGGCTG TGCTGTTATC GATGATCTTC 24 60 

GACATTTCTT TCTCTGCCGG GGTATTCGTA 252 0 

GCTCTGTATG CGTTGCTGTG GAAGATTCTG 2580 

ATCGCTGCTA TTCTGGCTAT CATCGGTTAC 264 0 

CGTATCCGAG AGACGATGAA ATTGTACCCC 27 00 

GCCCTTAATT CAACATTGGG TCGAACATTA 27 60 

TTGGTAATCT TCATCTTTGG AGGTGCTACG 2820 

GGTATCGTTA TCGGTACATA CTCTACGCTC 2880 

CAAAAGCGCA AGCTCAACAA AGCAGCTAAG 294 0 

2943 



20 



30 



(2) IMFOR]lATIOM FOR SEQ ID MO: 160 



(i) SEQUEIJCE CHARACTERISTICS: 

(A) LENGTH : 3051 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
25 (D) TOPOLOGY: circular 



(ii) HOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: MO 
(iv) AHTI-SEMSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHONAS GINGIVALIS 



35 

(ix) FEATURE: 



(A) NAI'lE/KEY: misc_f eature 

(B) LOCATION 1 . . . 3051 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160 



ATGAAAAGAA TGACGCTATT CTTCCTTTGC 
CAGAATAGAA CCGTGAAGGG TACAGTTATC 
GCGAATGTCG TGGTTGTCGG AAACACCACT 

45 TTCACGCTTA GCGTGCCTGC CAATGCCAAA 
ACC7\AAGAGG TCGCCATCGC TAATGTGATG 
CTGGAGCAGG TAGTTGTATT GGGTTACGGT 
TCTGTGGCCA AAGTGTCCAG CGAAAAGCTC 
GCCCTCCAAG GTCAGGTAGC CGGTATGCAG 

50 GTCGCTTCTG TGGAGATCGA TGGTACAGGG 
ATCGTGGATG GTATGCAAAC TTCTTTGGAT 
GAATCTATGT CCGTTTTGAA AGATGCTTCT 
AACGGAGTCG TTTTCATTCA AACGAAGAAA 
TTTAATGCCA GTTACGGGAT TTCTCAAATC 

55 ACTGGAGATG AATTGCTGGA TTTTCAGGTG 
GTTCAGAAGG TTAAAGATAT GATCCTTGCC 
TCTTTGAAAG ATGAGTATGG TAAGACATTG 
TGGCTCAAGG CTTTGTTTAA AACAGCACCC 
GGGTCTCAGG GT^CTTCATA TTATGCCTCT 

60 CGTGAACCGG CAAATTTTAA GCGCTATAGT 
GAATGGCTGA AAGTTGGTGC AAATTTGTCT 
TATTTTGGAA AGTATTATAT GGGGTCAGGT 
TATAACCCTT TTGATGTGAA TGGGGATTTA 
AGACCTTCTA TGACAGAACC GTACTTCGCA 

65 CAGGCC7VATG TAAATGGTTT CGCCCAGATT 
CAGGCTGGTG TTGATATTAC T7KATACTCGC 
TATGATTCTA CTCCTCTTGG GGAAAGAAGA 
TTTACAAATA CGGCTGAATA TAAGTTTTCA 
ATGGGGCATG AATATATTGA ATATGAAGGG 

70 GAAAGTGATA AGTTGATGTT ACTGAGCCAG 
GAACACAGAG TCGCTGAATA TGCCTATTTG 
GACAAATGGA TGTATATAGA TTTCTCTGTT 
AATAATAGAA GCGCGTGGTT CTATTCTGTC 
ATTCAAGAAA GTAATTGGCT CAGTGATCTT 

75 TVACTCGGAGA TTGGTAATTA CAACCACCAA 



TTGCTGACGA GCATTGGGTG GGCTATGGCC 60 

TCCTCCGAGG ATAATGAGCC CCTGATCGGC 120 

ATCGGTGCTG CAACCGACTT GGATGGCAAC 180 

ATGTTGAGAG TGTCCTATTC CGGTATGACT 24 0 

AAGATCGTAC TGGATCCGGA CTCTAAGGTT 300 

ACGGGACAGA AACTCAGCAC TGTTTCCGGT 360 

GCGG7\AAAGC CCGTTGCCAA TATCATGGAT 4 20 

GTTATGACTA CATCCGGTGA CCCTACTGCC 4 80 

TCGTTGGGGG CAAGCTCTGC ACCATTGTAT 54 0 

GTTGTGGCTA CGATGAATCC GAATGATTTT 600 

GCAACATCTA TTTATGGAGC TCGTGCTGCA 660 

GGTAAAATGA GCGAGAGAGG TCGTATTACC 720 

CTGAATACTA AGCCCCTTGA TAATATGATG 7 80 

AAGGCAGGTT TTTGGGGGAA CAATCAAACC 84 0 

GGAGCTGAAG ATTTGTATGG CAATTATGAT 900 

TTCCCAGTGG ATTTTAATCA TGATGCAGAC 960 

ACCAGTCAAG GTGATATTTC TTTCTCCGGA 1020 

ATAGGCTACT TCGATCAGGA AGGTATGGCT 108 0 

GGCCGGCTCA ACTTCGAAAG TCGTATCAAT 114 0 

GGTGCGATAG CGAATAGACG ATCTGCCGAC 1200 

ACTTTCGGTG TGTTAACGAT GCCTCGTTAT 1260 

GCAGATGTCT ATTACATGTA TGGAGCTACC 1320 

AAAATGAGAC CGTTCAGTTC CGAATCACAT 1380 

ACTCCGATCA AAGGCCTTAC TTTAAAGGCA 14 40 

ACTTCTTCTA AGAGAATGCC CAATAATCCG 1500 

GAAAGAGCTT ATCGAGATGT TAGCAAGTCT 156 0 

ATTGATGAAA AACATGATCT TACAGCATTG 1620 

GATGTTATTG GGGCATCTTC TAAAGGATTT 1680 

GGAAAAACCG GAAATAGTTT GTCTTTGCCT 17 4 0 

TCTTTCTTTA GTCGTTTTAA TTACGGTTTT 1800 

CGTAATGACC AATCCTCTCG ATTCGGATCC 1860 

GGTGGAATGT TTGACATATA TAATAAATTC 1920 

CGACTGAAAA TGAGTTATGG TACAACGGGT 1980 

GCACTCGTTA CTGTGAACAA TTATACTGAA 2 04 0 
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GATGCTATGG GGCTTAGCAT TTCTACAGCA 

TCTCAGTTCA ACTTCGGTTT GGCTGCAGGG 

GATTTCTATG TCCGCACTAC GAATGATATG 

GGTTTCTTCT CACAGTATCA GAATGTAGGC 

5 CTTAAGGGGA CGATCTACCA AAATAAGGAC 

TACAATAGAC AGGAAATAAC AAAGCTTTTC 

ACCGGTACTA TATGGGAAAT TGGGTACCCC 

ATCGACAAAA AAACCGGTAA GCAGTTGTGG 

AATAAAGTTA CAACAAGCCA GTACTCAGCT 

10 ACTCCTCCTA TTACAGGTGG TTTCTCCTTA 

GCTGATTTTG CCTACATCGT TGGTAAATGG 

AATGCAGGTG GATTGATGCA ATTAAATAAA 

GATAATAAAG AAACAGATGT TCCAAAATTG 

TTGGAGAATG CTTCTTTCCT GCGTTTGAAG 

15 AGTCTTTTTG CTGGGCAGAA TGTGATTGGT 

CTGTTAACTG TTACGAAGTA TAAAGGCTTT 

AATCAATATC CTAATTCTAA GCAGTACGTT 



GGCAATCCCG ACCTCTCGTG GGAAAAGCAG 2100 

GCTTTCAATA ATCGCTTATC TGCAGAGGTA 2160 

TTGATTGATG TCCCGATGCC TTATATCAGT 2220 

TCTATGAAAA ATACGGGTGT AGACCTTTCT 2280 

TGGAATGTAT ATGCTTCTGC GAATTTCAAC 2 3*30 

TTCGGTCTCA ATAAGTACAT GTTGCCTAAT 2^00 

AATTCGTTCT ATATGGCTGA ATATGCTGGA 2 4 60 

TATGTTCCTG GTCAAGTCGA TGCGGATGGT 2 520 

GACTTGGAGA CACGAATTGA TAAGTCTGTT 2580 

GGTGCTTCTT GG/W\GGACT TTCTTTAGAT 2 64 0 

ATGATCAATA ATGACCGTTA CTTTACAGAG 27 00 

GATAAAATGC TATTGAATGC CTGGACAGAG 27 60 

GGACAGTCTC CTCAGTTTGA TACGCATTTG 2 82 0 

AATCTCAAAC TCACCTATGT ACTCCCCAAT 2 880 

GGAGCTCGTG TCTATTTGAT GGCGCGCAAT 2 94 0 

GACCCTGAAG CAGGGGGGAA TGTGGGAAAA 3000 

GCGGGTATTC AGTTGTCTTT C 3051 



20 



(2) ItJPORIIATIOH FOR SEQ ID NO: 161 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3042 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRAMDEDMESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 
30 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



35 



(vi ) ORIGINAL SOURCE: 

(A) ORGTMIISH: PORYPHYROMOHAS GINGIVAL IS 



40 



45 



50 



55 



60 



65 



70 



75 



(ix) FEATURE: 

(A) NAI1E/KEY: mi sc_f eature 

(B) LOCATION 1 . . . 304 2 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 161 



ATGACGCTAT 
ACCGTGAAGG 
GTGGTTGTCG 
AGCGTGCCTG 
GTCGCCATCG 
GTAGTTGTAT 
AAAGTGTCCA 
GGTCAGGTAG 
GTGGAGATCC 
GGTATGCAAA 
TCCGTTTTGA 
GTTTTCATTC 
AGTTACGGGA 
GAATTGCTGG 
GTTAAAGATA 
GATGAGTATG 
GCTTTGTTTA 
GG AACTTCAT 
GCAAATTTTA 
AAAGTTGGTG 
AAGTATTATA 
TTTGATGTGA 
ATGACAGAAC 
GTAAATGGTT 
GTTGATATTA 
ACTCCTCTTG 
ACGGCTGAAT 
GAATATATTG 
AAGTTGATGT 
GTCGCTGAAT 
ATGTATATAG 
AGCGCGTGGT 
AGTAATTGGC 



TCTTCCTTTG 
GTACAGTTAT 
GAAACACCAC 
CCAATGCCAA 
CTAATGTGAT 
TGGGTTACGG 
GCGAAAAGCT 
CCGGTATGCA 
ATGGTACAGG 
CTTCTTTGGA 
AAGATGCTTC 
AAACGAAGAA 
TTTCTCAAAT 
ATTTTCAGGT 
TGATCCTTGC 
GTAAGACATT 
AAACAGCACC 
ATTATGCCTC 
AGCGCTATAG 
CAAATTTGTC 
TGGGGTCAGG 
ATGGGGATTT 
CGTACTTCGC 
TCGCCCAGAT 
CTAATACTCG 
GGGAAAGAAG 
ATAAGTTTTC 
AATATGAAGG 
TACTGAGCCA 
ATGCCTATTT 
ATTTCTCTGT 
TCTATTCTGT 
TCAGTGATCT 



CTTGCTGACG 
CTCCTCCGAG 
TATCGGTGCT 
AATGTTGAGA 
GAAGATCGTA 
TACGGGACAG 
CGCGGAAAAG 
GGTTATGACT 
GTCGTTGGGG 
TGTTGTGGCT 
TGCAACATCT 
AGGTAAAATG 
CCTGAATACT 
GAAGGCAGGT 
CGGAGCTGAA 
GTTCCCAGTG 
CACCAGTC7y\ 
TATAGGCTAC 
TGGCCGGCTC 
TGGTGCGATA 
TACTTTCGGT 
AGCAGATGTC 
AAAAATGAGA 
TACTCCGATC 
CACTTCTTCT 
AGAAAGAGCT 
AATTGATGAA 
GGATGTTATT 
GGGAAAAACC 
GTCTTTCTTT 
TCGTAATGAC 
CGGTGGAATG 
TCGACTGAAA 



AGCATTGGGT 
GATAATGAGC 
GCAACCGACT 
GTGTCCTATT 
CTGGATCCGG 
AAACTCAGCA 
CCCGTTGCCA 
ACATCCGGTG 
GCAAGCTCTG 
ACGATGAATC 
ATTTATGGAG 
AGCGAGAGAG 
AAGCCCCTTG 
TTTTGGGGGA 
GATTTGTATG 
GATTTTAATC 
GGTGATATTT 
TTCGATCAGG 
AACTTCGAAA 
GCGAATAGAC 
GTGTTAACGA 
TATTACATGT 
CCGTTCAGTT 
AAAGGCCTTA 
AAGAGAATGC 
TATCGAGATG 
AAACATGATC 
GGGGCATCTT 
GGAAATAGTT 
AGTCGTTTTA 
CAATCCTCTC 
TTTGACATAT 
ATGAGTTATG 



GGGCTATGGC 
CCCTGATCGG 
TGGATGGCAA 
CCGGTATGAC 
ACTCTTVAGGT 
CTGTTTCCGG 
ATATCATGGA 
ACCCTACTGC 
CACCATTGTA 
CGAATGATTT 
CTCGTGCTGC 
GTCGTATTAC 
ATAATATGAT 
ACAATCAAAC 
GCAATTATGA 
ATGATGCAGA 
CTTTCTCCGG 
AAGGTATGGC 
GTCGTATCAA 
GATCTGCCGA 
TGCCTCGTTA 
ATGGAGCTAC 
CCGAATCACA 
CTTTAAAGGC 
CCAATAATCC 
TTAGCAAGTC 
TTACAGCATT 
CTAAAGGATT 
TGTCTTTGCC 
ATTACGGTTT 
GATTCGGATC 
ATAATAAATT 
GTACAACGGG 



CCAGAATAGA 
CGCGAATGTC 
CTTCACGCTT 
TACCTVAAGAG 
TCTGGAGCAG 
TTCTGTGGCC 
TGCCCTCCAA 
CGTCGCTTCT 
TATCGTGGAT 
TGAATCTATG 
AAACGGAGTC 
CTTTAATGCC 
GACTGGAGAT 
CGTTCAGAAG 
TTCTTTGAAA 
CTGGCTCAAG 
AGGGTCTCAG 
TCGTGAACCG 
TGAATGGCTG 
CTATTTTGGA 
TTATAACCCT 
CAGACCTTCT 
TCAGGCCAAT 
ACAGGCTGGT 
GTATGATTCT 
TTTTACAAAT 
GATGGGGCAT 
TGAAAGTGAT 
TGAACACAGA 
TGACAAATGG 
CAATAATAGA 
CATTCAAGAA 
TAACTCGGAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
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ATTGGTAATT ACAACCACCA AGCACTCGTT ACTGTGAACA ATTATACTGA AGATGCTATG 20-10 

GGGCTTAGCA TTTCTACAGC AGGCAATCCC GACCTCTCGT GGGAAAAGCA GTCTCAGTTC 2100 

AACTTCGGTT TGGCTGCAGG GGCTTTCAAT AATCGCTTAT CTGCAGAGGT AGATTTCTAT 2160 

GTCCGCACTA CGAATGATAT GTTGATTGAT GTCCCGATGC CTTATATCAG TGGTTTCTTC 222 0 

5 TCACAGTATC AGAATGTAGG CTCTATGAAA AATACGGGTG TAGACCTTTC TCTTAAGGGG 228 0 

ACGATCTACC AAAATAAGGA CTGGAATGTA TATGCTTCTG CGAATTTCAA CTACAATAGA 234 0 

CAGGAAATAA CAAAGCTTTT CTTCGGTCTC AATAAGTACA TGTTGCCTAA TACCGGTACT 24 00 

ATATGGGAAA TTGGGTACCC CAATTCGTTC TATATGGCTG AATATGCTGG AA.TCGACAAA 2^60 

AAAACCGGTA AGCAGTTGTG GTATGTTCCT GGTCAAGTCG ATGCGGATGG TAATAAAGTT 252 0 

10 ACAACAAGCC AGTACTCAGC TGACTTGGAG ACACGAATTG ATAAGTCTGT TACTGCTCCT 2580 

ATTACAGGTG GTTTCTCCTT AGGTGCTTCT TGGAAAGGAC TTTCTTTAGA TGCTGATTTT 26'JO 

GCCTACATCG TTGGTAAATG GATGATCAAT AATGACCGTT ACTTTACAGA GAATGCAGGT 27 00 

GGATTGATGC AATTAAATAA AGATAAAATG CTATTGAATG CCTGGACAGA GGATAATAAA 27 60 

GAAACAGATG TTCCAAAATT GGGACAGTCT CCTCAGTTTG ATACGCATTT GTTGGAGAAT 282 0 

15 GCTTCTTTCC TGCGTTTGAA GAATCTCAAA CTCACCTATG TACTCCCCAA TAGTCTTTTT 288 0 

GCTGGGCAGA ATGTGATTGG TGGAGCTCGT GTCTATTTGA TGGCGCGCAA TCTGTTAACT 294 0 

GTTACGAAGT ATAAAGGCTT TGACCCTGAA GCAGGGGGGA ATGTGGGAAA AAATCAATAT 3000 

CCTAATTCTA AGCAGTACGT TGCGGGTATT CAGTTGTCTT TC 3042 

20 

(2) TfirORMATIOII FOR SEQ ID MO: 162 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH: 2^163 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDED! JESS: double 

(D) TOPOLOGY: circular 



30 



(ii) MOLECULE TYPE: DMA (genomic) 
I i i i ) HYPOTHETICAL : NO 



(iv) AMTI-SEIISE: t(0 

35 (vi) ORIGIIJAL SOURCE: 

(A) ORGAIIISM: PORYPHYROMONAS GIllGIVALIS 



(i::) FEATURE: 

(A) IIAIIE/KEY: mi sc_f ea t ure 
40 (B) LOCATIOr; I...2463 



(:;i) SEQUENCE DESCRIPTION: SEQ ID NO: 162 



ATGPAGAAAA AGAATTTTTT GCTTCTTGGC 

45 AGCATGCAGG CACAACAGGC CAAAGATTAT 

TTCTCATTCA AAGTGCCTGA TAGGGCCGTT 

GACGAGTTTG ACCCCGTAAC CAATGAAGCC 

GCATTCCTGC GCTATGGGCT CAAGCCTACA 

GTCGAGATGT TCGACTACCG CTCAGGAGAA 

50 GAAGCCTATA .TCAGCATGAT GGAwAGAGTTC 

TCCGTCATTG GCAAGTCCGT AAAGGATCGT 

GCCAATACAG GGAAAAAGCC TCGCGTGCTC 

ACCGGATATG TGGTACTGCT CCGACTCATA 

CCGAGGATTA AGAACATTCT GGATAAAACG 

55 GACGGAGCAT ACAGAGCCGG AAACCACACC 

AATGTCGATT TGAACCGTAA CTTCAAGGAT 

CCTTGGCAGC CGGAGGCAAC TGCATTCATG 

GGTGCCAATA TACATGGAGG AACAGAGGTG 

AGACATGCAG ACGATGAGTG GTACAAACTG 

60 AGTATTTCCG CCAGCTACAT GACCTCCGAA 

TGGTATGTAA TTCGCGGAAG TCGTCAGGAC 

ATTACCCTTG AAATCAGCAA CACGAAGTTG 

AATCTGAACA AAGAATCTCT GCTTGCTCTG 

ACAGTGACTT CCGCTGCGAA CGGACAGCCT 

65 GACAAGCGCA ACTCCGATGT TTACTCCGAT 

AAAGCCGGCA CTTATACGGT GAAATACAAA 

ATTACGATCA AGGACAAAGA AACCGTCATC 

CTGCCTGTAC CCGATTTCAC AGCTTCTCCT 

TTCCAAGATC AAACGACAAA TAACCCCACG 

70 CCTGCCATGA GTACAGAGCA GAATCCGCTC 

GTTACGC']'CA AAGTGTGGAA TGCAAGTGGT 

ACTGTCAATG CCGTTATGCC TGTAGCTGAA 

GGCCAGACGG TATCTTTCCA AAACCAATCC 

GATGGCGGCA CTCCCGCTAC CAGTGAAGAC 

75 GGCCAATACG ATGTCACGCT CAAGGCGATC 



ATTTTCGTTG CTTTGCTGAC TTTCATCGGC 60 

TTCAACTTTG ACGAACGGGG CGAGGCCTAC 120 

CTACAAGAGC TGGCTCTGAT CATGTCCATC 180 

ATTGCCTATG CCAGCGAAGA GGAGTTCGAG 24 0 

TTCTTGACTC CTCCATCCAT GCAGCGCGCT 300 

AAATACGAAT GGAATGCTTA CCCCACCTAT 360 

CAAACAAAGT ATCCATCACT TTGTACTACT 4 20 

AAACTGATGA TTTGCAAGCT GACGTCCTCT 4 80 

TATACTTCTA CGATGCACGG AGACGAAACG 54 0 

GACCATCTGC TGTCGAACTA CGAATCCGAT 600 

GAAGTATGGA TCTGCCCTTT GACCAATCCG 660 

GTACAAGGAG CTACTCGCTA CAATGCCAAC 720 

GATGTAGCCG GTGATCACCC CGATGGAAAA 7 80 

GATTTGGAAG GAAACACCTC TTTCGTGCTC 84 0 

GTGAACTATC CATGGGATAA TAAAAAAGAA 900 

ATCAGTCGCA ACTACGCAGC CGCTTGTCAG 960 

ACCAATTCGG GAATCATCAA CGGTTCAGAC 1020 

AATGCAAATT ATTTCCATCG TCTGCGAGAA 1080 

GTGCCGGCCT CTCAACTTCC AAAGTATTGG 114 0 

ATCGAAGAAT CCTTATACGG CATCCATGGT 1200 

CTCAAATGCC AGATCTTGAT AGAAAACCAT 12 60 

GCTACCACAG GCTACTACGT ACGTCCTATC 1320 

GCCGAGGGTT ATCCTGAGGC AACTCGTACC 1380 

ATGGACATTG CATTGGGCAA CTCGGTTCCT 1440 

ATGACCATCT CAGTAGGCGA AAGCGTCCAA 1500 

AATTGGGAGT GGACGTTCGA AGGCGGACAG 1560 

GTATCCTATA GTCATCCCGG TCAGTACGAC 1620 

TCCAACACGA TTACGAAAGA AAAATTCATC 1680 

TTCGTCGGTA CCCCGACGGA AATAGAAGAG 17 4 0 

ACCAATGCCA CCAACTACGT ATGGATATTC 1800 

GAAAACCCGA CTGTGCTTTA CAGCAAAGCC 1860 

AGTGCTTCCG GTGAAACGGT GAAGACGAAA 1920 
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GAAAAATACA TCACTGTCAA GAAAGCTCCG GTCCCTGCTC CGGTAGCCGA CTTCGAAGGA 1980 

ACACCTCGAA AAGTAAAGAA AGGCGAGACA GTTACTTTCA AAGACTTGTC TACGAACAAT 204 0 

CCGACTTCAT GGCTTTGGGT GTTCGAAGGC GGCTCTCCTG CCACCAGCAC GGAGCAAAAC 2100 

CCGGTGGTCA CCTACAATGA AACAGGCAAG TACGATGTCC AGCTGACTGC CACCAACGAG 2160 

5 GGCGGAAGCA ATGTGAAGAA AGCAGAAGAC TACATTGAGG TTATCCTCGA TGACAGTGTC 222 0 

GAGGACATAG TGGCACAGAC GGGTATCGTC ATTCGTCCGC AAAACGGAAC G7\AGCAGATC 22 8 0 

CTCATAGAAG CCAACGCTGC TATCAAAGCG ATCGTTCTCT ATGACATCA/^. TGGACGGGTC 23'iO 

GTACTCAAAA CTACTCCGAA TCAGCTCCGC TCGACCGTAG ATCTTTCCAT CCTGCCCGAA 2^500 

GGAATCTACA CCATCAATAT CAAAACGGAA AAATCCGCTC GCACGGAAAA GATCCATATC 24 60 

10 GGG 24 63 

(2) IHFORilATIOM FOR SEQ ID lJO:163 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 

20 

(ii) MOLECULE TYPE: DIJA (genomic) 

(iii ) HYPOTHETICAL: MO 

25 (iv) AlITI-SEIISE: IIO 

(vi) ORIGIt-IAL SOURCE: 

(A) ORGAMISM: PORY PHYROMOMAS GIMGIVALIS 

30 { ix) FEATURE: 

(A) HAl'IE/KEY: misc_feature 

(B) LOCATIOIJ 1 . , , 228 



35 



40 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID 110:163 

ATGCGACTGA TCAAGGCTTT TCTCGTGCA/i. CTCTTACTGC TCCCCATTTT CTTCTACAAG 60 

CGGTTTATAT CGCCGCTTAC ACCGCCTTCA TGCCGGTTTA CCCCCTCATG TTCGTCCTAT 120 

GCCATCGAAG CCTTACGTAA ATATGGCCCG GGCAAAGGAC TATTGCTGAG CATCAAGCGT 180 

ATTCTCCGCT GTCACCCGTG GGGTGGAAGT GGCTATGACC CCGTTCCG 22 8 

(2) IHFORilATIOM FOR SEQ ID NO: 164 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 2577 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 

50 (ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) AHTI -SENSE: HO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHiROMONAS GIHGIVALIS 



(ix) FEATURE: 
60 (A) MAI-IE/KEY: misc_feature 

(B) LOCATION 1 . . ,2 577 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 

65 ATGGCATACG ACTTTACACA AACATTCCGC AACAGCCTGG AGTACAGCTA TCAGGAAGCA 60 

ACCCGTCTCG GCGTCGTAGC CGTGACGCAA GATATGCTCG TACTCGGTAT CATTCGCGAC 12 0 

GGAGACAATG GCGCGATCGA CATCATGCGG CACTATGGGA TCAACTTGTA CGAACTCAAA 180 

CGGTTGATCG AGTTGGAAGC CATCGCCGAG AGTTTGCCTG CTTCGCCTGA GGGATCGCCC 2 40 

ATCTTCACCC CTTCGGCTCG GGAGGCTATC GATGATGCCA CAGACATCTG TGCCGACATG 300 

70 GAGGACGAGG CCGTCAGCCC GGTCCATCTG TTGCTGAGTA TCCTCAACTC GACACAGGAG 360 

AGCTTAGTAC AAAAGATATT TATGAAACAA GGTATAAAAT ACGACACCAT CCTGTCGGAT 4 20 

TACTTCGGAC AGCGCAACCC CTCCGAAGGG AAGTCTCCCT CCGAAATGGA GATCCTCGAC 4 80 

GGGTACCAAG ACAACGACTT CGACGACGAA GAGGACGAAT CCTCTCCGCC TTCCGGGAAT 54 0 

AGCGGGACAG GCGGAGGCTC CGGCGACGCC CCCGAACAGA ATACCGGCGG AGGCGATACT 600 

75 ACCACCACGA CACGGAGTGG AGGCGACACG CCTGCACTGG ACACCTTCGG CACCGACATC 660 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1J_> 



wo 99/29870 



PCT/AU98/01023 



112/490 



10 



15 



20 



25 



30 



ACTGCCATGG 
AGGGTGATAC 
GGTGTAGGCA 
AGCCGTATTC 
ACCAAATATC 
AATCCGCAGA 
GCCGGATCGA 
TGCATCGGAG 
GAACGCCGCT 
CTGCAAAACA 
ATCAAAGCGG 
GCGATAGATG 
AAAGAAATCG 
GTAAAGGCTC 
CAGCAGATAG 
GTGGACGAGA 
CTGAGCACGG 
GTAGGTCAGG 
CTTCGCAATG 
AAGACCTATT 
AGGGTGGATA 
CCGGGATATG 
TATTCCGTGG 
TTACAGGTGA 
AACACCGTGA 
GGTATCGGGT 
ATCCAAAAAG 
ATCCTCTTCG 
GCCGTCTTGG 
GATGTGATAG 
CTCCAGAACG 
GGGCAGACGC 



CGGCAGCAGG 
AGATACTCAG 
AGAGTGCCAT 
TTTTCGACAA 
GCGGACAGTT 
TCATCCTCTT 
TGGATACGGC 
CCACTACGCT 
TCCAGAAGGT 
TCAAAGAGAA 
CAGTGGAACT 
CCATGGACGA 
AGATACTGGA 
AGAACTACGA 
CGGAAGAGAA 
ATGTAGTGGC 
GCGAAGGCGA 
ACACAGCCAT 
AAAAGAAACC 
TGGCC/KAG/VA 
TGAGCGAGTA 
TGGGCTATGA 
TTCTCTTGGA 
TGGACGAAGG 
TCATCATCAC 
TCCGTrCGGA 
CTCTGAACAA 
ACCAACTGGG 
CGCGCATCCA 
CGACGAAGGG 
AAGTGGAGGA 
TTACGCTCTC 



CAAGCTCGAC 
CCGGCGCAAA 
CGTGGAAGGA 
GCGGATCATC 
CGAAGAGCGG 
CATCGACGAG 
CAATATGCTC 
GGATGAGTAT 
GCCGATAGCC 
ATACGAGGAC 
GACCGATCGC 
GGCCGGCGCG 
GGCCGAATTG 
ACTGGCTGCC 
GAAAAAATGG 
GCATGTAGTG 
ACGTCTGCGC 
CGAAAAGATG 
GATCGGTTCT 
GCTCGCCGAA 
TATGGAGAAG 
AGAAGGCGGC 
TGAGATCGAA 
TCAGCTGACC 
CTCCAACGTG 
AAAAGACGAG 
GACGTTCAGC 
CAAGACGGAG 
TCGTGCCGGA 
ATACGACCTC 
TCGCCTCACG 
TGCTCGCGAT 



CCGGTAGTGG 
AAGAACAATC 
CTGGCCGAAC 
AGCCTCGATT 
TTGAAAGCCG 
ATACATACCA 
AAACCCGCTC 
CGTAAGAACA 
CCCTCGACTG 
TATCACGGTG 
TATGTATCCG 
AGCGTCCATA 
GCATCGGTGC 
TCCTTCCGCG 
GAAGAGCAGA 
GCGTTGATGA 
ACGATGGCAG 
GTGCATGCCA 
TTCCTTTTCC 
TACCTGTTCG 
TTCTCCGTTT 
CAACTGACGG 
AAGGCGCATG 
GACAGTCTGG 
GGTACACGCC 
GAAGCGAACA 
CCCGAATTTC 
ATTCGCCGGA 
TACGACCTCG 
CaATACGGAG 
GATCTTATCC 
GGCGAGATCA 



GTCGGGAGCA 
CGGTGCTCAT 
GCATCGTGAA 
TGGCTCAGAT 
TGCTCGATGA 
TCGTGGGAGC 
TTGCCCGTGG 
TAGAAAAGGA 
CAGAAGAAAC 
TACGCTATAC 
ATCGTTTCTT 
TCACCAATGT 
GAGAGAACAA 
ATCAGGAGCG 
TGTCCAAGCA 
CAGGCGTTCC 
ATGATCTCAA 
TCCAGCGCAA 
TCGGCCCCAC 
AGGATGAGAA 
CGCGTCTCGT 
AGCGCGTAAG 
CCGATGTCTT 
GACGGCGCGT 
AGCTCAAAGA 
AGGAGCATAG 
TCAACCGTTT 
TGGTGGACAT 
TCCTTACCGA 
CACGACCGCT 
TCTCCGGACA 
TCGTACAAGA 



GGAGATCGAA 
CGGCGAACCC 
CAGGAAGGTG 
GGTAGCCGGC 
GCTGAAGAAG 
AGGCTCTGCA 
ACAGGTACAG 
CGGAGCACTC 
GCTGACCATC 
GGACGAAGCG 
CCCAGATAAG 
GGTGGCTCCG 
GCTCTCGGCC 
GCGCACTCAG 
CCGCGAGACG 
GGCTGAGCGG 
GACCAAAGTA 
TCGTCTGGGA 
GGGGGTAGGC 
TGCCATGATC 
GGGTGCCCCT 
ACGCAAACCC 
CAATCTGCTC 
GAATTTCAAG 
CTTCGGGCAG 
CCGTTCCGTG 
GGACGATATC 
AGAGCTTAAA 
TGAAGCCAAG 
CAAGCGCACA 
GATCGAGAAA 
ACAAGCA 



720 
780 
840 
900 
960 
1020 
1080 
11^0 
1200 
1260 
1320 
1380 
14^0 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2 34 0 
2400 
24 60 
2520 
2577 



35 (2) in FORMAT I Oli FOR SEQ ID MO: 165 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRAMDEDMESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 
45 (iii) HYPOTHETICAL: MO 

(iv) ANTI -SENSE: MO 



50 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIMGIVALIS 



55 



60 



65 



70 



75 



(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1. . . 1251 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165 



ATGAATTACC 
ATTGCTTTCC 
ACAGGCAGAG 
GTGGGTAATA 
CCTTTGGCGC 
ACTATCATCA 
AATATGATGA 
CTGTCTAAAC 
GTGCCTACAA 
TCCGGAGAAA 
GATTTTTCCG 
GAGTTGCAAA 
ATCATTATCT 
TTTCGTGGGC 
TATGCCAATA 
GATGAACT TG 
GGTGACATTG 
ACCTATCTGG 



TGTACATACT 
TTTCTTCGGA 
CGTTAAACTT 
ATATCGTTTT 
AATGGATTGA 
TACTGTTTAC 
TGAGGGTATT 
TCTTCACCGG 
CAGTAGGGTT 
ACGAACAGAA 
GTATTCAGGT 
CGGATATTGA 
ACAGACAGAA 
AAGACTGGCA 
AACTGATGCG 
GAGGTACGGC 
AGGACGAACA 
TCAGTGGTCG 



GATTACACTT 
CAAACTGCGT 
GCTGTATCGA 
GGTAGTCTAT 
TAACGATGCT 
CGGGGAATTT 
CGCCCTCCCT 
TTTATCTCGC 
GGGGCGCGTA 
CGACTTGACT 
GCGAGACTGC 
AGTACTCAAA 
CATAGATGAC 
AAAACGTATC 
ACTACTCATG 
CGGAATGGTC 
CGACACTCGC 
TATGGAAATA 



TTACTCTCCG 
CTTGAGTTGG 
CATCCGGACC 
GGTCTGCTGA 
ATGATCGTCG 
CTACCCAAAG 
ATCGTAGCGA 
TCTTTTATTC 
GATCTCGATC 
ACCGAAGTGA 
ATGATCCCAC 
ACGACTTTTA 
GTAGTAGGAT 
AATACTACTG 
CAGCGCAAGA 
ACATTAGAGG 
AAGATCATAG 
GATGATGTGA 



GCTTTTTTTC 
ACAGGAATAG 
AACTGGTGAC 
TGGCGGGATT 
TTCTCCAATC 
CCATTTTCAA 
TCTATTATCT 
GTCTGGTGGA 
ATTATTTGGC 
AAATCATCCA 
GCAATGAGAT 
TCGATACCGG 
ATATCCATTC 
TATTCGTACC 
AAAGCATTGC 
ATTTGGTAGA 
CCAAACAGCT 
ACGAACGTTT 



CGGTGCTGAG 
AGGCGATCTC 
TACTCTCCTT 
GCTGGCCGCA 
TGTCTTATCC 
GACCAATGCC 
GCTTTATCCT 
CAAGAATTAT 
AGAAAATATG 
GAJKTGCGCTG 
GATAGCATGT 
TTTGTCCAAG 
GAGCGAAATG 
CGAAAGCATG 
GATCGTCATC 
AGAGATTTTC 
CGGCCCTCAT 
TGGGTTGTCC 



60 
120 
180 
240 
300 
360 
420 
4 80 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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TTGCCTGAGT CTGACGACTA CCTTACCGTG GCCGGATTTA TCCTGAATAG CCATCAAAAT 
ATCCCACAGG CCAATGAGGT CGTGGAGATT GCTCCTTATA CTTTTACCAT TCTCAGATCT 
TCTTCCACCA AGATCGAACT GGTGAAAATG TCCATCGACG ACCAATCGAA C 



(2) lUFORMATIOH FOR SEQ ID MO; 166 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEUGTH; 879 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDMESS : double 

( D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DIJA {genomic} 
(iii) HYPOTHETICAL: NO 



(iv) ANT I -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROHONAS GINGIVAL IS 



(ix) FEATURE: 

{A) NAIIE/KEY: misc_f eature 
(B) LOCATION 1 ... 879 



(.^:i) SEQUENCE DESCRIPTION: SEQ ID NO; 166 



ATGAAACAGA 
ATTATGCCAT 
TTGACTTTCG 
CAGGAGATAG 
GTCTATCGTG 
GATGTTCCTG 
GACAAAGTAT 
ACTGCTTCCA 
CCGGCAAGTG 
ACAGCTGAAA 
GAAGTAAAGT 
AGCTCATTGA 
ATTATTGCGG 
GTAGCCCCGA 
GTGGGGAATA 



ACTACTTCAA 
TGGAAGTAGC 
TTGCACCGGT 
TCGCTACTGA 
ATGGCGTGAA 
CCGGCGAACA 
GCGTGGACGT 
ATGACGAAGT 
ATAAAGCAGT 
CTCATTATGT 
ATCCTGACGG 
GCAATGTAGA 
AAGCCCATGG 
ATCGATTGGA 
AACACCATGT 



AAGAGTCTGC 
AGCTCAAGAG 
TGAGGAGACA 
GGAGATTGCG 
GATTGCTTCA 
TACCTACTGC 
AGAGGTGAAG 
TTCTTTGGAC 
CAGCTACAAC 
GGAGACCGGT 
TGTATCTCCG 
TGGACAGGCT 
TATGATCACG 
ATACATGGCG 
ATCGAAAATA 



TCACTGCTTT 
ATTATTCCGA 
GACGCAATAG 
GAGCAGGCTG 
GGATTGACTG 
GTAGAAGTAC 
GACTTCAAAC 
TGGGACGGTG 
GTCTACAAGA 
GTAGCCAATG 
AAGGTGGCTG 
CCTTACACAT 
CTCTACGACA 
CAAACCGGTT 
CAAGTAAGA 



GGCTGGTTTT 
ACGAAGAGGT 
AGGCAGAGGT 
TTCGTTCTTA 
AGCCCACTTT 
AGTATCAGGG 
CGGTTACCAA 
TGGAAGAGAA 
ATGGAACCTT 
GTACATACAT 
TAACCGTGAC 
TGCGAGTAGA 
TCAACGGACG 
TCTATGCAGT 



ACCCATGCTT 
GTTGGAATCA 
AGAAGCTCTG 
TACCTACACG 
TCTCGATGAA 
AGGCGTATCC 
TCTCACCGGA 
AGCTGAAGAG 
GATCGGTAAT 
CTACGAAGTG 
CGTGACCAAC 
AGGCAAGAAG 
TACCGTGGCC 
GCGCTTCGAT 



(2) INFORMATION FOR SEQ ID NO: 167 



{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 936 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANT I -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROHONAS GINGIVALIS 



(ix) FEATURE: 

(A) MAJIE/KEY: misc_feature 
( B) LOCATIOIJ 1 . . . 936 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167 



ATGATTCCTC TAAGCGAGAG TTTTGAATCA 
GCAGATGGCG ATGGCTATAA TTGGATGCAT 
TGTGTCTCTT CGGCTTCATA CATAGGCGGC 
ATAACACCCG AATTAAAACT ACCCACAGAC 
ACTCAAGATC TCACTGCTCC ATCGGAGCAC 
AATGCTGCTG ACTTTGTTAA TCTCTTATAT 
TCCCCCGAGT TGATCCGCGG AAATCGGACA 



GGTATTCCAG CTATATGGAA GACCATTGAC 

TTGACCAATT TCACGGGACA GAGTGGTCTC 

GTCGGAGCTT TGACTCCGGA CAATTATCTG 

GCGTTGGTGG AAATAATCTA TTGGGTATGT 

TATGCCGTTT ATTCCTCTTC TACAGGCAAT 

GAAGAGACTT TGACTGCCAA ACGGATACAA 

CAAGGTGTTT GGTATCAAAG AAAGGTGGTA 
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CTCCCT7\ACG ATACTAAATA TGTTGCTTTC CGCCATTTTA ATTCCACGGA TAATTTCTGG 4 80 

CTCAATTTGG ATGAAGTATC TATCCTGTAT ACCCCTCTTC CCCGAAGAGC TCCGTGTCCG 54 0 

CATCCGGGTG GTTACACTTA TTCTGTATTC CGTGATGGAC AAAAGATAGC GAGTGGATTG 600 

TCGGCATTGG CATATATCGA TACGGATGTA CCGTATGGGA CTCAAGACTA TTGTGTCCAA 660 

GTCAATTATC TGCAAGGAGA CTCGTATAAA GTCTGCAAAA ATATAGTGGT GGCAAATTCT ^20 

GCAAACATCT ATGGGGCGGA TAAGCCTTTT GCGTTGACCG TGGTTGGCAA GACCATTGTA 7 80 

GCGAGTGCTT TCAAAGGAGA GATCACTCTT TATGACATTC GTGGCCGGCT GATAGCTTCC 84 0 

GGCTGCGATA CGCTTAGGTA CAAAGCGGAA AATGGTTTTT ACCTCATTAA AATACAGGTA 900 

AACGGAACTG TCTATACTGA GAAAATCCAA ATCCAA 936 

(2) Ilf FORMATION FOR SEQ ID HO: 168 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LEMGTH: 2529 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: circular 

20 (ii) MOLECULE TYPE: DMA {genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: UO 



25 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROI lONAS GINGIVALIS 



(ix) FEATURE: 

30 (A) NAME/KEY: misc_feature 
(B) LOCATION 1. . .252 9 

(xi) SEQUEUCE DESCRIPTION: SEQ ID NO: 168 

35 ATGAAAAAAA GTTTTCTTTT AGCCATAGTA ATGCTCTTTG GCATTGCCAT GCAGGGACAT 60 

TCTGCTCCGG TTACGAAAGA GCGAGCTTTG AGTCTGGCTC GGCTGGCTTT GCGACAGGTA 120 

TCCTTGCGAA TGGGACAAAC AGCAGTATCT GACAAGATTT CCATCGATTA CGTTTATCGG 180 

CAAGGAGATG CTGAGAGGGG TATCACATCA CAAGAGGAAG GCTCTCCTGC ATATTTTTAT 240 

GTAGCTAATC GTGGAAATAA TGAGGGCTAT GCTCTTGTAG CAGCAGATGA CAGAATACCG 300 

40 ACAATTTTAG CCTATTCACC CATTGGCCGT TTCGACATGG ACAGTATGCC GGACAATCTT 3 60 

CGCATGTGGC TACAAATTTA CGATCAGGAA ATAGGCCTGA TACTTTCCGG AAAAGCTCAG 4 20 

CTCAATG/^G AGATATTACG TACCGAGGGC GTACCGGCTG AAGTACATGC TCTGATGGAT 4 80 

AACGGTCATT TTGCC7\ACGA TCCCATGCGA TGG7VATCAAG GTTACCCATG GAACAATAAG 54 0 

GAACCACTGC TTCCTAATGG CAATCATGCC TATACCGGCT GTGTTGCTAC TGCTGCAGCA 600 

45 CAAATCATGC GCTACCATAG CTGGCCGCTT CAAGGTGAAG GCTCTTTCGA TTATCATGCA 660 

GGTTCATTAG TTGGCAACTG GTCCGGCACA TTTGGTGAAA TGTACGACTG GATCAATATG 7 20 

CCCGGAAATC CCGACCTTGA TAATCTGACT CAATCTCAAG TGGATGCCTA CGCCACACTG 7 80 

ATGCGTGATG TGAGTGCATC TGTTTCGATG AGTTTTTATG AAAATGGAAG TGGTACGTAC 84 0 

AGCGTTTATG TAGTAGGAGC CTTGCGAAAC AACTTTCGCT ACAAGCGTTC ACTGCAGCTA 900 

50 CATGTACGCG CCTTATATAC CTCACAGGAG TGGCACGATA TGATCCGCGG GGAACTTGCC 960 

TCCGGAAGGC CGGTCTATTA TGCAGGGAAT AACCAGAGCA TAGGACATGC TTTCGTTTGC 102 0 

GATGGTTATG CTTCGGATGG TACTTTCCAT TTCAACTGGG GTTGGGGAGG TGTTTCCAAC 108 0 

GGCTTCTACA AACT7\ACACT CCTCTCGCCG ACTTCGTTGG GTATCGGAGG TGAGGGAATA 114 0 

GGTTTTACCA TTTATCAAGA GATCATCACC GGTATCGAAC CGGCTAAGAC TCCCGCTGAA 1200 

55 GCCGGTACAG ATGCCTTGCC GATCTTGGCA CTGAAAGACA TAGAAGCCGA GTATAAAAGT 1260 

GAATCCGGAT TGAACGTAGG GTATTCGATA TATAATACAG GTGAAGAGCA ATCAAATCTT 132 0 

GACCTCGGAT ACAGATTGAA CAAGGCTGAC GGAGAAGTCA TAGAGGTGAA AACTTCATCT 1380 

ATCAATATCT CTTGGTACGG ATACGGAGAG CATCCCGAGA GTTTCTCATT GGCACCTAAT 14 4 0 

CAGTTGTCAC 7\AGGAATCAA CACCATCACC CTACTTTATC GTCGCACAGG CACCGAACAG 1500 

60 TGGGAGCCGG TACGGCATGC ACAGGGAGGA TATGTCAATA GCATTAAAGT AAATACGACA 1560 

GACCCGAACA ATGTCGTAGT CACGGTAGAT AATAACGAAG GCAAGCTCAG TATCGTCCCC 162 0 

AACAGCTTTG TCGCAGATCT GAATTCTTAT GAACATAGTA CGATTACAGT ACAGTTCAAT 168 0 

AGCGACAGCC CTGATGAGAT CCGTACACCC GTAGCCTTTG CTCTATCTAC AGGAGCTACT 174 0 

GCGGACGATG TAATATCTTT GGGCTGGGTA ATGGCTGAAG TTCCGGGCGG TAGCAGCAAC 1800 

65 TATCCGGTGG TTTGGTCTAA AGACGTTCTC ACTCTCTCGG AAGGCGACTA TACATTGTGG 18 60 

TATAGATTTT CCATCAACAA CCAAAAGGAT GAATGGAAAA AGATCGGAAG CGTGTCAGTA 192 0 

AAAACACCGA CAGAGTATAC GCACCCCTTA TTCGAAGTGG GCCATAATCA AACTTCTACC 198 0 

TATACGCTGG ATATGGCACA CAACAGAGTA TTGCCCGACT TTACACTCAA AAATCTCGGA 2 04 0 

TTGCCTTTCA ATGGTGAGTT GGTTGTTGTT TTCCGCCAAA CACAATCCTC ATCGGGGTCT 2100 

70 TTATGGGCAG CTCAAGAAAC AGTACATATC AAGCAAGGAG AAACTTTCGT ATATAAACCT 2160 

GTTGTCGAAG GCCCTATACC TGATGGATCC TATCGTGCGA CCCTCCATGC ATTCGTAAAC 222 0 

GGACAACAAC AGTTGTACCT CAAGGGGAAA AGGAACTACA CGGTGAAGAT CGTCAATGGT 228 0 

ACAGCGGTAG AAGCAATAGA ATCGTCAGAA GAGATCAGAG TATTCCCTAA TCCGGCACGC 234 0 

GATTATGTGG AAATATCGGC ACCTTGCATT CCCCAAGAAA CATCTATCAT TCTTTTCGAT 2 4 00 

75 CTGTCAGGCA AGATTGTCAT GAAGAATAGT TTATCAGCGG GGCATGGCAG AATGGATGTC 2 4 60 
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AGC03ACTTC CTAATGGGGC CTACATCCTT AAGGTGGATG GATATACGAC GAAAATAAAT 
ATAGTGCAC 



(2) IIIFORMATIOIJ FOR SEQ ID NO: 169 



(i) SEQUEMCE CHARACTERISTICS: 

(A) LEMGTH: 870 base pairs 
{B} TYPE: nucleic acid 

(C) STRAtfDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DMA (genomic) 



(iii) HYPOTHETICAL: MO 



(iv) AIITI-SEMSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PH i ROMOUAS GIIIGIVALIS 



(ix) FEATURE: 

(A) U A] -IE/ KEY: misc_f eatur- 
iB) LOCATION 1 ... 870 



(::i) SEQUENCE DESCRI PTIOtJ : SEQ IDNO:169 



ATGAAAAAGC 
GATATAATCG 
CCGGATGACA 
GCTGAAGAAT 
GCTCTCAAGG 
TCTCTCGTGC 
AATAATACCC 
TATGATAAGA 
AAGATCAATA 
AAGGGTAAAA 
GAGGATAAGC 
AATCACGTGT 
GATTATTTGT 
TATTCGATTG 
GTAAAGATCA 



TATTTCTCTC 
ACAAGGATCA 
ATCCTTCGGA 
TTACCGGCCA 
AGCAATACGG 
CACCTCTTTT 
CTCTCCCTGC 
GCTACAAAAC 
TCTTTGCCGT 
TACTGGAGGG 
TGATCGCTCC 
TGCGTGGAGC 
ATACTTACGC 
TGGCTTTTGT 
ATCCGCAATC 



GCTCACGAGT 
AACCCTCTTG 
GATCGACATT 
AAAATGTCTC 
TAAGAGATTG 
CCGTACAGAA 
GCTGATGGTT 
GTGGGACGTG 
GGCCGAATAC 
GAATACACTC 
GCAGGTGGAT 
CGTTAATGGT 
CGTTGAACCG 
ATACGATGTG 
CGATGGCAAA 



CTTGTAATGG 
CCGGCTCCGA 
ACGCAGACGC 
AACTGTCCGA 
ACTGTTGTCG 
GCCGGAGACG 
TCGCGCAAAA 
CCTATTGCCG 
ACCGATACCC 
CCGAAGTCCA 
GGCAATACGA 
ATTTGGGGCG 
CTCTCGGGTA 
CAGACGTTCG 



TCTTCGCTGT 
CCAATGTGAC 
ACACAGAAAA 
AAGGTCATCG 
GTATACATGC 
CATATTATAG 
AGTTCGGCTC 
AGCAGATGGA 
AAAAGATCAA 
TGGTTCAGGT 
CAGTCGAGAA 
AAGAATTTGT 
TGTCCTTCGT 
AAGTGTATGA 



TGCAAGTTGC 
ACCCGATAAT 
ATATGTTTTG 
CAAACTGGCG 
CGGCCCTGGA 
CAAGTTCGCC 
TTCCTACGTT 
GCAAAAGGCG 
GGTGACTGTA 
GTATCTGTTG 
TTACGAGCAC 
GAATCTCAAA 
AGCCGAGAAT 
CGTTGTGCAT 



(2) IIIFORMATIOIJ FOR SEQ ID MO: 170 

(i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTl- SENSE: WO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAI-IE/KEY : misc_f eatur- 

(B) LOCATION 1 ... 669 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 170 

ATGAAGAAAT CAAGTGTAGT AGCCTCAGTT TTGGCCGTGG CTCTCGTGTT CGCCGGTTGC 
GGACTGAACA ATATGGCAAA AGGCGGCCTT ATCGGCGCCG GAGTAGGAGG TGCCATTGGT 
GCCGGAGTAG GTAACGTAGC CGGAAATACG GCTGTCGGTG CCATCGTCGG TACTGCAGTC 
GGTGGAGCAG CCGGTGCTCT CATCGGAAAG AAGATGGACA AGCAGAAAAA AGAACTGGAG 
GCCGCAGTAC CCGATGCTAC GATTCAGACA GTAAATGACG GAGAGGCTAT TCTGGTTACT 
TTCGATAGCG GTATCCTCTT TGCGACGAAC TCCAGCACTC TGAGTCCCAA CTCACGCACT 
GCGCTGACGA AGTTTGCTGC AAACATGAAC AAAAACCCCG ACACGGATAT TCGTATCGTA 
GGCCATACGG ACAATACCGG CTCCGACAAG ATCAACGATC CTCTGTCTGA GAGACGTGCA 
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GCCAGCGTAT ATTCTTTCCT GAATTCTCAG GGTGTGAGTA TGTCGCGCAT GGCAGCCGAlA 54 0 

GGGCGTGGGA GCCATGAACC GGTTGCAGAC T^ATAGCACAG TTGCCGGACG TTCGGCCAAC 600 

CGCCGTGTGG AGGTTTATAT CTTGCCGAAT GCCAAGATGA TCGAACAAGC ACAGCAAGGT 660 

ACGCTGAAG 669 

5 

(2) IliFORHATIOM FOR SEQ ID 110:111 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LEtlGTH: 1011 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS: double 
ID) TOPOLOGY: circular 

15 (ii) llOLECULE TYPE: DUA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AtlTI-SEMSE: MO 



20 



50 



55 



60 



65 



70 



(vi) ORIGIUAL SOURCE: 

(A) ORGAIJISM: PORY PHYROIIOMAS GiriGIVALIS 



(ix) FEATURE: 
25 (A) 13AI1E/KEY: misc_feature 

{B) LOCATIOM 1 ... 1011 

SEQirEHCE DESCRIPTION: SEQ ID ]IO:171 

30 ATGTCGAAAA AATCGATCCT TCTGCTTTGC TGTTCGCTGT GCTTCATTTC TGCTACGAAG 60 

GCTGTGACCC CCGTCAGAAA TGTGCGC7\AT AGCCAAGTGA ACAGCAAAGC AAAGACCGAA 120 

CGTACAAAGC CCTCGGACTC TGTACGGTAC ATTAGCAACA TGATTGCAGA TCGGCTGGAG 180 

TTCCGCAACA AGATTTCTTC CGAAAAAGAG GTAAGTVmAG CCG7ATATGA 7\AATCGGCTG 240 

GCGATGGAAG CACTCAATTA CCCTGCCATA GATTTATATG GTGAAGATTC TTGGAGCGAG 300 

35 TATGTAAACC CTTTCGTGGG TGCAGGAACC GATGTCGAAA TTCCGAACTC CTATGACATT 360 

GATTGCTCTT CGTTGGTGAT GCCCGTCGAA GATAAGCAGG TCACCTCTCA ATTTGGCTAC 4 20 

CGTCGGCGTT TCGGACGGAT GCACTATGGT ATTGATCTTT CAGTG7VATCG TGGCGATACG ^80 

ATACGAGCAG CCTTTGACGG GAAAGTTCGT GTACGCAGCT ATGAAGCGCG TGGCTATGGC 54 0 

TACT AC AT AG TCTTGCGCCA TCCG7VACGGA CTGGAGACTG TGTACGGACA CATGAGTCGC 600 

40 CAATTGGTAG ACGAGAATCA GATCGTTCGA GCAGGACAAC CGATCGGATT AGGAGGCAGC 660 

ACGGGTCGAA GCACCGGTCC TCATCTTCAC TTCGAGACCC GCTTCATGGG TATTCCCATC 720 

AATCCGAGTA CCATTATAGA CTTCGATAAC GGAGTGCCGC TCCGAGACAT TTACACATTC 7 80 

AAACGAGGGA GCAATTCTCG CTATGCAA7VA GCCTCTAAGA CTTCTTCTCG CTATGCAAAA 84 0 

AAAGGGAAGA 7\AGGCAGACA AGCTTCTTCT CCTATGACCT ATAGAATCAA AAAAGGCGAT 900 

45 ACTTTGGAAA CAATAGCCAA T^GGCACGGC ACTTCTGTTC AGAAACTCTG TGCTACCAAT 960 

GGCATTGGCA AGAGTAAAAT TTTGACTCCG GGCAAAGCCT TGAGGATCAA A 1011 



(2) 11! FORI 1 AT 1 01) FOR SEQ ID HO: 17 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

tii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAHISII: PORY PHYROMOMAS GINGIVAL IS 

(ix) FEATURE: 

(A) NAJIE/KEY: misc_feature 

(B) LOCATION 1 ... 453 

{>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 17 2 



ATGGCAAAAA TCAATTTCTA TGCTGAAGGC GTCAGCCTTC CTCGGATCAG AAGACGGATC 60 

GTCGGTAAGT GGATAGCCGA AGTATGCAGC CGATATGGGA AAGCGGTGGG AGAAATCTCC 120 

TATCTTTTCT GTGATGACGA ATATATCCTG AAAGCCAATC AGGAATTTCT CGATCATGAC 180 

75 TACTACACCG ACATCATCAC CTTCGATTCC TGCGAAGCGG ATACGGTGAA TGGCGACCTG 24 0 
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CTTATCAGTC TCGATACCGT ACGCTCGAAT GCCCGTGCTC TTGATCTTCG ATACGAAGAC 300 

GAACTGCATC GTGTCATTAT CCACGGCATA CTGCATCTTT GCGGATTGAA AGACAAGAGC 360 

AAAAAGGATG AAGCCCAAAT GCGTGCAGCC GAAGAGAAAG CCCTTGTCAT GCTGCGAGAA 420 

ACCATCGGAT CGGAGCTTTC CCTATTGCAT ACA 4 53 

5 

(2) IMPORMATIOU FOR SEQ ID MO: 173 

(i) SEQUEllCE CHARACTERISTICS: 
10 (A) LENGTH: 1173 base pairs 

(B) TYPE: nucl-ic acid 

(C) STRAHDEDtlESS: double 

( D) TOPOLOGY : circular 

15 (ii) MOLECULE TYPE: DIJA (genomic) 

(iii) HYPOTHETICAL: IJO 
(iv) AliTI-SEllSE: HO 



20 



50 



65 



(vi) ORIGIIIAL SOURCE: 

(A) ORG AM ISM: PORYPHYROI-IOIIAS GINGIVALIS 



(ix) FEATURE: 
25 (A) IIAIIE/KEY: misc_feature 

(B) LOCATIOM 1...117 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID 110:17 3 

30 ATGAAGGTAA AGTACTTAAT GCTCACATTG GTTGGAGCAA TTGCACTGAA CGCAAGTGCA 60 

CAGGAGAATA CTGTACCGGC AACGGGTCAG TTACCCGCTA AGAATGTTGC TTTTGCTCGC 120 

AATAAAGCAG GCAGCAATTG GTTTGTAACA CTGCAAGGCG GTGTTGCAGC GCAGTTCCTC 180 

AATGACAACA ACAACAAAGA CCTCATGGAC CGCTTAGGAG CCATAGGTTC TCTTTCTGTC 24 0 

GGAAAGTATC ACAGCCCTTT CTTTGCAACT CGTTTGCAAA TTAACGGAGG TCAAGCCCAC 300 

35 ACTTTCCTCG GAAAAAATGG CGAACAAGAA ATCAACACCA ATTTTGGTGC AGCTCACTTC 360 

GACTTTATGT TTGATGTGGT TAACTACTTT GCACCATATC GCGAAAATCG TTTCTTCCAT 4 20 

TTAATTCCAT GGGTAGGTGT TGGCTACCAA CACAAATTCA TCGGTAGCGA ATGGAGCAAA 4 80 

GACAATGTGG AATCACTGAC GGCGAATGTA GGAGTTATGA TGGCTTTCAG ATTAGGAAAG 54 0 

CGAGTAGACT TTGTGATCGA AGCACAAGCA GCTCACTCCA ATCTCAATCT AAGTCGCGCA 600 

40 TACAATGCCA AGAAAACTCC CGTATTCGAA GATCCCGCAG GACGTTATTA CAATGGATTC 660 

CAGGGGATGG CTACAGCAGG TCTTAATTTC CGCCTGGGAG CCGTAGGCTT CAATGCCATT 7 20 

GAIICCAAiTGG ACTACGCACT TATCAATGAT CTGAATGGTC AGATTAACCG TTTGCGCAGC 7 80 

GAGGTCGAAG AACTCTCAAA ACGTCCTGTA TCATGCCCCG AATGTCCTGA AGTAACTCCT 84 0 

GTTACTAAGA CAGAAAATAT ACTGACGGAA AAAGCTGTAC TGTTCCGTTT CGACAGCCAC 900 

45 GTTGTGGACA AAGATCAATT GATCAACCTG TATGACGTAG CTCAGTTTGT AAAAGAAACT 960 

AACGAGCCGA TTACCGTTGT TGGTTATGCT GATCCTACGG GTAATACTCA ATACAACGAG 1020 

AAATTGTCTG AGCGTCGGGC TAAAGCCGTT GTTGATGTTC TGACAGGTAA ATATGGTGTG 108 0 

CCTTCCGAAT TAATCTCTGT AGAATGGAAG GGCGACTCTA CGCAACCGTT CAGCAAGAAA 1140 

GCTTGGAATC GTGTTGTAAT CGTTCGCTCC AAG 1173 



{2) IlJFORl'IATIOM FOR SEQ ID IJO:174 



(i) SEQUEIJCE CHARACTERISTICS: 
55 (A) LENGTH: 1155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS : double 

( D) TOPOLOGY : circular 

60 (ii) MOLECULE TYPE: DIIA (genomic) 

(iii ) HYPOTHETICAL: MO 
(iv) ANT I -SENSE: MO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMOUAS GINGIVAL IS 



(ix) FEATURE: 
70 (A) NAME/ KEY: mic c_f sature 

(B) LOCATION i ... 1155 

(;:i) SEQUENCE DESCRIPTION: SEQ ID NO: 174 

75 ATGACATACA GAATTATGAA AGCTAAATCT TT ATT ATT AG CACTTGCGGG TCTCGCATGC 60 
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ACATTCAGTG CAACAGCCCA AGAAGCTACT ACACAG7\ACA AAGCAGGGAT GCACACCGCA 120 

TTCCAACGTG ATAAGGCCTC CGATCATTGG TTCATTGACA TTGCAGGTGG AGCAGGTATG 180 

CCTCTCTCGG GATGGAATAA TGATGTAGAC TTTGTAGATC GTCTAAGTAT CGTTCCTACT 24 0 

TTCGGTATCG GTAAATGGCA TGAGCCTTAT TTCGGTACTC GTCTCCAATT CACAGGATTC 300 

5 GACATCTATG GATTCCCGCA AGGGAGCAAG GAGCGTAACC ACAATTACTT TGGAAACGCC 360 

CACCTTGACT TCATGTTCGA TCTGACGAAC TATTTCGGTG TATACCGTCC CAATCGTGTC A20 

TTCCATATCA TCCCATGGGC AGGTATAGGA TTTGGTTATA AATTCCATAG CGAATVACGCC 4 80 

AATGGTGAAA AAGTAGGAAG TAAAGATGAT ATGACCGGAA CAGTTAATGT CGGTTTGATG 54 0 

CTGAAATTCC GCCTATCAAG AGTCGTAGAC TTCAATATTG AAGGACAAGC TTTTGCCGGA 600 

10 AAGATGAACT TTATCGGGAC AAAGAGAGGA AAAGCAGACT TCCCTGTAAT GGCTACAGCA 660 

GGTCTAACGT TCAACCTTGG CAAGACAGAG TGGACAGAAA TTGTTCCTAT GGACTATGCT 720 

TTGGTCAATG ACCTGAACAA CCAAATCAAC TCACTTCGCG GTCAAGTGGA AGAGTTGAGC 7 80 

CGTCGTCCTG TTTCATGCCC TGAATGCCCT GAGCCTACAC AGCCTACAGT TACTCGTGTA 84 0 

GTCGTTGACA ATGTGGTTTA CTTCCGTATC AATAGTGCAA AGATTGATCG TAATCAAGAA 900 

15 ATCAATGTTT ACAATACAGC TGAATATGCG AA6ACCAACA ACGCACCGAT CAAGGTAGTA 960 

GGTTACGCTG ACGAAAAAAC CGGTACTGCG GCCTATAACA TGAAGCTTTC AGAGCGTCGT 102 0 

GCAAAAGCGG TAGCCAAGAT GCTTGAAAAG TATGGTGTTT CTGCGGATCG CATTACAATT 108 0 

GAATGGAAGG GCTCATCAGA GCAAATCTAT GAAGAGAACG CTTGGAATCG TATTGTAGTA 114 0 

ATGACTGCAG CGGAA 1155 



20 



35 



55 



70 



(2) IMFORllATION TOR SEQ ID 110:175 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LEHGTH: 570 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS: double 

(D) TOPOLOGY: circular 

30 (ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: MO 
(iv) AlITI-SEMSE: NO 



(vi) ORIGINAL SOURCE: 

I A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



(ix} FEATURE: 
40 (A) NAME/KEY; niisc_f eature 

(Bl LOCATION 1 . - .57 0 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:175 

45 ATGGAATTTT TCATGTTATT CATAGCGGCG GTTTTCGTTA ATAACGTCGT GCTGTCGCAG 60 

TTCCTCGGTA TATGCCCATT CTTAGGCGTA TCGAAGAAGG TAGACACCTC AATCGGTATG 120 

GGTGCAGCCG TGACATTCGT ATTGGCACTG GCTACCTTGG TTACCTTCCT GATTCAGAAG 180 

TTCGTTTTGG ATCGTTTCGG ATTGGGCTTT ATGCAGACCA TTGCATTTAT TTTGGTCATT 24 0 

GCCGCCTTGG TGCAGATGGT GGAGATCATA CTCAAGAPAG TATCTCCTCC CCTCTATCAG 300 

50 GCACTGGGTG TATTCTTGCC CTTGATTACG ACGAACTGCT GTGTGCTCGG TGTGGCTATT 360 

TTGGTTATCC AGAAGGATTA TACCCTGCTC CAGAGCTTCG TCTATGCAAT ATCCACGGCT 4 20 

ATCGGTTTCA CCTTGGCAAT GGTTACTTTC GCAGGTATTC GAGAGCAACT CGATATGACC 4 80 

AATCTCCCCA AAGCTATGAA GGGAATACCT TCGGCACTCT TGGCTGCCGG TATATTGGCT 54 0 

ATGGCTTTCA TGGGCTTCAG CGGTATCGCC 57 0 



(2) INFORMATION FOR SEQ ID NO: 17 6 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 558 bafe pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDHESS: double 

(D) TOPOLOGY: circular 

65 (ii) JOOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANT I -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGAN I SH: PORY PHYROUONAS GINGIVALIS 



(i.-) FEATURE: 
75 (A) NAllE/KEY: misc feature 
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(B) LOCATIOIl 1 . . . 558 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:176 



10 



15 



ATGTTATTCA 
TGCCCATTCT 
ACATTCGTAT 
CGTTTCGGAT 
CAGATGGTGG 
TTCTTGCCCT 
AAGGATTATA 
TTGGCAATGG 
GCTATGAAGG 
GGCTTCAGCG 



TAGCGGCGGT 
TAGGCGTATC 
TGGCACTGGC 
TGGGCTTTAT 
AGATCATACT 
TGATTACGAG 
CCCTGCTCCA 
TTACTTTCGC 
GAATACCTTC 
GTATCGCC 



TTTCGTTAAT 
GAAGAAGGTA 
TACCTTGGTT 
GCAGACCATT 
CAAGAAAGTA 
GAACTGCTGT 
GAGCTTCGTC 
AGGTATTCGA 
GGCACTCTTG 



AACGTCGTGC 
GACACCTCAA 
ACCTTCCTGA 
GCATTTATTT 
TCTCCTCCCC 
GTGCTCGGTG 
TATGCAATAT 
GAGCAACTCG 
GCTGCCGGTA 



TGTCGCAGTT 
TCGGTATGGG 
TTCAGAAGTT 
TGGTCATTGC 
TCTATCAGGC 
TGGCTATTTT 
CCACGGCTAT 
ATATGACCAA 
TATTGGCTAT 



CCTCGGTATA 
TGCAGCCGTG 
CGTTTTGGAT 
CGCCTTGGTG 
ACTGGGTGTA 
GGTTATCCAG 
CGGTTTCACC 
TCTCCCCAAA 
GGCTTTCATG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
558 



(2) INFORMATIOH FOR SEQ ID HO: 177 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LEMGTH: 24 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: circular 

25 (ii) MOLECULE TYPE: DMA { genomic) 



(iii) HYPOTHETICAL: NO 



(iv) AMTI- SENSE: NO 

30 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHOMAS GIIIGIVALIS 



(ix) FEATURE: 
35 (A) NAME/KEY: niisc_f eature 

(B) LOCATION 1 . . .24 99 



(xi) SEQUENCE DESCRIPTION: SI 

40 ATGAAACAAC TAAACATTAT CAGCTTCATC 
TCGGCTCAGC AATCGGGCGG ATCCGTTACA 
CCTATCGCAT ACGTACAAGT ATTCGTCA7\A 
AACGGAAACT ACTCGATCAA GGGAATCCCT 
ATGGGTTACT CCACTTGCGA AGAAAAAGTA 

45 GACCTCTATC TGACCGAAGA GATTCTCTCT 
AACGAGACTT TCCGCCGTCA AGCACCCTCG 
CTCAAAACCA ACTCTACCAA CCTGAGTCAG 
GAGGACAACT GTCAGAACTG CGGTTTCAAC 
TATTCGCAAA TTCTTATCGA CAGCCATCCC 

50 TTGGAGCAGA TGCCTGCCAA TATGATCGAA 
GCTCTGTTCG GCTCTAATGC TGTGGGAGGC 
CGCAATTCGG CCGAGATCAG CCATTCTACG 
AGCTTCCAAA ATACGACCCA GTTCAACGGT 
GTCATGGTAT TCGGCCAACA CAACTACCGT 

55 ACCGAACTAC CCAATCTGCG CAACCGCTCG 
CTCTACAGCA AAGCAACCCT CGAATATCAC 
AGACTGGACA ATCCTCCTTT CGAAGCCCAG 
GGCGGAAGTT TCAAATTCGA TCAGGGCTTC 
GCTTCAGCAC AAGACGTTCA GCGTCGTAGC 

60 CTGCTGAACG GAGCAGTTCA GAGTGGAAGC 
ACGGCTCTTA CTTCCTACGG GACTACCAAG 
CGTCATACCT TCGGAGAAAA CTGGGACTTT 
CTCGATGACA GAAGCGGCTA CAGACCGAGC 
CAGTACGACC AGCTCGAATA TAAGACGGAG 

65 GACTATGTTC TCCTCAATCA GGATGGCAAA 
CCTAGAGCCA ACGTACGATA CAATCCCAAT 
GAAGGATTCC GCGCTCCTCA GTATTTCGAT 
ACTCCTATCA GCCGTGTCCT TTCCCCCAAT 
GCTTCTTTCG ATTATTACCA CAGAGCCGAC 

70 TTCTCCACCT TTATCAGCAA TCAGTTCAAA 
GGCAAAGAAT GGATCATTCG TACCATCTAC 
GGTGTGAATC TGGAGGGAAG AATCGCCTAC 
ACATGGCAGA GAAGCCGCTA CGGAAGCATC 
GGACAAGCCG AGATCTCTGT GAAAGACTAT 

75 GTTGCTACGG TACGTCCTAC CGAGCACTTC 



:Q id NO: 177 

ATTGCTTTCC TATTCTTAGG AACGAGCGCA 60 

GGTACCGTAG TGGACAAAAG CTCAAAAGAA 120 

GGAACCACTC TCGGAACTTC CACGGATGCA 180 

TCGGGTAATC AAACTATCGT AGCCCGACTC 24 0 

CATATAGAAi\ AGGGTGGTTC CCGCCACGTA 300 

CTCGATGGGG TAGTGGTATC TGCCAATAGA 360 

TTGGTAACGG TACTGTCGCC GGAACTTTTC 4 20 

GGACTTAAGT TCCAGCCCGG TCTGCGCGTG 4 80 

CAAGTTCGTA TCAATGGACT CGAAGGAGCC 54 0 

ATCTTCAGTT CGCTTGCCGG TGTCTATGGC 600 

CGTGTAGAAG TAATTCGCGG TGGAGGTTCG 660 

GTTATCAACG TAATTACGAA AGAACCGCTT 720 

ATGACCTTCG ACCACGCGAA AGGGTGGGGG 7 80 

TCTATGCTGA CGGAAGACCG C7WVGCCGGT 84 0 

CCCGGACAGG ATATAGACGG CGACAACTTT 900 

CTCGGTTTCC GCTCATACTA TAAGACCGGT 960 

AGCATGCAGG AGTACCGTCG TGGTGGCGAC 1020 

ATAGCGGAAT ATCTCCAGCA CTATATCAAT 1080 

AGCGGTGGCA AGGATTTCTT CAGTCTGTAT 114 0 

TACTACGGGG GTGGCGACTA TACCGAAAAT 1200 

ACCGAATCGG ACGAATACAA CGATGCTTTC 12 60 

GGATTCGATT TGCAAGGAGG AGGTATGTAC 1320 

ACCGGCGGAC TCGAATATAT CTACGGCCAA 1380 

AAAATAGATC AGAATACCTC TACTTTTAGT 14 4 0 

AAGTTAAGTG CCCTTATCGG AGCACGTATC 1500 

CGCTATATCG ATCCGCTCTT CATTTTCAGT 1560 

AAGAATCTCA GCTTCCGACT CTCATACAGC 162 0 

GAAGATCTGC ACGTAGAGTT GGCCGGTGGT 1680 

CTGAAAGAAG AACGTTCACG AAGCATCAGT 174 0 

GAATGGCAAT TCAATATCAT GGGAGAAGCC 1800 

CCATCCGATA AGGTCGAAAC CACGAGCGAT 1860 

AACGACAAGG ATGGAGTATC GAAGGTATAT 1920 

AACAAATCGT TCGACCTCCA GCTCGGCGGT 1980 

TATACCGCTG TGGAAGCGGA CAAAACAACG 204 0 

GTACGCACTC CGAATCTGTA CGGCTATTTC 2100 

GCCATCAS.TC TCTCCGGTAC ATTCACGGGC 2160 
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10 



20 



25 



AAAATGGATG TAGTACACGA AGCCTATGAA GGCGATATTC CCGCAGAACA CATAGCTCCG 222 0 

GACGGATCGT TCGACTTTGA AATGAATGGT CAGCAATTCA AAGGTTTGGC CGAAGGTCAT 22 80 

GCCAAGCTCG TCAAGACTCC GGCCTTCGCC GATATAGACC TCAAGCTGAG CCACGACTTC 2 34 0 

CACCTTGCTT CCACTATGAC CTTGGAATTG AATGCCGGAA TACAGAACAT ATTCAACAGC 2 4 00 

TATCAGAAAG ACACGGACA.H GGGACCGGGT AGAGCTTCTA CTTACGTATA CGGTCCTATG 2 4 60 

CAGCCCAGAA GGATTTTCGT CGGTACAT^G ATCAATTTC 24 99 

(2) IK FORMAT I OH FOR SEQ ID MO: 178 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 2673 base pairs 

(B) TYPE; nucleic acid 

(C) STRAItDEDMESS: double 
15 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: HO 

(iv) ArrTI-SEIiSE: UO 

(vi) ORIGINAL SOURCE: 

(AJ ORGAIfISM: PORYPHYROMOHAS GIHGIVALIS 



(i:0 FEATURE: 

(A) MAI-IE/ KEY: mi sc_f eature 

(B) LOCATIOJ) 1 . . . 2673 

30 Ixi) SEQUENCE DESCRIPTION: SEQ ID 110:17 8 

ATGTACAAAA AGATTATTGC CGTAGCAGCT CTCTTCTGCG CCAGCATAGG GATCCTGAAA 60 

GGACAGTCCT CGGATCTGAC CCCTCAGGAT ACT ATA TATA GCCCTGAAAT ATCCTATGCC 12 0 

AAGCCTATTC ATAAGACCAT AGCATCTATT GAGATCGAGG GAATGAGGTC TTTCGATGAC 180 

35 TTTGTCTTGC GCAATCTTTC AGGCTTGGCT GTAGGTGATG AAGTCCTGAT TCCTGGAGAT 2 40 

GCCATGTCTG CTGCCGTGAA TAGAATTATG CGTCAGGGCT ACTTCTCAAA TGTGCGAATC 300 

ATCGCGGATA AATATGTCGG CAATAAAGTC TATCTGAAAA TCATTGTCAC TGAACGTCCT 360 

CGCATCAGTA AGGTTACTTT TAGCGGGGTA AAGAAGTCTG AGAGAGAAGA TCTTGAAATG 4 20 

AAAATCGGTC TTCGCGAGGG GATTCAGATG ACCAGAAATA ATGAAGACAA GGTCAGGCAA 4 80 

40 ATCGTACAGA AGTATTTTAG TGAGAAAGGT TATCGCGATG CCAGCATACG GATAACGCAG 54 0 

GAACCGGATC TTTCCAAAGA TGGCTTTGTC AATGTGCTTA TCTCGATTGA GAAGAAAAGC 600 

AAAACCAAGG TGAATGAAAT TTATTTTTCC GGCAACAAGG CCCTTAGCAA TCATAAGCTA 660 

AGAATGGCGA TGAAGAACAC CAATGCCAAA TTCAGTCTTA G?iAAGCATAT TCGCTCATCT 7 20 

TTCTTGAAAC TTTTTAGTAC TCATAAGTTT GTGGAAGAGA GCTACCGTGA AGATTTGGTC 7 80 

45 CGATTGATAG AGAAGTATCA GGAATATGGA TATCGTGATG CTGAAATACT GACCGACAGT 84 0 

GTCGTGAAGG CTCCTGACGG CAAAAGAGTG GATATTTATC TCAACATCGA AGAGGGGCAG 900 

AAGTATTATA TTAAGGATGT CAACTTTGTG GGCAATTCAC AATATCCATC GGAGTATTTG 960 

GAACGAGTGC TCGGAATAAA ATCCGGAGAT GTGTACAATC AGAGACGATT GGCTAAGCGT 1020 

CTCAATGAAG ATGAAGATGC TGTGGGGAAC CTGTACTATA ACAATGGCTA TATTTTTGCG 1080 

50 TGGGTCGATC CCGTGGAAAC AAATGTAGTG GGGGATTCTG TTTCGCTTGA TATTCGTATA 114 0 

GCGGAGGGGA AGCAGGCCAA TATCAATAAG GTGATCATCA AAGGAAATAC TGTCGTGTAC 1200 

GAAGACGTAG TACGCCGAGA GCTTTACACA AAGCCCGGCC AGCTCTTTAG TCGCGAGGAT 12 60 

ATCATTAACT CTATTCGTCT CATCAATCAG CTTGGGCATT TCGATGCCGA AAAATCTATT 132 0 

CCCCGTCCGA TTCCCAATCC CGAAACAGGA ACAGTGGATA TAGAGTATGA TTTGGTGCCG 138 0 

55 CGTAGCAGTG ACCAATTGGA GCTTTCTGTC GGTTGGAGTC AGTCCGGACT TCTGTTCCGA 14 4 0 

GGAGCCATTA AGTTCACGAA CTTCTCTGTC GGCAACTTGC TCCATCCCTC GATGTATAAG 1500 

AAAGGGATCA TTCCGCAAGG GGATGGGCAA ACACTATCAC TGAGTGCTCA GACCAATGGA 1560 

AAGTACTATC AGCAGTATAG TGTCACATTT ATGGATCCAT GGTTTGGGGG CAAGCGGCCG 162 0 

GATATGTTCA GCTTCAGTGC ATTCTATTCC AAGACTACGG CGATTGACTC CAAGTTCTAC 1680 

60 AATAGCAATG CCGGCAACTA CTATAATGCC TACTATAATA GCTACTACAA ChACTATPJVT 17 4 0 

AGTTATTACA ACGGTATGTC G.AACTATACC GGCGACCTCT ATACTCAGGC CAGCGATCCG 18 00 

GATCGTTCGC TTCAGATGTT AGGTACTTCG ATCGGTTACG GTAAGCGTTT GACTTGGCCG 1860 

GACAATTGGT TCCAGATTTA TACTTCTCTG AACTACACCT ACTATAGACT GCGAAATTGG 1920 

AGCTACAATA CCTTCCAAAA TTTCCATCAT GGCTCGGCTA ATGATCTCAA CTTGGAGCTG 1980 

65 CGTCTCTCTC GTACTTCCAT CGATAATCCT ATTTATACCA GAAGCGGATC GGATTTCATG 204 0 

GTTTCTGTTG CTGCTACTCT TCCTTATTCT TTGTGGGACA ATCATGACTA TGCCAGCCAG 2100 

AACCTCAGCG TAAGCGATCG TTACAGATTT ATCGAGTATC ACAAGTGGAA GTTTAGAGGA 2160 

CGAGTTTTTA CTCCATTGCT CA.ATCCTGCT ACGCATAAAT ATACACCGGT GCTCATGAGT 2220 

CGAGTGGAAG GAGCAGTTCT TGGTTCGTAT A^ATTCCAATA AGAAATCTCC TTTCGGTACT 22 80 

70 TTCTATATGG GAGGTGATGG TATGTCCAGC TATTATGGTG GCTACATG7\A TGAGACTATA 234 0 

GGTTTGCGTG GTTATAAGAA CGGATCTATT GCCGGTAATA ACTACGACTA TGCATATGCT 24 00 

TATATGCGGC TTACGATGGA ACTACGTTTC CCGATTCTGT TTGAAAACTC ATTCAATGCG 24 60 

TGGCTCTTAG CTTTTGCCGA AGCAGGCAAT GCGTGGCGCA GTATCGACAA TTATAATCCC 2520 

TTTAACCTGA AGCGATCGGC CGGTGTAGGA TTGCGTGTAA CGTTACCGAT GGTCGGAATG 2580 

75 CTCGGTATCG ATTGGGGATA TGGCTTTGAC CGTCCGGACA ATTCTCTACA GCGAGGAGGA 264 0 
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AGCAATGTCC ACTTTGTGCT CGGACAGGAG TTC 2 673 

(2) lUFORMATIOli FOR SEQ ID MO: 179 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIJDEDMESS: double 
10 (D) TOPOLOGY: circular 



15 



20 



60 



70 



(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: 110 

(iv) AMTI-SEUSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMOMAS GIHGIVALIS 



Ux) FEATURE: 

(A) NAME/KEY: mi£c_f eature 

(B) LOCATION 1 . . . 522 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID?JO:179 

ATGAATGGCG ATATGAAACG GTTTTTGATT TTGATCGGCT TTGCACTGGC GGTAGCTTTC 60 

TCCGGTTTTT CCCAAAAGTT CGCTTTGGTA GATATGGAAT ATATCCTCAG GAATATTCCT 120 

GACTATGAGA TGATGA^^CGA ACAGCTGGAA CAGGTGTCCA AGAAATGGCA AAATGAAATC 180 

30 GAAGCTCTCG AAAATGAAGC CCAATCTATG TATAAGAAGT ATCAGAGCGA TCTCGTATTC 2^0 

TTGTCTGCTG CACAGAAGAA AACCCAAGAA GAGGCTATCG TAAAGAAAGA GCAGCAAGCA 300 

TCCGAGCTCA AGCGGAAGTA TTTCGGCCCG GAGGGGGAGC TGTATAAGAA ACGCTCCGAT 360 

CTGATGAAGC CTATTCAGGA TGAGATTTGG AATGCTATCA AAGAGATTGC CAAGCGTAAC 4 20 

AACTATCAGA TGGTGCTTGA TAGAGGTACG TCCGGAATTA TCTTTGCCAG TCCGTCTATT 4 80 

35 GACATTAGCG ACCTTGTACT GAGCAAGATG GGCTTTAGCA AG 522 

(2) INFORMATION FOR SEQ ID NO: 180 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 510 baee pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

45 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

50 { i V- ) AM T I - S E N S E : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROI10MAS GIHGIVALIS 

55 (ix) FEATURE: 

(A) MAI'IE/KEY: misc_f eature 

(B) LOCATION 1 ... 510 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180 



ATGAAACGGT TTTTGATTTT GATCGGCTTT GCACTGGCGG TAGCTTTCTC CGGTTTTTCC 60 

CAAAAGTTCG CTTTGGTAGA TATGGAATAT ATCCTCAGGA ATATTCCTGA CTATGAGATG 12 0 

ATGAACGAAC AGCTGGAACA GGTGTCCAAG AAATGGCAAA ATGAAATCGA AGCTCTCGAA 180 

AATGAAGCCC AATCTATGTA TAAGAAGTAT CAGAGCGATC TCGTATTCTT GTCTGCTGCA 240 

65 CAGAAGAAAA CCCAAGAAGA GGCTATCGTA AAGAAAGAGC AGCAAGCATC CGAGCTCAAG 300 

CGGAAGTATT TCGGCCCGGA GGGGGAGCTG TATAAGAAAC GCTCCGATCT GATGAAGCCT 360 

ATTCAGGATG AGATTTGGAA TGCTATCA7\A GAGATTGCCA AGCGTAACAA CTATGAGATG 4 20 

GTGCTTGATA GAGGTACGTC CGGAATTATC TTTGCCAGTC CGTCTATTGA CATTAGCGAC 4 80 

CTTGTACTGA GCAAGATGGG CTTTAGCAAG 510 



(2) INFORMATION FOR SEQ ID NO: 181 



(i) SEQUENCE CHARACTERISTICS: 
75 (A) LENGTH: 489 base pairs 
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BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



122/490 



(B) TYPE: nucleic acid 

(C) STRANDEDMESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA (genomic) 
(iii) HYPOTHETICAL: NO 



10 



15 



20 



25 



30 



35 



40 



45 



50 



(iv) AMTI-SEIISE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGAI«aSM: PORYPHYROMOHAS GIIIGIVALIS 

(ix) FEATURE: 

(A) HAIIE/KEY: mi sc_f eature 

(B) LOCATION 1 ... 489 

{;•:!) SEQUEHCE DESCRIPTIOH: SEQ ID HO: 181 



ATGAAGAAAT 
GTGGCAGTGG 
AAACAGCTCA 
TTTGCCAAAA 
AATCGTCGTC 
ATGCAGGAGG 
GCTGATGCCA 
ATGATGCTTT 
GGAATCAAG 



TTTTTCTCAT 
TAAACACTGA 
ACGAATTGGC 
AGACAGAAGA 
AGCAGGAA.CT 
ATTTGCAAAA 
TCAAGAAAGT 
ACACCGGAGC 



GCTTCTGATG 
GGAGATCATT 
CGAAAAGTAT 
ATTTGTAAAG 
TCAGGATATT 
GCGCCAACAA 
GGGTGACGAA 
TACTGCTATT 



GCTCTTCCTT 
TCCAAAATGC 
CGCCTTGATC 
GAAAAAGACT 
CAAACTCGTT 
CAGCTTTTTG 
GAAAACTGTG 
GACTTGACCG 



TGAGCCTCTT 
CGGAACAAGT 
TCAAGAGTAT 
CTCTACTGGA 
ATCAGCAGTC 
CTCCTATCCA 
CCTACATCAT 
CAAAGGTAAA 



GGCACAAAAG 
AGCTGCTACC 
GGACGATGAG 
GAACATCCGC 
ATACCAAACG 
ACAAAAGGTG 
GGAGGCCGGT 
AGCGAAACTC 



(2) IHFORt'IATION FOR SEQ ID HO: 182 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 2481 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDMESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AI-ITI-SEMSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOHAS GINGIVALIS 

(i;:) FEATURE: 

(A) NAIIE/KEY: misc_feature 

(B) LOCATION 1. . ,2481 



60 
120 
180 
240 
300 
360 
420 
480 
489 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:182 



55 



60 



65 



70 



75 



ATGAAGGAAG 
TCATTCATTC 
GTATCGGGCT 
CGTGGTACGG 
GCCACGACGG 
AGCTTTCCGT 
GAGCTTTCGA 
AATACCCGAG 
ACCTACGCAG 
AGCTACGATG 
CGCTCTGCAC 
TTCTCCGCCG 
CGCTACAAGC 
GCCTACTATG 
GCCAAATCGC 
GGACAGACAT 
ATTTCGCAAA 
AGCGATGCCA 
TACTTCGGTG 
CTTTCGGCCT 
AACGATGTGC 
AACGGCTTGG 



CTATTCCCCG 
TGCTATGCTG 
ATGTGCTCGA 
GGACAGGCGC 
ACAGCATCAC 
CTCTGACCAA 
GCGTGACGGT 
ACCTTCGTGT 
GAGTAACGCA 
AGAATATGGT 
AGCAGGAAGG 
GAGGGTTCAC 
AACCGCAGGA 
GCAGCAGTGC 
TCTTGGGCAC 
TCATGACGTA 
CTCGCTACAA 
AAAAGTTGAA 
CCTTCAGCAT 
TCAACAGTAA 
AGCTGGGGGC 
GCATCGGGCG 



AAAGAACAAG 
CCTGCTATGC 
CCGTGGGGAA 
AACGACGAAT 
GATCGAGTTC 
GGACACTCGG 
ACAGGCCACA 
CAATGCAGGG 
GAACAATGAA 
CTATGTAAAC 
TCTGAGCTTC 
GGCCGACTAT 
GAAGGAAGGA 
CGGAGCCTTC 
TACGGACACG 
CCGTTTCAGC 
GTTTGTCCCT 
GATCTTTTTC 
GAACTTCGTG 
CGAACGGGAG 
GGACGGAACT 
CAATCACGAG 



TATATAAAGC 
TCTCAGGCAG 
AAGCCGATCC 
CTGAAAGGAT 
AGCTCCATGG 
CTGAATGTTC 
AAACGCAGAC 
CCTACGGGAG 
CTAAGCTCGC 
GGAGTGGAGG 
GTCAATCCGG 
GGCGACAAGA 
GCGGTACTCC 
AGCCAAATCA 
AAAGCCGAAT 
CCCAAGCTGT 
CAGACCCGTG 
GACGGTCAGG 
CCGGACGACA 
ACCTACGATA 
GCTTCGATGG 
CATGCGCGCA 



TCAACGGTAT 
CTATGGCACA 
CGTTCGCCGG 
ACTACGAGTT 
GGTACCAAGG 
GTTTGGCAGA 
TCAACACGAT 
GGGTGGAJ^TC 
AATACTCGGT 
TTTATCGCCC 
ATCTGACACA 
TGTCCTCCGT 
TCGGGATGCT 
CGGGTGTACG 
ACGATCCGAT 
CGGTTAGTTT 
AGACGAGCTT 
AACAAGATCG 
AACAGCGGCA 
TTCAGGGAGA 
CTTCGGGCTC 
ACAGGCTGAG 



ATACAGATTG 
AGGCGTCAGG 
AGTCAAAGTG 
TCGGATGAAG 
GGTAAGTCGC 
GGCCGAGATG 
GGAGCGCGTC 
GCTCATCAGT 
TCGCGGAGGA 
GCTGCTGGTT 
ATCCGTACAG 
ACTGGATATT 
ACAATCGAGT 
CTACAAGAGT 
CTATGCGGAC 
CCTCGGCAAT 
CGGTACACTG 
TTTCCTGACC 
TACGGTTACG 
ATACTTTCTG 
AGAGAACTCC 
CTACCGCGTG 



60 
120 
180 
240 
300 
360 
420 
480 
S4 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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CTGAACATGG GTTACAGAGG GGAGATGAAG 
GTATCGGCAC AGATGGAGAA AATAGCCGAC 
GTAGGATACA ACCTACCTCA CTCGGAGACC 
GATACGCAGA TGAGGGGAAC GCGCTTGTCG 
5 ATGGGAGGAG GTACATTTTC TCTCATTCCG 
AAGGAGTTGC TCGTCAGCCC ACGTATCAGC 
TTGGTACTGC GTGCAGCCGC CGGACTTTAT 
CAGACGCATA AGGATGCCGA AGGCAATAAC 
CAGGGAGCTT TTCACATTCT CGCAGGAGCA 

10 TACAAGTTTA CGGCAGAGGC TTACTACAAG 
GAGAACGTGA AGATCCGCTA TCTGGGCGAJ\ 
GATCTCAAGC TCTTCGGCGA ACTGGTACCC 
ATAAAAGCCC GTCAGAAACT GGATGGCTAC 
TACAATTTCT CCTTCTTCCT TCAGGAGTAC 

15 CTGCGGGCTG CACTAAGCGG AGGATTGCCC 
CCGGCCTTTA CCGCACCGGC CTATAAGCGT 
GACCCGGATG ACTCCTTTGC CGGCCGAAGC 
ATAGGGGCTG ACCTCTTCAA TCTGTTCGAC 
TCGGATGCCT ACCAACAGCA ATACGCCGTA 

20 CTGCGTCTCC TCGTCGAATT C 



CTGAACGAGA AGCATCGCCT GCAAGCCGGC 1380 

CATATCAGCG AATGGGAACG GAGGGATTCG 14^0 

GTATTGCTGA TGTACAATAA CCTATATGCC 1500 

GCATTCGTAC AGGATCGATT CAACTTCAGC 1560 

GGTATCAGAG CTTCGTGGTG GAGCTTCAAC 1620 

GTGGGTTATT CTCCCGAAAG CAACCCGGCT 1680 

TATCAGGCAC CGTTTTACAA AGAGCTAAGG 17^0 

GTGGTTGTCC TCAACGAGAA GATCCGCTCT 1800 

GACTATACCT TCGAAATGGG GGGGCGAAAA 18 60 

AGCCTGTTCA ACATCAACCC GTATATAATA 1920 

AACATCGGTT CGGGTTATGC TGCGGGTATC 1980 

GGAGTGGATT CGTGGCTGAC GGCTTCCATT 204 0 

GGTTCTTTAC CACTGATGAA CGCACCCACT 2100 

GTGCCGGGCA ATAAACGCAT CACAGCCACC 2160 

CAGCTCAATC CGAGCAAAGG GCTTAGCTCG 2220 

GTCGATCTGG GGGTAATGTA CAAATGGCTC 2280 

AAATGGCTAA TGGGAGTAAA AGGGGCCTAC 2 34 0 

ATGACCAACG TCAATTCTTA CTACTGGGTG 2 4 00 

CCGAACTACC TGACACGCCG CCAATTCAAC 24 60 

2481 



(2) I! 1 FORMAT I O] J FOR SEQ ID 110:183 

25 (i) SEQUEnCE CHARACTERISTICS: 

(A) LEMGTH: 2016 base pairs 

(B) TYPE: nucleic acid 

(C) STRAllDEDHESS : double 

(D) TOPOLOGY: circular 

30 

{ii) HOLECULE TYPE: DHA (genomic) 
(iii) HYPOTHETICAL: NO 
35 (iv) AMTI-SEtJSE: HQ 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHVROMOMAS GlilGIVALIS 

40 (ix) FEATURE: 

(A) HAIIE/KEY: misc_featur= 

(B) LOCATION 1. . ,2016 



(xi) SEQUEtJCE DESCRIPTION: SI 

45 

ATGTACAGCG GACATCATAA AATCCATTAT 
TTTGCTGCCT GCAAGAGCGT GAAGTTGAAA 
TATACCAAGG CTGCCGATAT GTACAATACA 
GAGATGAAAG CTTATACGGC TTTCCGATCC 

50 GCCAAAGCTT TGCGTGGCTA TCTGAATGCC 
CTGCTCCGTT TGGCACAGAC TTATCAGCAA 
TTCCGTGGAT ATCTGGAAGC TTATCCGAAA 
TGTCTCTTTG CCCGCCAGCA AAAGGAATAT 
GAGTGGAATT CGGCACGGGG CGACTTCGGC 

55 TATTTCACAT CGAGCAGAAG CAAAGACGAC 
AAACCCAACG ACATTTATAT CATCAAACGA 
AGCGTGTCCG GAGGAATCAA CACTCCATGG 
GGTAGTACCA TATATTATAC GTTGGCGCAG 
ATCTATTCCG CCGCTCGGAG CGGAGAAGGC 

60 ATGCGCGATT CGCTCCGTAT GGCTGCTCAT 
TATTTCGTCA GCAATATAGG CGGTAGCTAT 
TCGGATCGTT CTTATGGTTC ACCGGAGAAT 
GAAATGTTTC CCTTCATAGA TGGGGATAGT 
GGTCTGGGAG GACTGGATAT TTTCAAAGCC 

65 GTCAATATGG GACAACCGGT CAATTCCTCT 
AAAGGCAAAA ACAAAGAAGA AGCTTTGCCG 
AACCGAGGCG ATGCACGCGG ATGGCCGCAC 
ACCGAGATTC AAGGTTATGT GATGGACAGA 
AGGATCGTAG GCGAACGCGG CCCCGTAGGA 

70 TCCTATAAGA TGAGCGTGCA GGGCGATACT 
TATTTGAATC AGTACGTAGA ACTCAAGACC 
GTGGACTTTT TCCTTGCATC GCGTGAGAAA 
TTCGATAAAG CTACTCTTCG CCCCGAAAGC 
CTCACGGACA ATCCGGATAT TCGGATCG.AA 

75 GATGCTTACA ACCTCGGACT ATCTGACCGC 



•.Q ID HO: 183 

CCTTTTCTTA TCCTGTTGGT ATGCCTTGCT 60 
GATGCGGAGA AGGCACATGA TCGCCAAGAG 120 
TTATACAGGC GTACCCGACG AAAGCAGGTG 180 
GGTGAAAACT ATCGTGCCGC CGGCAGACA-A 24 0 

AGACGCTACG GGTATCCGGA TTCTGTGGTA 300 
GGAGGTAACT ATAAGGAAGC CGAGGTACTC 3 60 

AGTTATTTTG CAGCTATCGG TTTGGAGGGG 4 20 

CCTACACGTT ACCGGATACG GCGAGCTGCC 4 80 

CCGGCCTATG CACCCGATGC TTCGGCTCTC 54 0 

GGTTTGGATA ATAGCAGCAT AACGGGACTG 6 00 

GATGCACAAG GACGATGGGG ACGTCCCGAT 660 
GATGAAGGCG TGCCAACGAT CACGCCCGAT 7 20 

CAAGGAGCCG ATTACGACCG TACGGTACAG 7 80 
GGTTGGAGCA ACGGTTCGCT CGTGGACATT 84 0 

CCCTCTATGT CGGCATCCGG CGATTACCTG 900 
GGCGGCAAGG ATATTTATCG TGTCAAGGTG 960 

TTGGGGCCTG ATATCAATAC GCCGGGGGAC 1020 

ACCCTTTTCT TCGCTTCGGA CGGACACGCC 108 0 

ACGCTGGACT CTACCGGCCA ATGGCATGTA 114 0 

GCCGATGATT TCGGCTTGGC TGTGGAGCCT 12 00 

GACAACGGAG TCAAAGGTGT ATTTTGTTCC 12 60 

CTCTTCCATT TCGAACTGCC GGCTATCTAC 132 0 

GAAGAAAATC CCATAGCCGG AGCCACTGTC 13 80 

CAGGGATTCG TGACTACTCG TGACGATGGC 14 4 0 

CGCTATGTA.^ TGCTTGCCGG AGCATCGGGT 1500 

GATACCGCCA AGCAGAGTGA GACCTACTAT 1560 

GCCGAGGGCT TGCAAAATAT TTTCTATGAT 1620 

ATGAAGAGCT TGGACGAACT GATTCGTATC 168 0 

TTGGGTTCGC ATGCCGACAG GAAAGGCCCC 17 4 0 

AGAGCCAAA.T CCGTGGTGGA TTACCTCACG 18 00 
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AGTCGTGGCA TAGCGGCCGA CAGGCTTACG TGGAAAGGCT ACGGTAAGTC TGTCCCCAAG 1860 

ACGGTGACAG CCAAAATTGC CGAACGGCAC GATTTCCTGA AGGAAGGCGA TGTGCTCACC 192 0 

GAGGAATTCG TAGCACCTTT GACCGAGGAG CAGCAGTCAG TCTGCGACCA ACTGAACCGT 1980 

CGTACCGAGT TCCGTGTGAT CGAAGT^AGAG TTGCGT 2 016 



10 



15 



{2} IIIFOKIIATIOM FOR SEQ ID NO: 184 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2124 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA (genomic) 
{iii } HYPOTHETICAL: NO 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



(iv) ANTI-SEHSE: HO 

( V i ) OR I G I NAL SOURCE : 

(A) ORGANISM: PORY PHYROMOtrAS GINGIVALIS 

(iM) FEATURE: 

{A) NAME/ KEY: mi sc__ feature 

(B) LOCATION 1 . . . 2124 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184 



ATGAAAAAGT 
ACGGACAACG 
TCTACACGCA 
CGC7\ACATCA 
CTCGATGTCA 
TCCGGCAAGT 
ACGCTCAACA 
TACGGCACCA 
ATCCATGGCA 
TTGGGTGGCC 
AACAAGGACT 
ATAGTAGATG 
ACGGGACGTC 
AACGTATTCC 
TCCAAAAAAA 
CACACGCTTC 
GATCCCACCG 
GACAAGATTT 
ATGGAGTCAG 
GCCACGAACA 
TCGATATCTG 
CTCAACAATG 
TATGAGTTTG 
GATGCTTTCC 
GACCTGAAAC 
TGCGGGATCC 
TTGTCCAGCC 
ATGAGCGGTA 
TCTCTCCG CG 
ACCGATGCCC 
GAATATCGTG 
ATCGAGCAAA 
GCAGCAGAAG 
TTCAATGCCT 
AACATCTTGG 
ATGGGTAAGG 



TTTTCTTCGC 
TCCCGACAGA 
CGGCCGTACC 
AGCAGTCCGG 
TACAATACCC 
ATGTAACCGT 
CGAGCAACAT 
ATGCCATGGG 
ACGTTTCTCT 
GCTTCGAGGA 
ATAAGACCGG 
TAAATGCTAC 
TGCGTTTCGG 
TCGGAGATGC 
ATCTG7VATCG 
AATTCTCCCC 
GTTTCATCAA 
CCTTTGGAGG 
AAAGATTCGA 
ATATCGGTTT 
CCGGTGCACG 
AAGCCAAACA 
TGAAAGGCCT 
AAAAAGCAGG 
CCGAAAAGTC 
AAGCCGACGT 
CTGACTATGC 
TCGAGGCCCT 
CATTTGCGAA 
CTTGGAGCGA 
GCAAAGAAGG 
ACTGGTATGC 
AGCCTGAATT 
CTGCTTACTA 
ATGAGCTTTA 
TTATGGTCAA 



GCTACTATCG 
TTCGCTACGA 
TCTGAAAAAG 
CTTTAACAAC 
GGGCTTTAGT 
ATTGGTAAAC 
CGAACAAATC 
CGGTGTGGTG 
CTTCGGCGGT 
TATTTTCTCA 
ATCAAACAAT 
CAAAAACAAG 
TATCGACTTC 
GATCCCCGTA 
TTCTTCGACC 
CTACTTCAAC 
CTACAAAAGC 
ACAAAATATC 
GCAGGCAGGA 
GTTCGGACAG 
TGCCGACTTC 
GGAAACTCAT 
TACAGCTCAT 
CCAATACGTA 
CATGACCTGG 
AACCTTAACC 
TAATAATATC 
TTTGTCTTAT 
TGCCACGATC 
AATGTACTAC 
ACTTGAAGTG 
TTACTACCCC 
GGCTGCTCAG 
CCACATGAAC 
TACGGAGAAA 
CTTC 



ATTGGTATTT 
GTACACAATC 
ATACCGGCCA 
ATGACCGACA 
TCGAACATCG 
GGCATCCCTG 
GAGATCCTCA 
AACATCATCA 
AGCTACCAGA 
TTCGATCTTA 
TTCCTATCCC 
AAAATGAAGG 
ACGCCCGAAT 
GGAGGATCTA 
TCTTTCGAGC 
ATAGAGAAAT 
GACTACTACA 
GTACTCGGTG 
GTGAATACAA 
GCCAATTTCT 
ATGTTCTTTG 
AACGTAATCA 
GGTACATTCG 
GGCCCGTTCG 
GACTTCGGTA 
TATTTCCACA 
ACCACATACA 
GACTTCGGCA 
ATGCTCAATT 
GTTCGCAAGC 
ATGCTC7\ACG 
GAAGTTCGCC 
GG ACTGCTCC 
AAGTATCTCA 
GACGGCTACC 



CAGCGCAGGC 
TTCAGACCGT 
AGATGGAACT 
TCCTCAAGAC 
GTATCCGCGG 
CGGGAACGGA 
AAGGCCCGTT 
CCCACAAATC 
CCATGGCCGG 
GTCTGGGCTT 
TGAGCAAACT 
GGAGCGACTA 
GGTCGCTGAA 
TATGGGGCGT 
TGCTCGGCAA 
CGGAGAACTA 
CCTATGGTGC 
TCGACAGCCG 
AGGCATACAA 
ACCTGCTGAA 
ACCTGAAAGC 
ATCCGAATGT 
GTAGTGCATT 
GCACGACCAT 
TCGGATACAG 
CCGACCACAA 
TCAATGCCGA 
GCCTCTTTGC 
CCG AGATGAA 
AGAACATCAC 
GTCGCTTCAT 
CCGAACTCCA 
GTCATCCGCA 
CCTTCGGTGT 
ACATGCCCGG 



TTTTGCCAAG 
CACGGTCTAT 
CATCTCATCG 
GCAAAGTTCG 
TTTCAAGCCC 
CAATATCTCT 
CTCTTCCATC 
CAAGGACAAG 
ATCATTCAAC 
GGACAAGCAG 
GGAAGAAGCT 
TACTGTAGCA 
TCTGTATCAA 
TTACGGAGAA 
ACATGGCTGC 
TAACAATGCC 
CCTACTCCAG 
AAACATGACG 
CCCCGGATAT 
CGATGCTCTA 
GAACGAGTAT 
CGGAATCAAA 
CAGTGCTCCC 
AGGCAATCCT 
CAATGCACGC 
AGATCTGATC 
CAAGGCTCGT 
CAACAAGTTC 
GAAAAGCCAG 
CTTCGGTATC 
GGGACGCAGG 
GCAACTGCTT 
AGCAATGGTG 
GAACTTGAAC 
ACGTAACATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2124 



(2) INFORMATION FOR SEQ ID HO: 185 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1306 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 
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20 



25 
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(ii) MOLECULE TYPE: DIIA (genomic) 

(iii) HYPOTHETICAL: HO 

(iv) AriTI-SEMSE: NO 

(vi) ORIGIMAL SOURCE: 

(A) ORGAH I SH: PORY PHYROMOI IAS GIMGIVALIS 

(ix) FEATURE: 

(A) MAJIE/KEY: misc_feature 

(B) LOCATIOM 1. . .138 6 

SEQUEMCE DESCRIPTION: SEQ ID IJO:185 

ATGAACAGGT TTTCAAATCA TTGGCCCTGC ATCCTCGTGG GGTTTGTACT CTGGTTTGTA 60 

TCGGCGAGTC GGACTGTGGC ACAAAACGCC TCCGAAACGA CGGTATCGTA CGATACGGAT 120 

ACCGCCGTAC TCTCCGAAGC CGATGTGCTT CGGATCGCTC TTAGTGAGAA TGCCACAGTG 180 

AAAGTGGCCG ATATGGATGT GCGCAAACAG GAATATGCAC GTAGGGCAGC ACGTGCCGAT 240 

CTCTTCCCGA AAGTAGACCT CAATGGCGTT TACAGCCATA CGCTAAAGAA GCAGGTCTTA 300 

TATATAGATA TGCCCGGTTT CAGCAGTAGC GAAGGTATCG AAATGGGGCG TACACACAAT 360 

ACGCAAGGAG GGGTGAACGT CTCCATGCCA TTGGTGTCGG CACAGCTTTG GAAAAGCATT 420 

GCCATGACCG GAGAACAGCT CGATCTGGCT CTGGAGAAAG CTCGCAGCTC CCGAATCGAT 4 80 

TTGGTGGCAG AGGTGAAGAA GGCTTACCTC AGTGTATTGT TGGCCGAGGA CTCTTATGGC 54 0 

GTATTCAAGC GCAGCTATGA CAATGCTCTG GCCAATTATA AG.AACATATC CGACAAGTTC 600 

GATCGTGGAC TTGTGGCCGA GTATGATAAG ATTCGAGCCA ATGTACAGGT ACGCAACATC 660 

GAGCCTAACC TCTTGCAAGC GCAGAACTCC GTAGCCCTTG CTCTCTGGCA GCTCAAGGTC 7 20 

CTGATGAGCA TGGAAGTGGA AACTCCGATC AGACTCTCCG GTTCATTGTC CGACTATAAA 7 80 

GAACA/i-GTCT ATACCGGCTA TTTTGCCGCC GATACGCTTA TTTCCAACAA CTCCTCCCTG 84 0 

CGTCAGCTCG ATATACAGCG TCGTCTGGCT GTCAGTGCAG ACAAGCTGAA CAAGTACAGC 900 

TTCCTGCCTA CACTCAATCT GGGAGGGCAG TACACCTATT CGCTCAACAG CAACGACATC 960 

AAATTCTGGG GCGAGGGACA ACGCTGGACG CCTTTCTCCA CCATATCGCT CAGCCTGTAC 102 0 

ATTCCTATAT TCAATGGAGG CAAACGTCTG TACAACGTGA AGCAAAGTGC TTTATCGATC 1080 

CGTCAGATCG ATCTGCAACG ACGCCACATA GAGCAATCCA TCCGAATGGG AATCAAGAAC 114 0 

CAAAATGACC GTCTGCGTAC CTGTATGCAG AGATTTGTGG CCTCGGAAGA GGCTGTCCGA 1200 

AGTGCAGAAA AGGGCTATCA GATAGCAGAG AAACGCTATC AGACAGGCGA AGGCACTCTC 12 60 

GTCGAGCTCA ACGATGCCGA TGTGGCTCTT TTGCAGGCTC GACTCAATTA TAATCAGGCC 1320 

ATATTCGACT TTATGACCGC A7\AGGCCGAA TTGGACAAGA TGAACGGCAT GGGGATTCCC 138 0 

GAACAA 13 86 



45 



50 



55 



(2) IliFORIIATIOIJ FOR SEQ ID NO: 186 

(i) SEQUEtlCE CHARACTERISTICS: 

(A) LEMGTH: 14 76 base pair^ 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) riOLECULE TYPE: DIJA (genomic) 

(iii) HYPOTHETICAL: UO 

(iv) AIITI-SEMSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMOHAS GINGIVAL IS 



(ix) FEATURE: 
60 (A) NAME/KEY: misc^feature 

(B) LOCATION 1 ... 1476 

{y.i) SEQUENCE DESCRIPTION: SEQ ID NO: 1-86 

65 ATGTGGGGGG ACAGCCATGG AGTGGCGCCG AACCAAGTGC GCCGAACGCT GGTGAAGGTA 60 

GCCTTAAGTG AATCCCTTCC TCCGGGTGCA AAACAGATTC GTATCGGATT CTCTCTTCCG 120 

AAAGAAACGG AGGAAAAAGT CACCGCCCTA TATCTCCTTG TGAGTGATTC TTTAGCGGTG 180 

CGCGACTTGC CGGACTACAA AGGGCGAGTC TCTTACGATA GCTTCCCGAT CTCAAAGGAA 240 

GATCGTACCA CAGCCCTTTC TGCGGATTCG GTAGCCGGAC GCCGCTTCTT TTATTTGGCT 300 

70 GCGGATATAG GGCCTGTTGC TTCTTTTTCC CGATCCGATA CGCTGACTGC CCGTGTGGAA 360 

GAGGTGGCTG TCGATGGCCG CCCTTTGCCG TTGAAAGAGC TGTCGCCTGC CTCCCGTCGT 4 20 

CTGTATAGGG GGTATGAGGC CCTCTTTGTA CCCGGTGATG GCGGATCGCG GAACTATCGT 4 80 

ATCCCGGCCA TTTTGAAAAC GGCTAATGGA ACACTCATAG CGATGGCCGA CAGACGAAAA 54 0 

TATAATCAGA CGGATCTGCC GGAGGATATA GATATAGTCA TGCGGCGCAG TACGGACGGA 600 

75 GGGAAATCGT GGAGCGATCC CAGGATTATC GTACAGGGAG AGGGGCGCAA TCATGGCTTT 660 
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GGCGATGTAG CCCTGGTGCA 7VACCCAAGCA GGAAAGCTCC TGATGATCTT TGTCGGTGGA 7 20 

GTAGGCCTGT GGCAGTCTAC CCCCGATCGT CCTCAGCGCA CTTATATATC GGAAAGTCGG 7 80 

GACGAAGGAC TGACTTGGTC GCCTCCTCGG GATATAACCC ATTTCATCTT CGGCAAGGAT 84 0 

TGTGCCGATC CGGGACGCAG TCGCTGGTTG GCCTCCTTTT GTGCTTCGGG ACAAGGGCTT 900 

5 GTGCTGCCAT CCGGTCGTAT CACGTTTGTG GCTGCCATCC GCGAATCAGG GCAGGAGTAC 960 

GTCCTGAACA ACTATGTCCT CTATAGCGAC GATGAGGGCG ATACATGGCA GCTTTCCGAC 102 0 

TGTGCATACC GCCGTGGCGA TGAGGCAAAG CTTTCATTGA TGCCCGATGG CAGGGTACTG 1080 

ATGAGCATAC GCAATCAGGG ACGGCAGGAG AGCCGACAGC GTTTCTTCGC TCTCTCCTCC 114 0 

GACGATGGCC TTACTTGGGA GAGAGCCAAG CAGTTCGAGG GCATCCATGA CCCCGGCTGT 1200 

10 AATGGAGCTA TGCTTCAAGT GAAAAGGAAC GGAAGGGATC AAGTGCTGCA CTCCCTGCCT 12 60 

CTCGGCCCGG ATGGGCGTCG CGATGGAGCT GTCTATCTCT TCGATCATGT CTCCGGCCGC 132 0 

TGGTCCGCTC CCGTTGTTGT CAATTCAGGA TCGAGTGCCT ACTCGGATAT GACTCTGCTG 1380 

GCGGATGGAA CGATCGGTTA TTTCGTCGAA GAGGGCGATG AGATCTCATT GGTTTTCATT 14 4 0 

CGGTTCGTCC TTGACGATCT CTTCGATGTC CGGCAA 14 76 



15 



30 



(2) IliFORIIATIOJJ FOR SEQ ID NO: 187 



( i ) SEOL'EMCE CHARACTERISTICS : 
20 {A) LENGTH: 735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANT I -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISIl: PORYPHYROMOHAS GINGIVALIS 



(ix) FEATURE: 
35 (A) NAI'lE/KEY: mi^c_f ea t ure 

(B) LOCATION 1 , . . 735 

{::i) SEQUENCE DESCRIPTION: SEQ ID NO: 187 

40 ATGAAAAAAG AAAAACTTTG GATTGCGATC GTCGCCGGTT TGGCTTTCGT ATTGGGCCTT 60 

TATGCTCTTG GCCGCAGTGT CGCTCAGCTA CGCCGCTCTC AGCCTTCGGT GACTGTGACC 120 

GGTATGGCCG AGCGTAATTT CAAATCCGAT CTGATCGTTT GGACTGCTTC GTACCAGCTC 180 

CAGATGATGG ATCTCGAATC GGCCTACAAG GCTTTGAj^GG A7\AAACAGAT ATTGGTAGCA 24 0 

GACTATTTGA A7\AACAAGCA GCTGCCCGAT TCGTCTTATA TCTTCTCAAG CGTAGCCATC 300 

45 TCTAAAGAAT ACAACTACTA TTACGATCCT CGGCAGGAAC AAAACGTCAG GACCTTTGCC 3 60 

GGGTATCTGC TCAGCCAGAC AGTTACGGTG ACCTCACAGG ACATCGAACA TGTGGAGAAA 4 20 

ATATCTCGCG ATATAACGGA GCTGATCAAT CAGGGGGTAG AGATTACCTC CGACCGTCCG 4 80 

GCCTATTACT ACACCAAGCT CAATGATCTG AAGGTGGAGA TGCTGCGCAA TGCCTCCGAA 54 0 

GACGCTTTCA ATCGTGCTTC GGTCATTGGG GAGGGGAGCG GTTCCTCCGT GGGTAAGATG 600 

50 CTATCTTCTT CGATGGGCGT GTTCCAGATA GTGGGGCTCA ACTCG7VACGA AGATTATAGC 6 60 

TGGGGAGGTT CGTTCAATAC GTCTTCCAAG ATGAAGACGG CAAGCATAAC GGTTAAGGCT 7 20 

TCTTTCGCTT TGAAG 7 35 



55 (2) INFORMATION FOR SEQ ID 110:188 

(i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS; double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

65 (iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 
70 (A) ORGANISM: PORY PHYROHONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAflE/KEY: mi£C_feature 

(B) LOCATION 1 ... 828 

75 
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(xi) SEQUENCE DESCRIPTIOH: SEQ ID MO: 188 

ATGAAAAAAA CAATTGCAAT TATCGCCTCA GCCCTCTTGG CTTTAGGAGC CGTCGGCTGT 60 

AAGAAAAATG CTGACACTAC CGCTGTCAGT GAAAAGGATA GCATAGCCTT GTCCATGGGT 120 

5 ATTTTGTACG GACAGGATTT TGCCAATCAG TTCGAAATGT CCCGCTTGCA AGGCCAGCCG 180 

ATTGATTCGG TAGCTTTCTT GGACGGTTTC AAATATGGTA TCGATACGAC GCGCTTCTCG 24 0 

TACAATCTGG GAGCCATCTA TGCTTCCAAT ATAGCTCGTC AGCTGGCTCA TGATTCCATC 300 

GATATCGACA AGTTCTATGC AGCCATGCGT GCGGCTCTTC TTAAAGACAC CGTATCTATC 360 

GCCATGAAGC CTGCAGATGC ACAGGCTTTC ATGCAACGAA TCCAAGCCAA AAAGCAGCGA '1 20 

10 GAAAACAATA TGAAGCAGTT TGGCCAGAAC ATCGAAAAGG GTAATGAATA CATCGATACC 4 80 

TTTAAAAAAG AAGATGGTGT AACTGTTACG ACAACTGGTC TGGCATACAA GACTCTTCAG 54 0 

GAAGGTACGG GAGCTACTCC CTCTTTGGCC GATACTGTAC GTGTCAAGTA TGTGGGTACT 600 

CTGGTCGATG GTAAAGAGTT CGACAAAAAC GAAGAAGGAA TCGAATTTGC CGTTACCGGT 660 

GTGATTAAAG GCTGGACGGA GATGCTCCAA CTCATGAAGG TCGGTCAGAA AGTTCGCGTG 7 20 

15 GTAATCCCAC AGGAGCTGGC TTATGGGGAG ACCGGCAACT ATACCATCGA ACCGTTCTCT 7 80 

ACCCTGACGT TCGAGATGGA ACTTATCGGG ATCAAGCCCG GGAAAAAG 828 



20 



30 



35 



( 2 ) I N FORMAT I OH FOR SEQ ID I ID : 1 8 5 



(i) SEQUEIJCE CHARACTERISTICS: 

(A) LEilGTH: 2325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDIIESS: double 
25 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AMTI-SEHSE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMOMAS GIMGIVALIS 



(ix) FEATURE: 

(A) NAI'IE/KEY: misc_featur- 

(B) LOCATION 1, . . 2 325 

40 (>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 189 

ATGAAAGTAT TACGGCAAGT ATTCCTCCCC ATCCTTTTTG TCCTACTGAC AGGTGCCTGC 60 

TCCACCACAA AGAATCTGCC GGAAGGCGAA CAGCTGTATA TCGGAATGGG CAAGACACAG 12 0 

ATACTCCGGC AGGACAAGAG CCACGCCGGC CAACAGGCTC TGACCGAAGT GGAGAGTACA 180 

45 CTGAAAGTTA CACCCAATGG AGCTATTTTC GGCAGTGCAA GTGCCTCCTT ACCC.^GATA 240 

CCATTCGGGC TATGGCTATA CAACAGCTTC GTGGGGG ATT CCACTGTCAT TTCGAAATGG 300 

ATATTCGACA AGTTTGCAGC CAAGCCGGTT TTCATCAGTC AGGTCAAATC CGATAGCCGG 360 

GCTAAGGTGG CGACGAACAT CCTCCGCGAA CACGGGTACT TCGATGCTAA AGTAAAAAGC 4 20 

AGTGTGACCA CTCTGAAAAA GGACTCGCTC AAAGCCAAAA TCTCCTATAC GGTGGATATG 4 80 

50 GCCTCTCCTT ATCATTACGA CAGCATCATT CCCTTACCGA TCAGCACTTT CCCCGACAGC 54 0 

ATTCTGGCTT ACAGGCAGAC TCCGTCTTTG ATCAGGAAAG GAGACCAGTT C7\ATTTGGCA 600 

ATi.GCTGCACG AAGAGCGTCA GACCATCAGT GCCCTGCTGA GAGACAATGG TT ACT ACT AC 660 

TTCCGCCCAC AGGATATTAT CTACGAAGCC GATACCCTCC TCGT7\AGAGG TGCCGTATGC 7 20 

CTGCGAGCCA AGCTCTCGGA AGATACTCCA CCCCAAGCCA TGCGCCCGTG GAGGATAGGG 780 

55 AAACGGACAG CAGTCCTGCT CGGAATGAAC GGAGAAAGCC CGACAGACTC GCTCGAAGTG 84 0 

GAGGATATGA AAGTCCTTTA CTATCGTAAA ATGCCGGTTC GCCCCAAGAT TTTGGCCAAA 900 

CGCTTTCGTT TCTTCTCCGG CAATCTGTAT CGGCAGAAAG ACGATGAGAC GACACGCAAA 960 

TCCTTGGCTC GTTTGGGAGC CTTCTCCGTT ATCGATCTCA ATTTTTTGCA ACGCGATTCC 1020 

ATTTCCGGCC TTTTGGATGT GCGACTGCTA ACCACCCTCG ACAAACCTTG GGATGCATCA 108 0 

60 TTAGAGACCT TGTTCACGAG CAAAAGCAAT GACTTCATCG GTCCCGGACT GAATTTTGCT 114 0 

CTTGCTCGGC GCAATGTATT CGGCGGAGGA GAAAATCTTT CTTGGAATAT CGGTGGATCG 1200 

TATGAGTGGG AGACCGGCAA TCGTCCCGAA AATAGCAGCA ATCGGCTGAT CGATATAAAT 12 60 

TCGTACAACA TGAATACGGC CGTGAACCTC TCGTTTCCCT CGATTGTATT TCCCGGTCTG 132 0 

CTGGATAAAT ACT ATT ACTA CCCCACGACT ACGACTTTTC AGGCTTCTGC CACCGCGCTG 138 0 

65 AACAGGGCAC ACTACTTTAG CATGTACTCT TTCGGCTTTT CGACCACCTA CGAATTTCAG 14 4 0 

CCCTCCAAGG 7VACACCGGCA TGCTATTTTC CCGCTCAAGC TCAACTACAA CCTCCTGGGG 1500 

CATCAGACAG AAACTTTCCA GGCCATTACG GCGAACAATC CGCCCCTGCT GCTCAGCCTT 1560 

CAGAGTCAGT TCCTTGCTCA AATGGGGTAT ATCTATACGT TCAACAAATC CGTTTCAGAG 162 0 

AAAAGTCCTC ATCATCTTTG GATGCAATTC GGACTATCCG AGGCAGGCAA TCTCCTGAAT 168 0 

70 CTGATCTATC TGGCAGCCGG CAAGAAGTAC AGCGACACCA AGAATTTCGT CGGCGTCCCC 17 4 0 

TTCTCTCAGT TCATCAAAGC CACGGGAGAA CTGCGCTATT CCTATACCAT AGACCGCAAT 18 00 

CAGTCACTGG CAACCCGTTT CGGGACAGGC GTGATATATA GCTATGGCAA TATGCGAGTG 1860 

GCACCCTATA GCGAGCAGTT CTATGTAGGC GGTGCCAATA GTATCAGAGC TTTCACCGTC 1920 

CGTAGCATCG GCCCCGGACG GTTCAATCCG GATTCCGACA ATCAGTATTC CTATTTGGAT 198 0 

75 CAGGTGGGCG AATTCAAACT CGAAGCCAAC GTGGAATATA GAGGCAAGCT TTTCGGGGAT 204 0 
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CTCCACGCAG CCGTTTTCCT CGATGCGGGC AACGTTTGGC TCTTGAGGGA GGATTCTTCC 2100 

CGTCCGGGCG GTGCTCTGTC CGAAGTGGGA TCGGTGAGCA ATTTCCTGAA TAGCATCGCT 2160 

CTCGGCACCG GTGTCGGCCT TCGCTACGAT CTGGCATTTC TCGTGGTTCG TGTCGATGTC 2220 

GGCTTCGGTC TCCACCTTCC TTACAATACG GGTAAGAAAG GTTACTACAA TATCCCACGC 22 80 

TTTAAGGATG CCATCGGTTT CCATTTGGCT GTCGGCTATC CCTTC 2325 



(2) IHFORIIATIOH FOR SEQ ID MO: 190 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LEtJGTH: 2322 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDHESS: double 

(D) TOPOLOGY: circular 

15 

(ii) MOLECULE TYPE: DHA (genomic) 
{iii} HYPOTHETICAL: NO 
20 (iv) ANTI-SEHSE: NO 



( vi ) ORIGINAL SOURCE : 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

25 iir.) FEATURE: 

(A) NAME/ KEY: miEc_f eature 

(B) LOCATION 1 . . . 2322 



ixi) SEQUENCE DESCRIPTION: SI 

30 

ATGTCCTCGC ATTCCGTTCG GTATCTAATC 
GCTTCCTCCT GCTCGGTCAC CCGTTATGTG 
ACGATCGCAA GCGAAACGGG CAGTATCGCT 
CAGCAACCCA ATTACAGACT GTTCGGGATG 

35 TCGAATCCGA ACAGCAACAG CTGGTGGAAC 

GTCCTCATCG ATTCTGTCCT CACCGATCGT 
GGCGATGGCT TTCTCGATGC TACTGCTCGT 
GCTCGCATTA CTTATCTGAT TCAGCCCGGA 
GATGTGAAGA ATCCACTCCT TCCTCCCGTT 

40 AAGGTCGGGA TCAGCGAGGG TTCTCCCTTG 
GCGATAGCTC GTCATATGCG CAACAACGGC 
TATGAAGCAG ATACTACCGT TTCAGGAGGA 
GTGGTCAATG GCATCGGGCG TTATCCATAT 
TATGATCCTC TCG7\ATCGGA CTTCAGAGTT 

45 CGTGGCGATT ACACTGTTTA CTATGGGAGT 

ACGCGGTCGG TGTCCGTTAC ACCGGGAGCT 
TATATCAAGC TGAATGCGCT CCCTATCGTT 
AATGGTAAGG ATGAGATTGC TCTGGCGGAT 
ACCGTTCCGG CCAAGAGCAA ATCGTTCGAA 

50 GACTTCGGGG CGGCTTTGTC TCTCGGTTTC 
ATGTTCAATA TCAAACTCAA GGGTGCTTAC 
ATGGAATATG GGGTGGAAAG CTCGCTCCGT 
GACGAAJ\CGC GCCGGCGGCT ACGGGCATCC 
ACACGTCCGG AGTTTGATCG GGTGATTCTC 

55 TACCTGCACA ATCGTCTGCG TCATACGATC 
CCGTACATCG ATCCCGACTT CGCCCAATCC 
ACGGAGCAGT TTATCCTCGG CTCGGCATAT 
GAGCGTACCG TATCCAATCC TTTTACGGCA 
CTGCAAGCCA TTTCTTATCT GACCGATTCT 

60 TTCGGTCTGC ACTATGCTCA GTTCGTCAAG 
CTCGAAAAGG ACAATACTTT GGCACTGCAT 
AATGCTCGCC ATATACCCTT TGAGTTACGT 
GGCTGGAGTG TCCGTACCCT CGGCCCGGGG 
TTCGATCAGA TGGGTGATAT TCGTCTGGAT 

65 TGGAAGTTTC GCGCAGCAGC TTTTGTCGAT 
GAGAATCAGG AGGACGGTCT CTTTCGTTTC 
TACGGTCTGG GGCTTCGTCT CGACTTCGAT 
AAAGCCTACG ATCCTCAGCA GACAGGGCGT 
TCTTCCAATT TCGCTTGGCA CATTGCAGTA 

70 



;Q ID NO: 190 

GGCATTGCCG GCTGCTTGCT CCTCATGCTT 60 

CCGGACGGTA GCAGACTATT AGACAGGGTA 120 

CTGCCGGAAG ATATTCGGGA CTATACCCTC 180 

ACTCGCTGGC TACTGCGCGT CTATAGCAGC 24 0 

CGTTCGCTCC GGAAAATGGG CGAACCGCCT 300 

ACTGCCAACC GTCTGGCAAA GGCGATGGCC 360 

GCCGTGGTAG ACACCGGCTT GTACAAGAAA 4 20 

AGCCGTTATT ATATACGCAA TATGGCTTTG 4 80 

GCGCTTGGCA ATTCGCTTCC TTCGGCATAC 54 0 

TCGCCCATCG TACTCGATGA AGAGAGAAAG 600 

TTCTGGAAGT TCTCCGCCGA GGATGTTTAT 660 

TCGGGTACGA AATCTGCCGA TCTGAAATTA 7 20 

CGGATCGGCA GGGTATTCTT TCATGCCGAT 7 80 

CAGGAGCTGC CACGTATCGA TTCGATTTCG 8 'JO 

AGGGGACGTT ATATCCGGGC ATCGGCTCTC 900 

TTTTTCTGCG AGGATGATGT GGAACGCTCT 960 

CGGAACGTGA ATATCCGATT TGTGGAGCAC 1020 

AGCTCTCGCC TTGTGGACTG CTATATTCTT 1080 

GCCGAAGTCC TCGGCACCAA TTCCGCTGGA 114 0 

ACCGATCGCA ATTTGTTTCG TGGGGCGGAG 1200 

GAAGCCATTC GCAAGGGTTC GCACAGCTTC 12 60 

TTCCCTCGTC TCCTCTTCCC ATTCATTTCT 132 0 

ACGGAATGGA AGATCGGGTA TAATTACCAG 1380 

TCCGCTCAAC TCAATTATTC ATGGCAGACC 14 40 

CGCCTGCTGG ATGTCGATTA TCTCCATCTC 1500 

CTTCCGCCTA CGACTGCACT GTATAACTAC 1560 

ATACTGAACT ATACCACGGC TTCGTCCATG 1620 

CGGTTCAGTA TCCAGACAGC CGGCAACCTG 1680 

CCGAAAGACG AACACGGGTT GTATAAAATG 17 4 0 

CTCGATCTCG ATCTGGCTAA AACCGTTCTT 18 00 

CTGGGTTTCG GACTGGCTTT CCCTTATGGC 18 60 

TACTTTGCCG GAGGATCGAA CAGCGTTCGC 1920 

AGTATGAAGA TGACTCCGGA CAAGACCTTC 1980 

CTGAATGTCG AATACAGGAC AAAGCTGTTC 204 0 

GCCGGCAATG TCTGGACGAT AAAGGAGTAT 2100 

GATCGCTTCT ACAAGGAAAT AGCTTTGGCC 2160 

TATTTCCTTG TGCGGCTGGA TGCCGGACTG 2220 

TACAAATGGG CTATCACACG CCCAAACCTT 2280 

GGCTATCCGT TC 2 322 



(2) INFORMATION FOR SEQ ID NO: 191 

(i) SEQUENCE CHARACTERISTICS: 
75 (A) LENGTH: 2601 base pairs 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



(B) TVPE: nucleic acid 

(C) STRAIIDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AliTI-SEIISE: MO 

(vi) ORIGINAL SOURCE: 

(A) ORGAMISM: PORY PHYROHOI IAS GiriGIVALIS 

(ixj FEATURE: 

(A) HAllE/KEY: misc_f eatuire 

(B) LOCATION 1...2 601 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191 

ATGAGAAAAA GAATTCTACA ACTTTTCCTG ACCGCATTGC TGCTGGCATT AGGCTCCTCT 6 0 

CTCGCCATAG CGCAAACAGT GGTGACCGGT AAGGTGATCG ATTCAGAAAC GTCCGAACCG 120 

CTCATCGGTG TATCCGTAAG CACCGGTCAG GGAGCATCCC TCCGCGGTGT A^\CCACCGAT 180 

ATGGATGGTG GCTTCCGATT CGAAGTACCG GCCAAATCTG TCTTGACTTT CCGTTGCGTA 24 0 

GGTTATGCTA CCGTAACTCG CTCTATAGGC AGAGGTTCTC AAGAAGACCT CGGTACGATT 300 

CTCCTCGATC CCCAGGCCAT CGGCTTGGAT GAGATTCAGG T.hATAGCCTC TGTGGTGCCC 360 

AAAGACCGTA TGACGCCGGT ACCCGTTTCC AATATCCGTG TGGCTGATAT TCAGGCAGCA ^12 0 

TCGTTGAATG TCGAATTTCC CGAACTGGTT AAATCCACTC CCTCTACCTA TACGACAAAP. 4 80 

GGAAGCGGAG GTTTCGGTGA TGGTCGTACC AATGTGCGTG GATTCGACAC TTACAACTTC 54 0 

GGTGTACTCA TCAACGGAGT TCCTGTCAAT GGTATGGAAG ACGGGAAAGT ATATTGGAGC 600 

AATTGGAGTG GTCTGATGAA TCT^AGCCAGT ACCATTCAGA TTCAGCGCGG ACTCGGAGCC 660 

TCCAAGCTCG GTATCAGCTC GGTAGGTGGT ACGATGAACA TTATCACGAA GACTACGGAC 7 20 

GCCAACACCG GAGGTTCGGC TTATGTCGGT ATGGGTAATG ATGGATTGCA CAAAGAATCG 78 0 

TTCTCCATTT CTACGGGTAT GAACGACGGT TGGGCTATCA CCATTGCAGG CTCCCATATG 84 0 

ACGGGTCTGG GTTATGTGAA GGGGCTGAAG GGACGTGCAT TCTCTTACTT CTTCAACGTT 900 

TCGAAGAAGT TCAATGAACG TCATACCCTC TCTCTTACCG GATTCGGTGC ACCACAATGG 960 

CACAACCAAC GTTCTTCCAA ATATTCTGTA GCCGACTATG ACAAATACGG CATCCGTCAC 102 0 

AATCAATCCT TCGGCTATCT GCGAGGCGAA CTGACTCCTA CGGCTTATGC TTACAATACG 1080 

TACCACAAGC CCCAGTTCTC GCTGAACCAC TTCTGGAAGA TGGATGAAAA TACCTCTCTT 1140 

TATACcgCAIJ CCTACGCATC TTTGGCTACC GGTGGAGGTC GTCGCGCTTA TGGAAAGAAC 1200 

AGTAAGTGGG TATTGATCAA CTACAACACC GGACAACCCT ATGAACAAAC AAAGGTGACT 1260 

CCCGATGGAC TTATCGACTA CGATGCCGTA CTGGCTGCCA ATGCTGCGGC GAGCAATGGC 1320 

TCGGAAGCAA TTTTTGCCCT TGGCTCCAAC TCTCACAAGT GGTTCGGTCT ACTCTCTTCA 1380 

TTCAAGAAGA AACTTAATAG TTCGCTGACT TTGACAGCCG GATACGATGG GCGTTACTAC 14 4 0 

CGTGGCGACC ACTATGACAA GATCACCGAT CTGCTCGGCG GTAGCTACTA CATAGAGGAT 1500 

CCCAAGACAA AGCTCGCATA CCATGCGGAA GGTCAGCAAC TGAAAGTGGG TGACATTGTA 1560 

AATCGGGACT ACACAGGCGA AATCATGTGG CACGGCCTCT TCGCACAGAT GGAGCATTCG 1620 

TCCGAATGGA TCGATGCATT CGTATCAGGA TCTATCAACT ACGAACTATA CCGCAATCAC 1680 

AACTATGGCG GTAGCAAGTC CACCGGCTAC CTGCCCGGCG TATCGCCGTG GAAAAGCTTC 17 4 0 

CTTCCGTGGA GTGGCAAGGC AGGTCTGAGC TACAAGTTCG CACAGGGACA CAATGTATTC 1800 

GCCAATGGCG GTTTCTTCAC ACGTGCACCA CTCTTTGGCA ATATCTATGC TGCGGGGGCT 18 60 

ATCATTCCCA ATGACAAAGC CAATATGGAA AAGGTGCTTA CAGGAGAGGT CGGCTATGGA 1920 

TTCACGAATC ACAAAAACTT CGAGTTCAAT ATCAACGGAT ACTATACGAA GTGGATGGAT 198 0 

CGCGTGACCT CGAAGAGAAT CGGAAACGAG TATGTTTATC TCAATGGCGT TGATGCTGTT 2 04 0 

CACTGTGGGG TAGAGGCTGA GGTCAGCTAT CGTCCTATTC GTCAGATCGA CCTTCGCGGT 2100 

ATGTTCTCTC TCGGTGACTG GACTTGGCAA AACAATGTAA GTTACACTTC TTACGACGAA 2160 

GCCGGCAATG AGACAGGGCA GGATATAACC TATATCAAGG GTCTTCACGT CGGAGATGCA 2 220 

GCACAGATGA CGGCTGCTGT ATCGGCAGAC ATAGAGCTGT TCAAGGGTTT CCATGTCATA 2280 

GGTAAGTACA ACTTCCTTGG CAAGAACTAT GCAGGATTCA ACCCCGCAAC GCGTAATGCA 234 0 

CAGCAGTACG AAGCGGATGG CAAAGAAATC GT&GAATCAT GGAAGTTGCC CGATGTAGGT 2 4 00 

CTGTTCGATC TGTCTGCATC CTACAATTTC AAGCTTGGTT CACTCAGCAC CACATTCTAT 24 60 

TTCAACATGG ACAACGTAGC CGACAAGCGA TATGTGAGCG ATGCCGACGA CAATATCATC 252 0 

GGTAAGAAAC ACGATGAGGC TTCGGCTCTC GTATGGTACG GTTTCGGCCG CACTTGGTCT 258 0 

ACCGGTATTC GTGTAAACTT C 2 601 



(2) IHFORllATION FOR SEQ ID MO: 192 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1293 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



75 



(ii) nOLECULE TYPE: DIIA (genomic) 
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(iii) HYPOTHETICAL: MO 

(iv) AIITI-SENSE: NO 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMOHAS GINGIVALIS 

(ix) FEATURE: 

(A) NAIIE/KEY: misc_feature 
10 (B) LOCATION 1...1293 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192 

ATGAAGTTTT CAATCCGCCT TTTCCTCTGC ATCATCTTTC TCCTCTCTGC ATTTATCCTG 60 

15 CCTGCTCTCG GACAAAAATC CAAGCAGGTA CAGCGACTTG AGAAGCAACG TAAGGAGGCC 120 

CTCAAAGCCA TCGAAAAAAC CGATCGCGAA CTACGAAATA CCAAGAAAGA CAAGCAAGAC 180 

AAACAAAAGC ATCTCAACCT CCTGAACAAG CAGGTTGCTC AACGCAAGCA GATGGTACAA 24 0 

CTCTTGGACA ATGAGGTCAA AGAGTTGCAA TCCGACATTG ATTCCATGAC GGGTGTATGT 300 

CATCAGCTCT CTGTAGAAGA GAAAGCCCGA TCCGATGAAT ATGCCCAAGC TCTACAGTCT 360 

20 ATGC/VAAAGC GGAAACGCTC GTTGGATCGC ATCCTTTTCA TTTCATCGGC CAAGAGCTTT 420 

GACGAAGGCA TGCGACGGAT GCGTTTCTTG GAACAATACG CTTCTGCATA CAAGCTGGCA ABO 

TCTGTCCGGC TGCGCGATAC ACGTAGCAAG TTGGAGACTG AACGTGCGAC TGTAGMGAC 54 0 

GCCAAAAAGG AGAAAGGACA TCTCTTAGTC ATCAGAGAAG AGGAAAAAAA GA7y\CTCGAA 600 

GGACAGCAAG CCGAGCAACG TCGGCAGGTG CAGGCTTTGG GAGCCAAACA AJVAAGACTTG 660 

25 GAAGCGCAGC TGCGAAAGCA GAAAAAGCAA GCCGAAGCTC TGAACAGAAA GATCGAGAAA 7 20 

CAGATTGCCA AGGAAATAGA AGCTGCCGAA CGTCGTGCTC GAGAAGAACG TGAACGGTTG 7 80 

GCACGCGAAG CCAAAGCCAA GGGTAAGCCG GTTCCTGCCG AACCGGAACG GAAGGCGGAG 84 0 

ACCAAAGGCG GCTATGCTAT GGATGCCTCT GAGCGTGCTC TCTCGGGCAG CTTTGCACAG 50 0 

AACAAAGGTC GCCTGCCCGG CCCCGTTCGC GGCAGATACC GAATCGTAAG CGACTTTGGC 960 

30 GTGCATCAGC ACAGTGAGCT GAAAAAAGTA CAAGTTAATA ATGGAGGTAT CGACATCGCT 102 0 

GTAGCAACAG GATCCGATGC TACCAGCGTA TTCGATGGTG TAGTGTCCAG TGTATTCGTG 1080 

ATACCCGGTT ATAATTCGGC CGTAATGGTT CGTCACGGTA ACTATATCAC GGTTTATGCG 114 0 

AATCTGAGCA AAGTGTATGT AAATTCCGGC ACTCGTGTTA AAACGGGTCA GGCTCTTGGT 12 00 

CGTGCCTATA CGGATCCTTC CAACAACCAG ACCATTATTC ACTTCGAAAT CTGGAAAGAA 12 60 

35 CGCAGCAAAC AAAACCCAAG ACTATGGTTA CGA 12 93 

(2) INFORMATION FOR SEQ ID NO: 193 

40 (i) SEQUEtlCE CHARACTERISTICS: 

{A) LENGTH: 999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

45 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL; NO 

50 (iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

55 (ix) FEATURE: 

(A) NAI^IE/KEY: misc_f eature 

(B) LOCATION 1 ... 999 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID WO: 193 



ATGAAAAAGT ATTTGTTATA TGCCTCGTTG CTAACGAGTG TTTTGCTCTT TTCCTGTTCA 60 

AAGAACAATC CTAACGAGCC GGTGGAAGAC AGATCCATCG AAATTTCTAT AAGGGTAGAT 120 

GATTTCACCA AAACGGGTGA GGCAGTACGC TATGAAAGGA ATCAAGGAAG TGCTGCCGAA 180 

AGGCTCATTA CCAATCTTTA CCTCTTGTTG TTCGATCAGT CAGGGGCGAA TCCGGCGAAA 24 0 

65 TACTATATTA CCGGTAACAC TTTCACCGGA GGGACCTGGC TTCCTGACGA TATGAAGGTG 300 

AAGTTGGATA TGACACAATC CGAGGCCGGA GAGCGCAAAG TATATGTCGT AGCCAATGTT 360 

GATAATGCGG TTAAAACGGC TCTTGATGCT GTCGCTAACG AAAGCGATTT GCAGACTGTA 4 20 

AAGAGGACGA CTGCAATGCC GTGGTCGACC GATATAGCCT CTCCTTTCCT GATGTCCGGA 4 80 

AACAAGACAC ACGACTTCTT GGCCAATCGT CTTTTGGACA ATGTGCCCCT TGTGCGTGCC 54 0 

70 ATTGCCAAGG TGGAGCTGAA TATCTCGCTG AGTGAGAAAT TTCAGATTGT GCCGATAATT 600 

GTCAATGGTA GTTTGAGTGA GTTCAAGTTC AGATACGTAA ACTTCGACAA GGAGACCTAC 660 

GTAGTGAAGC CAACGACCAA GCCGGACAAT CTCATTAGTT CTGCTAATGG TGTTTGGCCT 720 

CAGATTACAG ATTGGACTGT ATGGGGTGCT TCCTTAAATA CTTCTCCTGC TCCGGATGCG 780 

GGCACAGGTT ATACATTGGA TGCAAATGGC AAGGTAACGG CACTACGGAT TGTTACCTAT 84 0 

75 CTGAATGAGC GCGATAGCAA AGGGGCTACG GTAGAGGTCG CATTGCCTCG TGTGGATGAT 900 
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GGCACCCTTC CTCCTCCGGA ATTCGGTCCG GAGCTTTATC GTTTGCCTTT GCCGGACAAG 960 
ATCCTGCGCA ATCATTGGTA CAAGTATGAA GTCGAGATT 99 9 

5 (2) I M FORMATION FOR SEQ ID MO: 1 9*5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 945 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA {genomic) 

15 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SEl^SE: NO 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: PORYPHVROMONAS GINGIVAL IS 

(ix) FEATURE: 

(A) NAIIE/KEY: misc_feature 

(E) LOCATION 1 ... 945 



25 



45 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194 



ATGATCAG/\A CGATACTTTC ACGATATGTA TCCTCGAACT TTTGGAGTCG GGGAGCTACC 60 

TTTTTTTTCA CGATTTTCCC GGCCTTCATC CTCGCCGCTA CTGCTTTGCC GGCTTGTGGA 12 0 

30 GGGGGTACTG CTTCAGGCTC CGATCGTACG CTGGCTGTGA CCATCGAGCC ACAGAAATAC 160 

TTCATCGAGT CCATTGCGGA TAAGTCGGTG CAGGTGGTGG CATTGGTACC GGCCGGCAGC 24 0 

AATCCGGAGG AATACGACCC TTCGCCTACC GTGATGAAGC GTTTGTCCGA AGCAGATGCC 300 

TACTTCTATA TAGGAGGACT GGGGTTCGAG CAAAGAAATC TCGCTGCCAT TCGGGACAAT 360 

AACCCTAAGC TCCCTCTTTT CG7WVTGGGC AAAGCCTTGG CGGA.TGCCGG AAGTGCAGAT 4 20 

35 CTCCACGGCT CCTGCACAGA TCATTCTCAT ACAGACCTGC ATGCCCATGA TCCGCACTAT 4 80 

TGGAGCAGTG TGGTAGGGGC AAAGGCACTC AGTCGTGCTG CATACGACGC GCTTGTGGAG 54 0 

CTTTATCCGA ACGAGAAAGA CA7\ATGGGAC AAAGGGCACG ACCGTCTCAA CGGACGTATC 600 

GACAGCGTGA AGAGACTCGT CGATACCATG TTTGCCTUXTG GCAAAGCAGA CAAAGCCTTC 660 

GTCATATATC ACCCATCGCT CAGCTTTTTC GCCCAAGAGT TCGGCCTGCG GCAGATCGTC 72 0 

40 ATAGAGGAAG ATGGGAAAGA GCCTACGGCT GCCCACCTTC GTCGTGTGAT CGATCAGGCA 7 80 

CGTGCCGATG GTGTCAGAAT CGTATTTATC CAACCCGAAT TTGAAACGCG TCAGGCGGAG 84 0 

GACATCGCAC GCGAGATCGG TGCTCGTCCG GTAAGGATCA ATCCTCTGCG CAGCTCGTGG 900 

GAGGAGGAAA TTTTACATAT TGCTCGCGCT TTGGCTCATG AACGG 94 5 



(2) INFORMATION FOR SEQ ID MO: 195 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2544 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DHA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



60 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: FORYPHYROMONAS GINGIVAL IS 

(i;-;) FEATURE: 

(A) NAME/ KEY: rnisc_f eature 
65 (B) LOCATION 1...254 4 

ixi) SEQUENCE DESCRI PTIOIi : SEQ ID HO: 195 

ATGATCGGAA AAAAAATCTT TTTTATCCTG CTGGCGCTCA TTGCGTTCAG TGGGCTGAAC 60 

70 GCAGCGACAG ACACTGAGTT CAAGTACCCG ACCGATGCCA ATATCATCGG TCACGTCAAA 12 0 

GACAGCAAGA CGGGTGAACA CCTTGTCGGT ATCACTATTG CTATCAAAGG CACTACCTTT 180 

GGTACATCTA CAGATGCAAC CGGGCACTAC TATCTTCGTA ACTTGCGTCC GGGTGAGATC 24 0 

ACTTTGATTA TGCGTGGCAT GGGCTATAAG AGCCAGGAGC GCGTAGTCCG CGTAGAAAAG 300 

GACAAGACTA TCGAGGTGAA TTTCGAAGCA GAAGAGGATG CCATCAATCT GGACGAAGTC 360 

75 GTGATTTCGG CCAACCGCGA ACTGACGCTT CGCCGTCTTG CTCCTACTCT GGTAAATGTA 4 20 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



132/490 

TTGAACGAAA AAGTCTTCTC GC/V^.GTCAAT GCTTCTAACC TGGCTCAAGG CTTGTCATTC 4 80 

CAGCCGGGAG TTCGTGTAGA GAACAACTGT CAGAACTGTG GTTTCAATCA AGTTCGTATC 54 0 

AATGGACTGG ATGGTCGTTA TGCACAGATC CTCATCGACA GCCGTCCCAT CATGAGTGCC 600 

CTTGCCGGTG TTTACGGTCT GGAGCAGATC CCTGCCAATA TGATCGAACG TGTGGAGGTA 660 

O GTACGTGGTG GAGGATCGGC CTTGTACGGT TCTTCTGCTA TTGCCGGAGT GGTGAATATC 7 20 

ATCACCAAGG AACCTTCTCA CAATTCTTTC ACATTCAATG AATCTCTGAG CTTTACCGGT 7 80 

TTCAGCAAGC TGGATAACAA CACGAACTTC AATGCCTCCA TCGTCAGCGA TGACAACCGT 8AQ 

GCCGGTGCCA TGGTATTCGG GCAGGCTCGT TACCGCAACC ATTGGGATGC TAACAATGAC 900 

GGTTATTCCG AATTGGGTAA AATAGATGCC CGCTCGCTGG GAGCGCATTC TTATTTGCGC 960 

lU TTGAGCGACT ACAGCAAATT GACGGGAGAG TTTCACACGA TCAGTGAATT CCGCCGTGGT 102 0 

GGCGATCGTA TCGATTTGCC TCCTCACGTA GTGGGTGTAG CTGAACAAAC TGACCATAGC 108 0 

GTATTTAGCG GAAACTTGAA ATACGATCTC TTCTCTTCCA ACTATAAACA CCACTTCCAG 114 0 

GCTTATACTT CCGGACAGAT CGTAAATCGC AAGAGCTATT ACGGAGGTAT CGGAGAGATT 1200 

GACGTCAATG GCCACCCCGG TGGTACGGAA GGCTACCCTA TCCCTCAAGA TCAATACGGC 1260 

iO AATAATTATG GCGTGACCAA AGGCAAGACA TATATGGGCG GTATCCAGTA CAGCTACGAC 132 0 

TTGGACAAAT TCCTCCTCAT GCCTTCGCAA CTTTTGTTCG GAGCCGAATA TACGCGTGAT 138n 

GAACTCAATG ACGTGATGCC CATCCTTTCA TGGCAGACCG GCGAGGATGC CAATGGGAAT 14 4^ 

ACCATTCCCC TCTATCCCGA ATTGGATCAG AATATCAACA ACTACAGCCT ATTCGGTCAG 1500 

AACGAATGGA AA.AATGACAG ATGGAGCATC CTTGTTGGCG CTCGCTTGGA CAAGCATAGC 1560 

ZU GAAGTCAAGG ATATGATTCT GAGTCCTCGT ACCACACTGC GTTTCAACGT GAATCCGGAC 16-^0 

ATCAACCTGC GCGCTACATA TGCAAAAGGG TTCCGCGCAC CGCAGGTATT CGATGAAGAC 1680 

TTGCACGTAG GGGTTGTAGG CGGTGAGGCA CAGAAAGTAT TCAACGATCC GAACCTCAAG 17 4 0 

CCTGAAATTT CTCATGCATT CAGTTTGAGT GCCGATATGT ATCATCGTTT CGGTAACGTC 18 00 

CAGACCAACT TCCTTGTGGA AGGCTTCTAT ACTCGTTTGC TGGATGTATT CACCAACGAG I8 60 

GAGCAGCCTG ATCAGCACGA TGGCATCAAA CGCTACACGC GTATCAACGG TAGCGGAGCC 1C)90 

AAAGTATTCG GTCTCAATCT GGAAGGTAAG GTCGCATACA AGTCCTTCCA GCTCCAAGCC 198n 

GGTCTTACCC TGGCCAGCAA CAAATACGAC GAAGCACAGG AGTGGGGTCT GAATACGGTG 2 04 0 

AAAGACACCA ACGGAGCTTT TGTTACCGAG GCCAATGCAA ATGGACAACA GGAATACAAG 2100 

MCGAATCCA TGACGGATAC GCAGATCACC CGTACCCCCA GCGTATACGG TTATTTTACT 2160 

6U TTGGCCTACA ATCCTGCTCA CTCATGGAAC ATAGCCCTTA CGGGAGCATA TACCGGTCAG 222 0 

ATGTATGTAC CCCACGCTAT CGAATATGGT GTGAAGTCTG CCGAACTGGA TATTATGCAG 22 8 0 

AACAATCCTG AGATTACCGA CGAAACCGGA AAGGCTCCCC GTATTGATGA GCTGAAGAAG 234 0 

ACACCTGCAT TCTTCGATTT GGGCTTGAAA GTGGGTTATG ACTTCCACGT ATTCCAGGCT 2 4 00 

ACTGAGGTTC AACTCTATGT AGGTATGAAC AATATCTTCA ACTCTTTCCA GAAGGACTTC 24 60 

JO GATCGTGGAG CTGCACGTGA CAGCGGATAT ATCTATGGTC CTACGCAGCC GCGTACAGGC 25-? 0 

TACATGGGCT TGGTAGTGAA GTTC 254 4 



25 



40 



45 



50 



55 



60 



75 



(2) INFORMATION FOR SEQ ID MO: 196 

{i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 606 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

{ i V ) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GTNGIVALIS 

(ix) FEATURE: 

(A) NAIIE/KEY: misc_feature 

(B) LOCATION 1 . . . 606 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196 



(2) INFORMATION FOR SEQ ID NO: 197 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



ATGACAGTAA AGCGCGCAGT GCGAATAGCA CTTCTCACGC TGATAGGCAT TCTTTTTT^C 60 

TCACCTTCTC TTGTTCGGGC GCAAAGTCTT TTCAGCACCG AACATGTCTT GCAACTATAC 120 

AACAAGATAC TCTATGGAGA GTCGGCGGCG GATACCGTCG CAGAGAAAAC GGCAGGTGAG 180 

t3^> TCGGCATTTC CTTTTATAGA CAAACTCATC AATCTCGGCC GCACTTTCCT CGGCAAACCA 24 0 

TATCGCTATC GCGGTCCTTC CCCATGGCCG ATGGACTGCT CGGGCTATGT GTCTTACCTC 300 

TACTCCAAAT TCGACATCAA ACTCCCACGT GGTGCGGCAG CACAGAGCCA ATATACGAAT 360 

CCTATCGAGC GCGAGGATGT TCGTCCGGGC GACCTCCTTT TTTTCAAAGG CCGCAATGCA 4 20 

CGCAGCAACe GTATCGGGCA TGTAGCTTTG GTCGTATCTG TCGATG7iJ\GA TGATATTACC 4 80 

/U ATGATGCACA GCCGCAATTC GCGAGGGATC GTGATCGAAA AACTCAATCG CAGTGCATAC 54 0 

TTCTCCCGTC GCTTGGTGAG CTATGGCAGG GTACCCGGAG CCAAGAGAGT GATCCCACGA 600 
AAAAGT 



BNSDOCID: <WO 9929870A1_ 



wo 99/29870 



133/490 



PCT/AU98/01023 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1365 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRAIIDEDNESS : double 

(D) TOPOLOGY: circular 

{ii) MOLECULE TYPE: DHA (genomic) 

10 (iii) HYPOTHETICAL: NO 

(iv) AUTI-SEHSE: MO 

(vi) ORIGIUAL SOURCE: 
15 (A) ORGANISM: PORY PHYROMOHAS GINGIVAL IS 

(ix) FEATURE: 

(A) NAIIE/KEY: mi sc_f eature 

(B) LOCATION 1...1365 



20 



70 



(Ki) SEQUENCE DESCRIPTION: SEQ ID NO: 197 



ATGAAACGGA CAATCCTCCT GACGGCACTG ACCGTCCTAT CTTCGCTCTC CTTGCTTCGT 60 

GCACAAAATG AATCCGAAGC ATCAACCAAT CCGATGTCAG GCCTCTCCCT GGAAGACTGT 120 

25 ATCCGGATAG CCAAGGAGCG CAACCTGAAT CTGCGCAGAC AGGAGATCGA ACAAGAAAAC 180 

CGAATCATTA GTCTCGATGC AGCACGACAC AGTTTCCTGC CCTCGGTCAA TGCAGGCATC 24 0 

GGACACAACT ATAGCTTCGG ACGTTCGAAA GACAAAACGG GAGTAACCGT AGATCGCTCC 300 

TCGATGAATA CCAATCTCAG CATCGGAGCT TCGGTGGAAG TATTCAGCGG CACACGTCGT 360 

CTGCACGACC TCAAGCAGCA AAAGTACAAC GTGGAGGATG GTATAGCCCG ACTTCAAAAA 4 20 

30 GCGCGTGAAG ACCTCAGCCT GCAAATCGCG GCTCTCTATA TCAATTTGCT CTTCCGTCAG 4 80 

GAAATGACTC GTACGGCAGA AACACAGTTG GCACTGATTC GCGAGCAACG CAATCGCACG S40 

GCCG7\AATGG TTCGCGTAGG TA/^ATGGGCA GAGGGTAAGC TCCTCGACAT AAATGCCCAG 600 

ATGGCCAAGG ACGAACAACT TCTCGTACAA TATCGTTCGG AGGAGGAGCT GGCTCGTCTG 660 

GACTTGGGGC AAGCCCTCGA ACTGGAGCAC CCCGAAAGCA TTGCAGTCAA GGCTCCCGAC 720 

35 ACAGACGTTC TCGTAGCAGA AAGGTTGGGA TCTCTCCTTG CTCCCGAAGA GATCTATCGC 7 80 

ACGGCTCTCG GCTTGAAACC GGCACTGCAT TCGAGCGAGC TGCAAATAGC TTCGGCACGC 84 0 

GAAGGTCTGG CCTCGGCTCG TGCGGCATAC TTCCCGACGC TCAGCCTCTC TGCCGGATAC 900 

AGCAACGGTT ACTTCCGCGA CCTCGGCAAG GAGTATGCCG CCATCAACCC CTCCTTCTCC 960 

GAACAGTGGA AGAACAACGG CAGCTACAGT ATCGGACTCT CTTTGAATAT CCCCATCTTC 1020 

40 TCTGCCATGC AAACGCAAGA TCGCGTTCGG AGCAGTCGCC TGCAAATACG CTCAAGCGAG 1080 

CTTCGACTCG TCGAAGAGAA AAAAGCCCTC TATAAAGAGA TCAGGCAAGC ATACAGCAAT 114 0 

GCCGTGGCAG CCGATAAGGC CATCGCAGCA GCCGAAAACA GCAAGGCCGC TACGCTCAAG 1200 

GCATACGAAT ACGCTCGCGA CAGCTTCGAG GCAGGGCGCT TGTCTGCCTA CG/^TATGCC 12 60 

GAGGCAAAAA CAAAATACGC CCTCAGCCAA GTGGAAGAAC TTCGTGCCAA GTATGACTTC 1320 

45 ATATACAAAG CCAAAGTTTT GGATTTCTAT CAGGGCAAAG ACTTC 1365 

(2) IMFORMATION FOR SEQ ID NO: 198 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1332 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIIDEDNESS: double 

(D) TOPOLOGY: circular 

55 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: MO 

60 (iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHONAS GINGIVALIS 

65 (ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION 1 . . . 1332 



(xi) SEQUENCE DESCRI PTI 01 ) : SEQ ID HO: 198 



ATGCGTTTCC AACATTATCT CATCTGTACG GCTGCCGTAG CGGCTTTGGC TGCGAATCCC 60 

CTTACGGGCC AATCGAATAT GACCCTCG7\A GAGTGCATAG ACTATGCACG CCGGCACAGT 120 

TCGGCCGTGG CGCTGTCCGC TGCGGAACTG GAGCAGTCCA AGGCCGATTA CCTTCAGGCC 180 

GTCGGCAATT TTCTGCCCCG TGTATCGGCC GGAACCGGTG CTTCGTGGAA TTTCGGACGC 24 0 

75 GGATTGGATG CCGAGACGAA TACCTACACC GACATCAACA GCTTCAACAA TTCGTACAGC 300 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 



PCT/AU98/01023 



134/490 



ATACATGCCA CGATGACCCT TTTCGACGGT 
CATGCACGCC GGGAGGCTTC GCGCCTCTCC 
GGCACCACGG AGGCCTACTA CGACCTCGTC 
CAGAAGTACG AGGAGAGCAG CCGCCTCCAC 
5 ATGAAGAGTC GTCCCGATGT CCTCGAGATG 
CTGACTCAAG CGGACAATCA GTGCATCATC 
TTCCCCATCG ATGACGAACT CGTCGTAGAC 
ATGGCCGAAT CGGACAGCTC GGCCGGCGTC 
CTCCTCCGTG CCAAACTCGA CGAGCAGGCT 

10 GCATTCCTGC CGAGTGTGTC GGTATCCGGA 

AATGGATCGG ACTATACGCC CTTCAGCGAG 
AGTCTGAATC TGAGTATCCC CATCTTTTCG 
GCGCGTGCCG AACGCAGGGC GGCAATCGTC 
AGCGAGATCG CCCAAGCCAT GGCCGACCGG 

15 AAGGAGCATA CCGACGCCAT GCAAACCGCT 
GGGCTGAATA CGGCCATCGA CCTGACCACT 
CAGCGACTGA GAGCGGCCAT GACCTACCGG 
TGCCTTTCGG AC 

20 

(2) IN FORMAT I ON FOR SEQ ID MO: 199 



TTGCAGAGTG TCTATCGACT GCGGATGGCG 360 

GTTCGCGAGC AGCAGGAGCT GGCAGCTCTC 420 

TATGCGCGCC AAATGCAAGA GCTGGCCATG ^80 

CGGCAGACGG CTCGAATGGA AGAGCTGGGG 54 0 

CAGTCGCGAA TGGCCGGTGA CCGTTTGGCC 600 

GCTCTGATCC GGCTCAAAGA AAAAATGAAC 660 

GATATGCCGG CTGACAGTCT CTCCGCCGAC 72 0 

TTCGCCCGTG CTGCCCATCA TCATCCCGTC 78 0 

GCCACCGACC GTTTGCGAGC CGCGCGAGGT 84 0 

GGATGGAACA CGGGATTCTC ACGCTTTTTG 900 

CAGTTTCGGA ACCGTCGGGG GGAATACGTC 960 

GGATTCAGCC TTGTGAGCCA TCTGCGTCAG 102 0 

CGACGGGGCG AAGCGGAGCG CAGGCTCTAC 1080 

GATGCCGCTC TGGCTTCCTA CCGCCAGGCG 114 0 

TACGAAGCCG TCTTGCAGCG TTATGAGGAG 1200 

CAGGCC7\ATC GGCTCCTGGA TGCCCGTGTG 12 60 

CTCAAATGCA AACTCATAGC CTATTACGGC 132 0 

1332 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH: 2820 base pair? 
25 (B) TYPE: nucleic acid 

(C) STRAIJDEDMESS: double 

(D) TOPOLOGY: circular 

(ill MOLECULE TYPE: DMA (genomic) 

30 

(iii) HYPOTHETICAL: MO 



(iv) AIJTI- SENSE: MO 

35 (vi) ORIGIMAL SOURCE: 

(A) ORGANISM: PORYPHYROI-IOMAS GIMGIVALI3 



( i.\) FEATURE: 

(A) MAI IE/ KEY: misc_feature 
40 (B) LOCATIOM 1...2820 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 199 



ATGAACAA;i.T TTTACAAATC ACTTTTGCAG 

45 ACTGCACTGA CCGCTTCTGC ACAGATTTCG 

AGATCCGCCG GAACGCATTC ATTCGACGAT 
AATCCGGAAG ACCTGATCGC ACAGAGCCGT 
CGGATAGGAC AAGTAATACC GGTGGATGTG 
TCCATCGGAG ACGTAGATGT ATATCGCCTG 

50 ACGCTTTATT ACGATGCATT CAATATTCCG 
GACCATGAAA TTGTGTTGGG AGCATATACG 
GCCACAGAGC CGGTACCGGG GAGTGAGCTT 
ACTTTGCCTG ACATCAAGAT CTCCGGTGCG 
CCCGTAACGG ATAACCATTA CGGGATCGGT 

55 AACATCAATT GTCCTGAAGG TGCAGACTGG 

ATCATGGTAA AAGGACAGTA TATCTCAATG 
GGAGACTTTA CTCCGCTGAT CATTTCTGCC 
GGTGTAACGC AATCCGAGTT GGATAAGTGG 
TGCAGCAATG GTACATTGGC CATCTTCCGT 

60 GCTTTCCTCC CGATCAAAGG TAAATCCGAT 
CCTCTGCGCT ATCGTGTCTA TTACAATGGA 
GGTGCCGGTA TTCATCATCC GGCCGGAGAT 
CCGGCTCTGA ATACATGGAT CTCCTCCAGT 
TTCAAATACG ATCAAGGTGG TACGGAAGGA 

65 TVATAAGCACG TGGTCGGCAC ACTGACCGGA 
TACGGCAGAC TGAACAGTCA TTGGAACGAG 
GACATCTATC TGGATCCCCA AAACAATGGC 
GACGGTTATA AGCCTTTGCC CTCTGTGCCC 
GTCGAATTGA ATTGGACGGC TGTTCCTGCC 

70 TACCACATAT TCCGAAATGG AAAGGAAATA 
GCCATCGACG AAAGTATTAT CGGTAGCGGT 
ATTTATCCCT CGCCGTTGGA TGGAGTGGAA 
GACCTTGCCA TAGGAGACAT TCAGACCAAG 
GGAGGAGTAT CATTAAGCTG GAAAGTTCCT 

75 GAAAGCCCCA ATCCTGTGTT CAAAACCTTT 



TCAGGACTGG CTGCCTTCGT GTCGATGGCA 60 

TTCGGAGGGG AACCCTTGAG TTTCTCTTCA 12 0 

GCAATGACTA TCCGCCTTAC TCCGGATTTC ISO 

TGGCAATCGC AAAGAGATGG CCGGCCCGTC 2*10 

GACTTTGCAT CCAAGGCTTC GCACATCTCT 300 

CAATTCAAGT TGGAAGGAGC CAAAGCCATT 360 

GAGGGCGGAC GCCTCTATAT CTATACCCCC 4 20 

AACGCCACTC ATCGCCGCAA CGGAGCTTTT 4 80 

ATTATGGATT ATGAAGTGTC TCGCGGAGGG 54 0 

GGTTATATAT TCGACAAAGT CGGCGGACGC 600 

GAGGACGATT CCGATTCGGA TTGCGAGATC 660 

CAGGCAGAGA AGAACGGTGT GGTGCAAATG 72 0 

TGCTCAGGCA ACCTGCTCAA TAATACGAAA 7 80 

GGACACTGTG CTTCCATAAC AACCAATTTC 84 0 

ATCTTCACTT TCCACTATGA TVAAAAGAGGA 900 

GGCAACAGTA TCATCGGAGC TTCCATGAAG 960 

GGTCTCTTGC TGCAACTCAA CGATGAAGTC 1020 

TGGGACAGTA CGCCCGATAT TCCCTCGAGC 108 0 

GCCATGAAGA TTTCCATCCT AAAGAAGACT 114 0 

GGTTCCGGAG GGACTGACGA TCACTTCTAT 12 00 

GGATCGTCCG GTTCTTCTCT CTTCAATCAG 1260 

GGTGCCGGCA ATTGTGGCGG GACGGAGTTC 132 0 

TATGCATCCG ATGGCAATAC GAGCCGCATG 138 0 

CAGACGACCA TCCTCAA.CGG AACGTATCGT 14 4 0 

CGGCTATTGT TGCAGTCTAC AGGCGATCAG 1500 

GATCAATATC CATCATCTTA TCAGGTCGAA 1560 

GCTACGACAA AGGAGTTGTC CTATTCGGAT 162 0 

ATCATTCGAT ACGAAGTAAG CGCACGCTTC 168 0 

TCTTATAAGG ATACGGACAA GACTTCTGCC 17 4 0 

CTGAAGCCGG ACGTAACACC TCTCCCCGGA 18 00 

TTCTTAAGCC AGTTGGTTTC CCGATTCGGA 1860 

GAAGTGCCCT ATGTTTCTGC CGCAGCCGCA 1920 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



135/490 



10 



15 



CAAACCCCCA 
CCCGAAAAGG 
CACCTCTTCC 
TGGCAGGCCG 
CATATGCTTT 
TATGTAAGAA 
GGAGTCTCTT 
CTGGTGGTAA 
AAGGGAACGA 
ACATTTATCG 
AGCGATGAAT 
CAGATTGAGA 
ATTAAGAACG 
CGTTCTTGGA 
ACTTATATGC 



ATCCTCCCGT 
CTGCTATCGC 
TCAAGAGCAA 
GAACATGGTT 
TTGCCGGTAT 
ATCCGGATAA 
TCGAAGGCTA 
ATACCGATGC 
ATGTGGCTCC 
GCACACAGGA 
ACGAAATAAA 
ATAACAATGC 
CTCATATGGT 
ACAACCTCCG 
TCGTTATGCA 



TGGCGTAGTC 
TGCCGTTTAT 
CACAAACAGA 
GAGGATCAAT 
CAGAATGCCT 
CCTTTTCTCC 
CGGAATACCC 
TCCGAAGATC 
ATTCCCCGAA 
TCCATCCGTC 
ACTGGTATAT 
TGTCGTTGCA 
TCACGCTGCC 
CAATGGCGTG 
GACGGCAAAC 



ATTGCAGACA 
GTAATGCCAT 
AGATTGCAGA 
TTGGATAAGC 
AATAAGTACA 
ATTACCGGTA 
TCGCTCTTGG 
GATATGTCGC 
TTGGTCGGCA 
ACAACTTATT 
AAGGGATCGG 
TATCCGTCTG 
GCCCTCTACT 
ACATTCAGTG 
GGCCCTGTGA 



AGTTTATGGC 
CCGCTCCGGA 
AGGTGACAAC 
CGTTCCCGGT 
AGCTCAATCG 
AGAAGATTTC 
GCTATATGGC 
TTGTACAGGA 
TATATGTCTA 
CGGTTTCAGA 
GCATTTCGAA 
TTGTAACAGA 
CATTGGATGG 
TTCAAGGACT 
GCCAAAAGAT 



CGGTACATAT 
CTCTACTTTC 
TCCCTCCGAT 
GAATAATGAC 
TGCTATCCGT 
ATATAACAAC 
TATCAAATAT 
GCCTTATGCT 
TAAGAACGGA 
CGGAACAGAG 
TGGCGTTGCT 
TCGTTTCAGC 
CAAGCAGGTT 
TACGGCCGGT 
CGTGAAGCAG 



1980 
20*50 
2100 
2160 
2220 
2280 
234 0 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 



(2) INFORMATION FOR SEQ ID NO: 2 00 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

25 

(ii) MOLECULE TYPE: DNA {genomic) 
(iii) HYPOTHETICAL: NO 
30 (iv) AlITI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGAl-JISM: PORYPHYROHOI IAS GINGIVAL IS 

35 (ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . ,2010 



{xi} SEQUENCE DESCRIPTION: SI 

40 

ATGAAATATC TTATCAGACT CTTCTTATCA 
ACACACGAGG AGCTCTCTAT TTGCGATGGC 
GGTAAAGCCC CAAATGCTCG TGCCACAGAA 
GTAGGCTCCA TTTCTGTGCT CTTCTATTTA 

45 ACAGACTATC AAATCCATGA AGGGGCCTAT 
CTATTCAATG GCAAC7\ACAA CTTCAGCATC 
CCGGCCACAG AAGCTGCGCT TTCTCAATTT 
ACGACAGCCC CTGCCGATTT CGTAATGCTT 
ACGACAGAAG GGAAACTGTT GGGGGATTAT 

50 ATGATAAAAC CCACCATCAA TGTGCAAGGA 
TTTCGCAATT CGGTAACGAA GGGGTTCCTT 
GCATCCTATA AGACATCGGA ATATCTTGAT 
CATTTCTATT CTTACTATAA CAAATGGACA 
TTCATCATGG TCAAATTCAA AAAGACAGGA 

55 TACAGAGTGC CCCTCGAATC TCAGGACAAT 
AATGTGAAAA TCGAAATCTT GGGTTCTTTA 
ACACTCGCAA TAGAAGAATG GATTCTCCAT 
TACTTGATAG TGG7\ACAGCA CGAAATCTTC 
TATCAAACTT CGCAGAAACC AATCAGCATT 

60 TCTTCTGATG GCACTCAGCA CAATGATCTT 
ATTACAAGCG ATAATACAAG CATCATAATC 
AAGAAGATCG TTTTTGAGGT AACTAATGGG 
CTCCAATATC CTGCACAATT TATTGTCAAT 
GACGGATCTT TGGCTCCGGG GCTTMCAAT 

65 CCACCCGAGA ATTTATTTGA AGATGGGACA 
ACAATTTCTT TTCATAAGAA AGAGAACAAT 
ACGACAAAAC AGGACCTTGA GACATCAAGA 
CAACTTGGGG CTACTCTCCC GATGCCCTAT 
CTTGACTATT CGGGAAACTA TAATAATAAG 

70 TGGGAGAAAA GAAAAGTTAA TAACGAAGAA 
GAAGCTGAGA TCAAATTGAT AGATAAGCTG 
ATCATGACAG GGAATTATTA TTGGGATAGT 
GGAGGAGGGG GCCAAGGAAA TTCCTCCAAA 
AAGCCGATTC GTGACAAGAA GTCAGGTAAG 



;Q ID NO: 200 

TTGATGTTAC TCTCTCTCTG GACGGGCTGT 60 

GAGAATACGC TTGTTTTACG CGTAGAGACC 120 

CCCGGTCAGG GCATATACAA TGAGAATAAA 180 

GAGGGACAAC TTCGTTGGCA GGTGAAGTCT 24 0 

ATCATTCCGG TCAAAGAGCA AATGCGACCA 300 

TATGTAGTGG CCAATCTCGA TTTCAATGCT 360 

GTGGTAGAGA AATCTATTGA AGTCTCTTCT 4 20 

GCTCATGGCA ATAAGCAGAT CTSiATATGGCT 4 80 

AAACTCAAAC GAGTGGCAGC AAAGATTCGC 54 0 

TATGAAGTGG TCGGAAATAT ACAGGCAAAG 6 00 

ACGACAGAAG CTCAAGAGAT CCCAGCTGCT 660 

ATTGCAGAGT CGGCACCTGC CAATTCTATC 7 20 

CTCTCCACAC CGGAGAAGCG ACCGGAATTC 7 80 

CAGCCGGACA ACACAGCCAA ACCGTACTAC 84 0 

CAGGTCAAGA GCAATGTCCT CTATAATCTG 900 

CAAGAGCCGG AAGCTGTTTC TGTAAACGGC 960 

CAGGATGCAT TCAATCTGCC TGCCACCAAT 102 0 

ATGAATAACG TGAACACATA CTCGGTGAAA 1080 

AGCATACAGT CAGTTACCTT TAGCTACGTC 114 0 

GTAGCAAGTA GTAGCGACCA GTATCCTACG 12 00 

ACTTCCAAGA TACCGGTTAA TAACGTACCA 1260 

GTAGCCGGTT TGAAAGAGAC TGTCACAGTA 1320 

ACACTTGGCA CAGCATCGGC ATGGAGACCA 1380 

AAAGCGATTT ACCATGTCGT AGTACTGGTT 14 4 0 

CAGACAATCA TCGGTTATCC CCCCACTGAA 1500 

ACCTATCCGA TAGTATGGTC TGACACAAAT 1560 

ATGATTTCAC CTTCCTTTGA GTTAGCCTCC 1620 

CTCGAGTATT GGCCAGGGAC ATCATATCTC 1680 

AGATACGCCT TGTTTAATTG CGCTTTTTAC 17 4 0 

ATT7W\TTCG ATGACTGGCG TTTGCCGACA 1800 

CAACATAATG AGCAGAGTGC TGTCCAAGCT 1860 

TACTCTGCAA ATGGGTCTTA TAAAATGCAA 1920 

GCCTATGTTC GTTGCGTGCG GGATGTGAAA 1980 

2010 
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(2) TIIFORI-IATIOII FOR SEQ ID NO: 201 

(i) SEQUEIICE CHARACTERISTICS: 
5 (A) LENGTH: 3846 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAM DE DUES S: double 

(D) TOPOLOGY: circular 

10 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: HO 



15 



(vi) ORIGINAL SOURCE: 

(A) ORGAIJISM: PORYPHYROHONAS GIHGIVALIS 



(i::) FEATURE: 
20 (A) NAI-IE/KEY: misc_feature 

(B) LOCATION 1. . .384 6 

(Mi) SEQUENCE DESCRIPTION: SEQ ID NO: 201 

25 ATGCGA/VAAA TTTTGAGCTT TTTGATGATG TGCTCTCTGC ATTTAGGTCT ACAATCTCAG 60 

ACTTGGCATG GAGATCCGGA CTCAGTGGCA GCCCTACCTT CTATCGGTAT TCAAGAGTCA 120 
AGTTGTACCC GAATCACGTT CGAGGTTGTT TTCCCCGGAT TTTATAGTGT GGAAAAACGA 180 
GAAGGCAACC AAGTCTTTCA GCGCATTTCC ATGCCGGGTT GTGGCTCGTT TGGGAATCTG 24 0 

GGCGAAGCTG AATTGCCTGT TTTGAA7W\G ATGATAGCCG TTCCGGAATT TTCAACAGCT 300 

30 AACGTTGCTG TAAAAATCAA AGAGACGGAG ACATTCGACA ATTATAATAT CTATCCTAAT 360 
CCTACCTATG TCGTAGAGGA GTTGCCTGAG GGGGGGACTT ATCTGGTAGA GGCTTTCGCG 4 20 

ATAAACAATG ACTATTATAG CCAAAATGTA AGCCTCCCTT CTACTCACTA TGTCTATTCT 4 80 

CAAGACGGGT ATTTTCGCTC ACAAAGATTT ATCGAAGTTA CCCTGTATCC TTTTCGATAC 54 0 

AACCCTGTCC GACAAGAAAT TCTATTTGCA AAAAAAATCG AGGTTACAAT AACTTTCGAT 600 

35 AATCCTCAGC CACCTTTACA AAAAAACACC GGCATATTTA ACAAAGTAGC CTCCTCTGCA 660 
TTTATT7VATT ATGAAGCTGA TGGCAAATCG GCGATAGAAA ATGATATGGT GTTCAGTCGT 72 0 

GGTACAACAA CGTACATAAG CGGAAATGTT GCCAGCAACC TCCCTCAGAA CTGTGACTAC 7 80 

TTGGTTATTT ACGATGATAT GTTCAACGTA AATCAACAAC CACACGACGA AATCAAACGG 84 0 

CTGTGCGAAC ATAGAGCCTT CTACAACGGC TTTGATGTAG CTGCTGT7VAG TATAAAGGAC 900 

40 GTATTGAATA GCTTCCCATC ATU^TGCCACC TCATACATCA ACGAAACTAA ACTGAAAAAT 960 

TTCATTCGCT CAGTTTACAA CCAAAGCAAT GCGAAGAGGA CTTTAGATGG CAAACTGGGA 1020 

TACGTGCTAC TGATCGGAAA ACCATTGAGC AAATATTTGG CTGACACTGA TAATACAAAA 108 0 

GTCCCAACCT CTTTTATTCA TAATGTCTCC TT/\ATTCCAA GTCATCCAAC TTTTGGTTCC 114 0 

ATATGCGCCT CCGACTATTT TTTTAGTTGT GTTTCGCCCC TTGATACTGT CGGCGATTTG 1200 

45 TTTATCGGTC GATTTAGCGT CACCAATGCT CATGAATTGC ACAATCTGAT TGAAAAGACT 1260 

ATCAACAAAG AAATCTCATA TAATCCTATT GCACACAAAA ATATTCTTTA CGCAGAAGGG 132 0 

AAAGGCTGCG ATGCTCCAAT CTTACGTTTA TTCTTAAAAG AAATCGCCTC TGGTTACACA 1380 

GTCAACTCTA TCTTAAAATC T7\ATCAGGTC TCTGCAATAG ACTCGATATT TGACTGCTTG 144 0 

AATAATGGTT CCCATCATTT TTATTTTAAC ACTCATGGAA TGCCGACTGT TTGGGGGATA 1500 

50 GGGCAGGGAC TCGACGTCAA TACTCTAACA GCCCGATTGA ACAATACATC TTCGCAGGGA 1560 

TTATGTACGA GTCTATCATG TAGTTCGGCT GTAGCAGATT CAACTATTAG ATCGCTTGGA 1620 

GAAGTCCTGA CCACATACGC ACCTAACAAG GGATTCTCGG CTTTCTTAGG AGGAAGCAGA 1680 

GCCACCCAAT ATGCCGTTTA TTTAGAAGGC CCCTGTCCTC CGTCAGAATT TTATGAATAT 17 40 

TTACCTTATT CTTTATATCA CAATCTCTCG ACTGTTGTTG GCGAAATGTT GCTATCATCC 18 00 

55 ATTATCAATA CTAATTCTGT TGATACGTAT TCGAAATTCA ACTTCAATTT GCTTGGCGAC 18 60 

CCTGCACTAA ACATTATGGC TCATGGCATG GAGGTTAGTA ATTGTATTAC ACTACCAAAC 1920 

AACACCATTA TAAGCAGTCC GATAACAATA AAAAATGGTG GCTGCCTAAA AATACCGGAA 1980 

AAAGGAGTTT TGCATTTTAC TAATAATGGC TCCATACAAG TCATGTCCGG AGGAACTCTG 204 0 

GAAATAGGCA ATCAGGCTAA AATATCCGGA GAGACCGGTG CTAACCCCAC CTTTATTACC 2100 

60 GTTTACGGCG ATGGTCTTGC GATTAACAAG CAGGTAGAGA TAGACAATAT AGACCGACTT 2160 

;iJ\CTTGTTTT CTACGCATTC GGTCATGCCC AAATTTCATT TTGACAGTGT GAAATTCAAC 22 20 

AGTGCCCCGC TGTATAC7\AC GAACTGTATT GTGGAGATA/^. GCAATTGCGA ATTTACCAAT 2 2 80 

CGAAGTGACA TTATTTC/W^ GAATTGTGAC CTAAGCGTTG AAAACAGTAT GTTTAGCAGT 2 34 0 

TCGGGGATAA CGGTATTCAA GCCTATGGCT ACAAGCTCCA TCACCGGATT ATCTACAAAA 24 00 

65 GCAAAGATTA CCGACAATAC TTTTTTTGCG ACAGGAAACT TCGCCTACCA TATCACAAAC 2 4 60 

ACGCCAGGCT TAACAGCAAC CTCCAATGCT GCCATCAAGT TAGACAATAT TCCTGAGTAT 2520 

TACATTTCCG GTAATAAAAT AGTCAATTGC GATGAGGCTC TTGTACTAAA TAATAGTGGC 258 0 

AACAGAACGA ACAGACTCCA CAATATCACA CGGAATGTGA TAAAAAACTG TAGGATTGGG 2 64 0 

AGCACGCTTT ATAATTCCTA TGGTATTTAC AACCGAAATA AGATCAGTAA CAATCATATA 27 00 

70 GGAGTACGTC TCCTCAACAA CAGTTGTTTT TATTTCGATA ATGCTCCTGT AATCAATGAA 27 60 

GAAGATAAGC AGACGTTTAT TTCTAATAGG ACTTGGCAGC TCTATTCATC AAACGGTACA 2 8 20 

TTCCCTCTCA ACTTCCATTA CAACAGCTTG CAGGGGGGAG ATACAGATAC ATGGATTTAC 28 80 

AACGACACGT ATACGAATCG CTATATTGAC GTTTCAAATA ATCACTGGGG CAACAATGAT 2 94 0 

TTGTTTGATC CGAATCAGGT TTTCAATACG CCAGACTTGT TCATTTGGAT ACCTTTTTGG 3000 

75 GATGGATTGC CAAATGGGAG ATCGGGCAAT AGCTCTGCTG AAGCAGTAGA ATTCGAAACA 3060 
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GCATTGGACT GTATTGGCAA TAGCGATTAT CTTTCGGCAA AAGTGGCTCT CAAGATGATG 3120 

GTTG7VAACCT ACCCGGAATC CGACTTTGCA ATAGCTGCTT TGAAGGAATT GTTCAGGATA 3180 

GAGAAAATGT CAGGCAACGA TTACGAAGGC TTGAAAGATT ATTTCAGATC CAATCCAACC 32-10 

ATCATCTCTT CCCAGAACTT GTTCCCGACA GCTGATTTCC TGTCTGCGCG ATGCGATATT 3300 

5 GTGTGTGA7\A ACTATCAGTC TGCCATCGAT TGGTACGAAA ATCGCTTGAA TAGTGAAATC 3360 

TCCTATCAGG ACAGTGTTTT TGCAGTCATT GACCTTGGTG ACATTTATTG GAATATGCAG . 34 2 0 

TTAGACTCAC TCAGAGGGAC TGGTATAGAT TTGAACATAC TTTCCTGTGA ACAAAGGAAA 34 8 0 

TCGCTCGAAA GCCATCAAAA TGTAAAAAAT TATTTGTTGT CAACTCTTCC CGAATCAACA 354 0 

GGTACTCTCC TGCCTCCATT AGAATGCAAC AAATCAAGCC TTGATAAATC CAAGATAATC 3600 

10 TCTATTTCGC CCAATCCGGC GAAAGCTGTT GTAACAATAA TCTACTATAC CGATAACCCT 3660 

TCCTGTTCTG TAATAAAAAT ATATGGAATA AATGGAGCCT CGGCTGATAT AACCGGGTTG 37 2 0 

CCCAAACATC TATCCGAAGG TTATTACAGC ATACAGTTCA ATACATCCAA CTTTGATCCC 37 80 

GGTTTCTACC TGGTAACGCT AAATGTTGAT CAGAAAATTA TAGATACGGA AAAATTACGA 38 4 0 

ATCAAA 38 4 6 



15 



30 



{2} INroRI^iATIOlI FOR SEQ ID NO: 202 



(i) SEQUEIICE CHARACTERISTICS: 
20 (A) LENGTH: 3822 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDMESS: double 

(D) TOPOLOGY: circular 

25 (ii) MOLECULE TYPE: DtlA (genomic) 

(ill) HYPOTHETICAL: NO 
(iv) AIITI-SE14SE: HO 



(vi) ORIGIIIAL SOURCE: 

(A) ORGAtlISM: PORYPHYROMOrJAS GIIIGIVALIS 



(iM) FEATURE: 
35 (A) MAIIE/KEY: mi sc_f eatu re 

(B) LOCATION 1 ... 3822 

CLi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 02 

40 ATGATGTGCT CTCTGCATTT AGGTCTACAA TCTCAGACTT GGCATGGAGA TCCGGACTCA 60 

GTGGCAGCCC TACCTTCTAT CGGTATTCAA GAGTCAAGTT GTACCCGAAT CACGTTCGAG 12 0 

GTTGTTTTCC CCGGATTTTA TAGTGTGGAA AAACGAGAAG GCAACCAAGT CTTTCAGCGC 180 

ATTTCCATGC CGGGTTGTGG CTCGTTTGGG AATCTGGGCG AAGCTGAATT GCCTGTTTTG 24 0 

AAAAAGATGA TAGCCGTTCC GGAATTTTCA ACAGCTAACG TTGCTGTAAA AATCAAAGAG 300 

45 ACGGAGACAT TCGACAATTA TAATATCTAT CCTAATCCTA CCTATGTCGT AGAGGAGTTG 360 

CCTGAGGGGG GGACTTATCT GGTAGAGGCT TTCGCGATAA ACAATGACTA TTATAGCCAA 4 20 

AATGTAAGCC TCCCTTCTAC TCACTATGTC TATTCTCAAG ACGGGTATTT TCGCTCACAA 4 80 

AGATTTATCG AAGTTACCCT GTATCCTTTT CGATACAACC CTGTCCGACA AGAAATTCTA 54 0 

TTTGCAAAAA AAATCGAGGT TACAATAACT TTCGATAATC CTCAGCCACC TTTACAAAAA 600 

50 AACACCGGCA TATTTAACAA AGTAGCCTCC TCTGCATTTA TTAATTATGA AGCTGATGGC 660 

AAATCGGCGA TAGAAAATGA TATGGTGTTC AGTCGTGGTA CAACAACGTA CAT7\AGCGGA 72 0 

AATGTTGCCA GCA^CCTCCC TCAGAACTGT GACTACTTGG TTATTTACGA TGATATGTTC 7 80 

AACGTAAATC AACAACCACA CGACGA/^ATC AAACGGCTGT GCGAACATAG AGCCTTCTAC 84 0 

AACGGCTTTG ATGTAGCTGC TGTAAGTATA AAGGACGTAT TGAATAGCTT CCCATCAAAT 900 

55 GCCACCTCAT ACATCAACGA AACTAAACTG AAAAATTTCA TTCGCTCAGT TTACAACCAA 960 

AGCTKATGCGA AGAGGACTTT AGATGGCAAA CTGGGATACG TGCTACTGAT CGGAAAACCA 1020 

TTGAGCAAAT ATTTGGCTGA CACTGATAAT ACAAAAGTCC CAACCTCTTT TATTCATAAT 108 0 

GTCTCCTTA/i. TTCCAAGTCA TCCAACTTTT GGTTCCATAT GCGCCTCCGA CTATTTTTTT 114 0 

AGTTGTGTTT CGCCCCTTGA TACTGTCGGC GATTTGTTTA TCGGTCGATT TAGCGTCACC 1200 

60 AATGCTCATG AATTGCACAA TCTGATTGAA AAGACTATCA ACAAAGAA^\T CTCATATAAT 1260 

CCTATTGCAC J\CJ\?J\AA'rAT TCTTTACGCA GAAGGGAAAG GCTGCGATGC TCCAATCTTA 132 0 

CGTTTATTCT TAAAAGAAAT CGCCTCTGGT TACACAGTCA ACTCTATCTT AAAATCTAAT 138 0 

CAGGTCTCTG CAATAGACTC GATATTTGAC TGCTTGAATA ATGGTTCCCA TCATTTTTAT 144 0 

TTTAACACTC ATGGAATGCC GACTGTTTGG GGGATAGGGC AGGGACTCGA CGTCAATACT 1500 

65 CTAACAGCCC GATTGAACAA TACATCTTCG CAGGGATTAT GTACGAGTCT ATCATGTAGT 1560 

TCGGCTGTAG CAGATTCAAC TATTAGATCG CTTGGAGAAG TCCTGACCAC ATACGCACCT 162 0 

T^CAAGGGAT TCTCGGCTTT CTTAGGAGGA AGCAGAGCCA CCCAATATGC CGTTTATTTA 1680 

GAAGGCCCCT GTCCTCCGTC AGAATTTTAT GAATATTTAC CTTATTCTTT ATATCACAAT 17 4 0 

CTCTCGACTG TTGTTGGCGA AATGTTGCTA TCATCCATTA TCAATACTAA TTCTGTTGAT 1800 

70 ACGTATTCGA AATTCA/vCTT CAATTTGCTT GGCGACCCTG CACTAAACAT TATGGCTCAT 1860 

GGCATGGAGG TTAGTAATTG TATTACACTA CCAAACAACA CCATTATAAG CAGTCCGATA 192 0 

ACAATAAAAA ATGGTGGCTG CCTAAAAATA CCGGAAAAAG GAGTTTTGCA TTTTACTAAT 1980 

AATGGCTCCA TACAAGTCAT GTCCGGAGGA ACTCTGGAAA TAGGCAATCA GGCTAAAATA 204 0 

TCCGGAGAGA CCGGTGCTAA CCCCACCTTT ATTACCGTTT ACGGCGATGG TCTTGCGATT 2100 

75 AACAAGCAGG TAGAGATAGA CAATATAGAC CGACTTAACT TGTTTTCTAC GCATTCGGTC 2160 
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25 



30 



ATGCCCAAAT 
TGTATTGTGG 
TGTGACCTAA 
ATGGCTACAA 
TTTGCGACAG 
AATGCTGCCA 
AATTGCGATG 
ATCACACGGA 
ATTTACAACC 
TGTTTTTATT 
AATAGGACTT 
AGCTTGCAGG 
ATTGACGTTT 
AATACGCCAG 
GGCAATAGCT 
GATTATCTTT 
TTTGCAATAG 
GAAGGCTTGA 
CCGACAGCTG 
ATCGATTGGT 
GTCATTGACC 
ATAGATTTGA 
AA7\;iATTATT 
TGCAhCAAAT 
GCTGTTGTAA 
GGAATAAATG 
TACAGCATAC 
GTTGATCAGA 



TTCATTTTGA 
AGATAAGCAA 
GCGT'J'GAAAA 
GCTCCATCAC 
GAAACTTCGC 
TCAAGTTAGA 
AGGCTCTTGT 
ATGTGATAAA 
GAAATAAGAT 
TCGATAATGC 
GGCAGCTCTA 
GGGGAGATAC 
CAAATAATCA 
ACTTGTTCAT 
CTGCTGAAGC 
CGGCAAAAGT 
CTGCTTTGAA 
AAGATTATTT 
ATTTCCTGTC 
ACGAAAATCG 
TTGGTGACAT 
ACATACTTTC 
TGTTGTCAAC 
GAAGGCTTGA 
CAATAATCTA 
GAGCCTCGGC 
AGTTCAATAC 
AAATTATAGA 



CAGTGTGAAA 
TTGCGAATTT 
CAGTATGTTT 
CGGATTATCT 
CTACCATATC 
CAATATTCCT 
ACTAAATAAT 
AAACTGTAGG 
CAGTAACAAT 
TCCTGTAATC 
TTCATCAAAC 
AGATACATGG 
CTGGGGCAAC 
TTGGATACCT 
AGTAGAATTC 
GGCTCTCAAG 
GGAATTGTTC 
CAGATCCAAT 
TGCGCGATGC 
CTTGAATAGT 
TTATTGGAAT 
CTGTGAACAA 
TCTTCCCGAA. 
TAAATCCAAG 
CTATACCGAT 
TGATATAACC 
ATCCAACTTT 
TACGGTW^AA 



TTCAACAGTG 
ACCAATCGAA 
AGCAGTTCGG 
ACAAAA.GCAA 
ACAAACACGC 
GAGTATTACA 
AGTGGCAACA 
ATTGGGAGCA 
CATATAGGAG 
AATGAAGAAG 
GGTACATTCC 
ATTTACAACG 
AATGATTTGT 
TTTTGGGATG 
CAAACAGCAT 
ATGATGGTTG 
AGGATAGAGA 
CCAACCATCA 
GATATTGTGT 
GAAATCTCCT 
ATGCAGTTAG 
AGGAAATCGC 
TCAACAGGTA 
ATAATCTCTA 
AACCCTTCCT 
GGGTTGCCCA 
GATCCCGGTT 
TTACGAATCA 



CCCCGCTGTA 
GTGACATTAT 
GGATAACGGT 
AGATTACCGA 
CAGGCTTAAC 
TTTCCGGTAA 
GAACGAACAG 
CGCTTTATAA 
TACGTCTCCT 
ATAAGCAGAC 
CTCTCAACTT 
ACACGTATAC 
TTGATCCGAA 
GATTGCCAAA 
TGGACTGTAT 
AAACCTACCC 
AAATGTCAGG 
TCTCTTCCCA 
GTGAAAACTA 
ATCAGGACAG 
ACTCACTCAG 
TCGAAAGCCA 
CTCTCCTGCC 
TTTCGCCCAA 
GTTCTGTAAT 
AACATCTATC 
TCTACCTGGT 
AA 



TACAACGAAC 
TTCAAAGAAT 
ATTCAAGCCT 
CAATACTTTT 
AGCAACCTCC 
TAAAATAGTC 
ACTCCACAAT 
TTCCTATGGT 
CAACAACAGT 
GTTTATTTCT 
CCATTACAAC 
GAATCGCTAT 
TCAGGTTTTC 
TGGGAGATCG 
TGGCAATAGC 
GGAATCCGAC 
CAACGATTAC 
GAACTTGTTC 
TCAGTCTGCC 
TGTTTTTGCA 
AGGGACTGGT 
TCAAAATGTA 
TCCATTAGAA 
TCCGGCGAAA 
AAAAATATAT 
CGAAGGTTAT 
AACGCTAAAT 



2220 
2280 
2340 
2*^00 
2460 
2520 
2580 
2 6-5 0 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 

32 4 0 
3300 

33 60 

34 2 0 
3480 
3540 
3600 
3660 
37 20 
37 80 
3822 



{2) II J FORMATION FOR SEQ ID 110:203 



35 



( i 1 SEQUENCE CHARACTERISTICS : 

(A) LEMGTH: 27 7 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRAVIDEDIIESS: double 

(D) TOPOLOGY: circular 



40 



(ii) MOLECULE TYPE: DlIA (genomic) 
(iii) HYPOTHETICAL: NO 



45 



iiv) Aim -SENSE: HO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMGHAS GIHGIVALIS 



50 



(ix) FEATURE: 

(A) UAME/KEY: mi£c_f eat ure 

(B) LOCATIOM 1 . . . 277 5 



(xi) SEQUENCE DESCRIPTIOM: SEQ ID nO:203 



55 



60 



65 



70 



75 



ATGGCTATCA 
GCAGCGATCA 
GCTCCGGCTC 
GGCTGGCTTG 
TTCTCTGTAC 
TCAACAGCGG 
TGGGTATGCA 
GGGACTGCCA 
CCTCTTGTAT 
CGACATTACA 
TATAGGTCGA 
AATGTGGGAC 
GAAGAGTTGC 
AAAGATGTCT 
CAAGTGGAGT 
GGCACGCTGC 
CCCAATGGTT 
AATGCATACG 
GTTCCGGGTA 
GCCAAACGTG 
GCGGTGATGG 
ACCATGACAG 



TGATGAAAAG 
CGAATCCGAC 
AACCGGATAC 
AGATAGATGC 
CTTACGGACA 
GCAACTATCT 
ATCAGTATAG 
TTGAAGACTT 
GGCGTAGACG 
AAGTCACCGA 
TCGAAGGGCC 
GATTGACTTG 
AGCTTAGCGG 
CCATACTGGA 
ACTGCGTTAC 
ATTACGCCAC 
GGCTTGTGAT 
ACGCTTTTCC 
TAGGCCCGGT 
TCAAGTACTG 
CTTCGACAAC 
CGAAGCCGAC 



TATTGTTTTT 
TGCTCAAGAG 
TATCTTATAT 
TGATGCTGAT 
CAATGGCCTT 
GATTACACCC 
TACCAATCCG 
TGTTTTGTTG 
AATCGTGGAC 
CTCACACACA 
CGAACCTGCT 
GAACTATCCG 
CTACAACATC 
GTATGTGGAC 
AGCCGTTTAC 
GGATGCCATC 
AGACGCTGAT 
CGGCCATAAT 
GACTCCCGAC 
GGTAAGCACG 
GGGGACTGCT 
CGGCGCATGG 



AGAGCATTTC 
ATCTCAGGCA 
GAAAGTTTTG 
GGTGCCACTT 
TGCACCTACT 
AATATAGAAG 
GAACATTACG 
TTTGATGATT 
TTACCGGAAG 
GAATTCTTGA 
ACCGACTTCA 
GAGGATTATC 
TATGCG.AACG 
AGCACTTACT 
GATGAAAGCA 
CTTTATGAAA 
GGAGATGGAT 
GGAGGCCATT 
AACTATCTGA 
CAGGATGCCA 
GTCGGAGATT 
TATGAAAGAA 



TAACGATTTT 
TGAATGCATC 
AGAAT GGACC 
GGGGAAGCCC 
CCCATATACG 
GAGCCAAACG 
CAGTAATGGT 
CCATAACAGG 
GGACCAAATA 
AATTGGATGA 
CAGTAATCAA 
AACCGGAAGG 
GTACACTACT 
CTTTGCGAGA 
TAGAATCTTC 
ATTTTGAGAA 
TTAGCTGGGG 
GCTCCTTGTC 
TTACCCCCAA 
ATTGGGCAGC 
TCGTCATATT 
CCATCAACTT 



GCTCTCGTGG 
CTGTCTGGCT 
TGTTCCCAAT 
ATCAGGCTCT 
TTCCGGTATC 
GGTCAAGTAC 
ATCGACAACG 
GAAACCGACT 
TATTGCATGG 
TGTCACTGTG 
TATTGGTCAG 
AAAGGGGAAT 
GGCACAAATA 
CAATCCCTTG 
GACCGTATGT 
TGGACCTGTT 
ACACTATTTG 
GGCTTCTTAT 
GGTTGAAGGA 
GGAACATTAC 
GTTCGAAGAA 
ACCTGAAGGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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ACTAAATACA TCGCATGGCG GCATTACAAC TGTACCGATA TATATTTCTT GAAGTTGGAC 1380 

GATATCACTG TATTCGGGAC TCCTGCATCA GAGCCCGAAC CTGTTACCGA TTTCGTTGTC l^l'lO 

TCGCTTATTG AAAACAACAA GGGACGATTA AAGTGGAATT ATCCTAACGG CTACGAACCC 1500 

GATAAGACTG ATGATAAAGA CCCATTGCAG CTTGCCGGCT ACAATATCTA TGCAAACGGC 1560 

5 TCGCTCCTTG TTCACATACA AGACCCGACT GTTTTGGAGT ATATCGATGA GACTTATTCT 1620 

TCACGAGACG ATCAGGTGGA AGTGGAATAT TGTGTCACTG CCGTTTATAA CGACAATATC 168 0 

GAGTCCCAAT CGGTTTGCGA TAAGCTGATT TATGATTCTC AATCGGACAT TATCTTATAT 174 0 

GAAGGCTTTG AGGCCGGAAG TATTCCTGAA GGCTGGTTGT TGATTGATGC TGATGGCGAC 1800 

AATGTTAATT GGGACTATTA TCCTTGGACT ATGTATGGAC ATGACAGTGA GAAGTGTATT 1860 

10 GCATCCCCTT CGTACTTACC GATGATTGGC GTTTTAACTC CGGATAACTA TTTGGTTACA 192 0 

CCCAGACTCG AAGGAGCCAA GCTTGTCAAG TATTGGGTAA GTGCGCAAGA TGCTGTTTAT 1980 

TCGGCTGAGC ATTATGCTGT GATGGTTTCT ACTACGGG7\A CTGCTGTTGA AGATTTTGTC 204 0 

CTCTTGTTCG AAGAGACAAT GACCGCTAAG GCTAACGGTG CATGGTATGA GCGAACTATT 2100 

ACATTGCCTG CAGGAACAAA ATATATTGCC TGGCGGCATT ATGATTGCAC CGATATGTTT 2160 

15 TTCTTGCTCT TGGATGACAT TACGGTTTAT CGTTCTACTG AGACTGTTCC CGAGCCTGTT 2220 

ACTGATTTCG TTGTCTCGCT TATTGAGAAT AACAAGGGTC GCCTGAAATG GAATTATCCT 228 0 

AACGGCTACG AACCCGATAA GACTGATGAT AAAAAACCAT TGCAGCTTAC CGGCTACAAC 234 0 

ATCTATGCAA ATGGCTCGCT CCTTGTTCAC ATACAAGACC CGACTGTTTT GGAGTATATC 2 4 00 

GATGAGACTT ATTCTTCACG AGACGGTCAG GTGG.AAATGG AATATTGTGT CACTGCCGTT 2 4 60 

20 TATAACGACA ATATCGAGTC CC7\ATCGGTT TGCGATA^XGC TGAACTATAC TATCACATCC 252 0 

TTGGATAATA TTCAATCTGA TACAAGCTTG AAAATATATC CTAAT CCGGC ATCGTATGTG 258 0 

GTAAGGATAG AGGGATTGAG TCGGAGCAAG TCGACAATCG AGTTGTATAA TGCGCTGGGA 2 64 0 

ATTTGCATAT TAAGGGAAGA GACTCATTCA GAGAAAACGG A.AATCGATGT TTCACGTCTC 27 00 

AATGACGGAG TCTACTTGAT TAAAGTAGTC GGTGGAAATA AAACAACAAC CGAAAAGGTA 276 0 

25 GAGATAAAGA GGCCG 27 7 5 

(2) I M FORMAT I OH FOR SEQ ID I ID: 2 04 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2766 base pairs 
(.B) TYPE: nucleic acid 

(C) STRAMDEDMESS: double 

(D) TOPOLOGY: circular 

35 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: 1 10 

40 (iv) AMTI-SEMSE: MO 

(vi) ORIGINAL SOURCE: 

(A) ORGAJIISM: PORYPHYROHONAS GIMGXVALIS 

45 (ix) FEATURE: 

(A) tJAllE/KEY: misc_f eature 

( B) LOCATION 1 ... 2766 



50 



(xi) SEQUENCE DESCRI PTIOI) : SEQ ID NO: 204 



ATGATGAAAA GTATTGTTTT TAGAGCATTT CTA.^CGATTT TGCTCTCGTG GGCAGCGATC 60 

ACGAATCCGA CTGCTCAAGA GATCTCAGGC ATGAATGCAT CCTGTCTGGC TGCTCCGGCT 120 

CAACCGGATA CTATCTTATA TGAAAGTTTT GAGAATGGAC CTGTTCCCAA TGGCTGGCTT 180 

GAGATAGATG CTGATGCTGA TGGTGCCACT TGGGGAAGCC CATCAGGCTC TTTCTCTGTA 24 0 

55 CCTTACGGAC ACAATGGCCT TTGCACCTAC TCCCATATAC GTTCCGGTAT CTCAACAGCG 300 

GGCAACTATC TGATTACACC CAATATAGAA GGAGCCAAAC GGGTCAAGTA CTGGGTATGC 360 

AATCAGTATA GTACCAATCC GGAACATTAC GCAGTAATGG TATCGACAAC GGGGACTGCC 4 20 

ATTGAAGACT TTGTTTTGTT GTTTGATGAT TCCATAACAG GGAAACCGAC TCCTCTTGTA 4 80 

TGGCGTAGAC GAATCGTGGA CTTACCGGAA GGGACCAAAT ATATTGCATG GCGACATTAC 54 0 

60 AAAGTCACCG ACTCACACAC AGAATTCTTG AAATTGGATG ATGTCACTGT GTATAGGTCG 600 

ATCGAAGGGC CCGAACCTGC TACCGACTTC ACAGT^ATCA ATATTGGTCA GAATGTGGGA 660 

CGATTGACTT GGAACTATCC GGAGGATTAT CAACCGGAAG GAAAGGGGAA TGAAGAGTTG 7 20 

CAGCTTAGCG GCTACAACAT CTATGCGAAC GGTACACTAC TGGCACAAAT AAAAGATGTC 7 80 

TCCATACTGG AGTATGTGGA CAGCACTTAC TCTTTGCGAG ACAATCCCTT GCAAGTGGAG 84 0 

65 TACTGCGTTA CAGCCGTTTA CGATGAAAGC ATAGAATCTT CGACCGTATG TGGCACGCTG 900 

CATTACGCCA CGGATGCCAT CCTTTATGA^ AATTTTGAGA ATGGACCTGT TCCCAATGGT 960 

TGGCTTGTGA TAGACGCTGA TGGAGATGGA TTTAGCTGGG GACACTATTT GAATGCATAC 1020 

GACGCTTTTC CCGGCCATAA TGGAGGCCAT TGCTCCTTGT CGGCTTCTTA TGTTCCGGGT 1080 

ATAGGCCCGG TGACTCCCGA CAACTATCTG ATTACCCCCA AGGTTGAAGG AGCCAMCGT 114 0 

70 GTCAAGTACT GGGTAAGCAC GCAGGATGCC AATTGGGCAG CGGAACATTA CGCGGTGATG 1200 

GCTTCGACAA CGGGGACTGC TGTCGGAGAT TTCGTCATAT TGTTCGAAGA AACCATGACA 1260 

GCGAAGCCGA CCGGCGCATG GTATGAAAGA ACCATCAACT TACCTGAAGG GACTAAATAC 1320 

ATCGCATGGC GGCATTACAA CTGTACCGAT ATATATTTCT TGAAGTTGGA CGATATCACT 138C 

GTATTCGGGA CTCCTGCATC AGAGCCCG/^A CCTGTTACCG ATTTCGTTGT CTCGCTTATT 14 4 0 

75 GAAAACAACA AGGGACGATT AAAGTGGAAT TATCCTAACG GCTACGAACC CGATAAGACT 1500 
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GATGATAAAG ACCCATTGCA GCTTGCCGGC TACAATATCT ATGCAAACGG CTCGCTCCTT 1560 

GTTCACATAC AAGACCCGAC TGTTTTGGAG TATATCGATG AGACTTATTC TTCACGAGAC 1620 

GATCAGGTGG AAGTGGAATA TTGTGTCACT GCCGTTTATA ACGACAATAT CGAGTCCCAA 1680 

TCGGTTTGCG ATAAGCTGAT TTATGATTCT CAATCGGACA TTATCTTATA TGAAGGCTTT 17 4 0 

5 GAGGCCGGAA GTATTCCTGA AGGCTGGTTG TTGATTGATG CTGATGGCGA CAATGTTAAT 1800 

TGGGACTATT ATCCTTGGAC TATGTATGGA CATGACAGTG AGAAGTGTAT TGCATCCCCT 18 60 

TCGTACTTAC CGATGATTGG CGTTTTAACT CCGGATAACT ATTTGGTTAC ACCCAGACTC 1920 

GAAGGAGCCA AGCTTGTCAA GTATTGGGTA AGTGCGCAAG ATGCTGTTTA TTCGGCTGAG 1980 

CATTATGCTG TGATGGTTTC TACTACGGGA ACTGCTGTTG AAGATTTTGT CCTCTTGTTC 204 0 

10 GAAGAGACAA TGACCGCTAA GGCTAACGGT GCATGGTATG AGCGAACTAT TACATTGCCT 2100 

GCAGGAACAA AATATATTGC CTGGCGGCAT TATGATTGCA CCGATATGTT TTTCTTGCTC 2160 

TTGGATGACA TTACGGTTTA TCGTTCTACT GAGACTGTTC CCGAGCCTGT TACTGATTTC 2220 

GTTGTCTCGC TTATTGAGAA TAACAAGGGT CGCCTGAAAT GGAATTATCC TAACGGCTAC 22 80 

GAACCCGATA AGACTGATGA TAAAAAACCA TTGCAGCTTA CCGGCTACAA CATCTATGCA 23 4 0 

15 AATGGCTCGC TCCTTGTTCA CATACAAGAC CCGACTGTTT TGGAGTATAT CGATGAGACT 2 4 00 

TATTCTTCAC GAGACGGTCA GGTGGAAATG GAATATTGTG TCACTGCCGT TTATAACGAC 24 60 

AATATCGAGT CCCAATCGGT TTGCGATAAG CTGAACTATA CTATCACATC CTTGGATAAT 2520 

ATTCAATCTG ATACAAGCTT GAAAATATAT CCTAATCCGG CATCGTATGT GGTAAGGATA 2580 

GAGGGATTGA GTCGGAGCAA GTCGACAATC GAGTTGTATA ATGCGCTGGG AATTTGCATA 2 64 0 

20 TTAAGGGAAG AGACTCATTC AGAGAAAACG GAAATCGATG TTTCACGTCT CAATGACGGA 27 00 

GTCTACTTGA TTAAAGTAGT CGGTGGAAAT AAAACAACAA CCGAAAAGGT AGAGATAAAG 2 7 60 

AGGCCG 27 66 

25 {2) IM FORI lATI Oil FOR SEQ ID MO: 2 05 

(i) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 2763 base pairs 

(B) . TYPE: nucleic acid 
30 (C) STRAIIDEDHESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA (genomic) 

35 lili) HYPOTHETICAL: 110 

(iv) AMTI-SEMSE: NO 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: PORYPHYROHOHAS GINGIVAL IS 

(ix) FEATURE: 

(A) NAME /KEY: mi sc_f ea t ure 

(B) LOCATION 1 ... 2763 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 05 



45 



ATGAAAAGTA TTGTTTTTAG AGCATTTCTA ACGATTTTGC TCTCGTGGGC AGCGATCACG 60 

AATCCGACTG CTCAAGAGAT CTCAGGCATG AATGCATCCT GTCTGGCTGC TCCGGCTCAA 120 

50 CCGGATACTA TCTTATATGA AAGTTTTGAG AATGGACCTG TTCCCAATGG CTGGCTTGAG 180 

ATAGATGCTG ATGCTGATGG TGCCACTTGG GGAAGCCCAT CAGGCTCTTT CTCTGTACCT 24 0 

TACGGACACA ATGGCCTTTG CACCTACTCC CATATACGTT CCGGTATCTC AACAGCGGGC 300 

AACTATCTGA TTACACCCAA TATAGAAGGA GCCAAACGGG TCAAGTACTG GGTATGCAAT 360 

CAGTATAGTA CCAATCCGGA ACATTACGCA GTAATGGTAT CGACAACGGG GACTGCCATT 4 20 

55 GAAGACTTTG TTTTGTTGTT TGATGATTCC ATAACAGGGA AACCGACTCC TCTTGTATGG 4 80 

CGTAGACGAA TCGTGGACTT ACCGGAAGGG ACC7W\TATA TTGCATGGCG ACATTACAAA 54 0 

GTCACCGACT CACACACAGA ATTCTTGAAA TTGGATGATG TCACTGTGTA TAGGTCGATC 600 

GAAGGGCCCG AACCTGCTAC CGACTTCACA GTAATCAATA TTGGTCAGAA TGTGGGACGA 660 

TTGACTTGGA ACTATCCGGA GGATTATCAA CCGGAAGGAA AGGGGAATGA AGAGTTGCAG 72 0 

60 CTTAGCGGCT ACAACATCTA TGCGAACGGT ACACTACTGG CACAAATAAA AGATGTCTCC 780 

ATACTGGAGT ATGTGGACAG CACTTACTCT TTGCGAGACA ATCCCTTGCA AGTGGAGTAC 84 0 

TGCGTTACAG CCGTTTACGA TGAAAGCATA GAATCTTCGA CCGTATGTGG CACGCTGCAT 900 

TACGCCACGG ATGCCATCCT TTATGAAAAT TTTGAGAATG GACCTGTTCC CAATGGTTGG 960 

CTTGTGATAG ACGCTGATGG AGATGGATTT AGCTGGGGAC ACTATTTGAA TGCATACGAC 1020 

65 GCTTTTCCCG GCCATAATGG AGGCCATTGC TCCTTGTCGG CTTCTTATGT TCCGGGTATA 108 0 

GGCCCGGTGA CTCCCGACAA CTATCTGATT ACCCCCAAGG TTGAAGGAGC CAAACGTGTC 114 0 

AAGTACTGGG TAAGCACGCA GGATGCCAAT TGGGCAGCGG AACATTACGC GGTGATGGCT 1200 

TCGACAACGG GGACTGCTGT CGGAGATTTC GTCATATTGT TCGAAGAAAC CATGACAGCG 12 60 

;\AGCCGACCG GCGCATGGTA TGAAAGAACC ATCAACTTAC CTGAAGGGAC TAAATACATC 1320 

70 GCATGGCGGC ATTACAACTG TACCGATATA TATTTCTTGA AGTTGGACGA TATCACTGTA 138 0 

TTCGGGACTC CTGCATCAGA GCCCGAACCT GTTACCGATT TCGTTGTCTC GCTTATTGAA 14 4 0 

AACAACAAGG GACGATTAAA GTGGAATTAT CCTAACGGCT ACGAACCCGA TAAGACTGAT 1500 

GATAAAGACC CATTGCAGCT TGCCGGCTAC AATATCTATG CAAACGGCTC GCTCCTTGTT 1560 

CACATACAAG ACCCGACTGT TTTGGAGTAT ATCGATGAGA CTTATTCTTC ACGAGACGAT 1620 

75 CAGGTGGAAG TGGAATATTG TGTCACTGCC GTTTATAACG ACAATATCGA GTCCCAATCG 1680 
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GTTTGCGATA AGCTGATTTA TGATTCTC;iJ^ TCGGACATTA TCTTATATGA AGGCTTTGAG 17^0 

GCCGGAAGTA TTCCTGAAGG CTGGTTGTTG ATTGATGCTG ATGGCGACAA TGTTAATTGG 1800 

GACTATTATC CTTGGACTAT GTATGGACAT GACAGTGAGA AGTGTATTGC ATCCCCTTCG 1860 

TACTTACCGA TGATTGGCGT TTTAACTCCG GATAACTATT TGGTTACACC CAGACTCGAA 1920 

5 GGAGCCAAGC TTGTCAAGTA TTGGGTAAGT GCGCAAGATG CTGTTTATTC GGCTGAGCAT 198 0 

TATGCTGTGA TGGTTTCTAC TACGGGAACT GCTGTTGAAG ATTTTGTCCT CTTGTTCGAA 20^0 

GAGACAATGA CCGCTAAGGC TAACGGTGCA TGGTATGAGC GAACTATTAC ATTGCCTGCA 2100 

GGAACAAAAT ATATTGCCTG GCGGCATTAT GATTGCACCG ATATGTTTTT CTTGCTCTTG 2160 

GATGACATTA CGGTTTATCG TTCTACTGAG ACTGTTCCCG AGCCTGTTAC TGATTTCGTT 222 0 

10 GTCTCGCTTA TTGAGAATAA CAAGGGTCGC CTGAAATGGA ATTATCCTAA CGGCTACGAA 22 80 

CCCGATAAGA CTGATGATAA AAAACCATTG CAGCTTACCG GCTACAACAT CTATGCAAAT 23'10 

GGCTCGCTCC TTGTTCACAT ACAAGACCCG ACTGTTTTGG AGTATATCGA TGAGACTTAT 24 00 

TCTTCACGAG ACGGTCAGGT GGAAATGGAA TATTGTGTCA CTGCCGTTTA TAACGACAAT 24 60 

ATCGAGTCCC AATCGGTTTG CGATAAGCTG AACTATACTA TCACATCCTT GGATAATATT 2520 

15 CAATCTGATA CAAGCTTGAA AATATATCCT AATCCGGCAT CGTATGTGGT AAGGATAGAG 258 0 

GGATTGAGTC GGAGCAAGTC GACAATCGAG TTGTATAATG CGCTGGGAAT TTGCATATTA 264 0 

AGGGAAGAGA CTCATTCAGA GAAAACGGAA ATCGATGTTT CACGTCTCAA TGACGGAGTC 27 00 

TACTTGATTA AAGTAGTCGG TGGAAATAAA ACAACAACCG AAAAGGTAGA GATAAAGAGG 27 60 

CCG 27 63 



20 



35 



(2) I U FORMAT I on FOR SEQ ID 110:206 



(i) SEQUEIiCE CHARACTERISTICS: 
25 {A) LEMGTH: 1779 base pairs- 

(B) TYPE: nucleic acid 

(C) STRAUDEDMESS: double 
(D} TOPOLOGY: circular 

30 (ii) nOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: UO 

(iv) AUTI-SENSE: NO 



(vi) ORIGIIIAL SOURCE: 

(A) ORGANISM: PORYPHYROMOHAS GIHGIVALIS 



(ix) FEATURE: 
40 (A) IJAIIE/KEY: mi sc_f ea ture 

(B) LOCATION 1. . . 177 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206 

45 ATGAACAGCA TCATGAAATA TCAATTATAT ACGGCCGTCA TAATGGCTCT CTCTGTATCA 60 

TCCGTTTGCG GTCAAACCCC ACGAAATACA GAAACCAAAC GCCCCGACAC GCTGCGCAGG 12 0 

GAGCTTACTA TCGTTAATGA CCAGACTGTG GAGATGGAGC ATGCGGATCC GCTTCCGGCT ISO 

GCATACAAGG CCATCGAACC TCGATTAAAA CCTTTCCGTC CGGAATATAA CAAGCGTACA 24 0 

TTCGGATTTG TCCCTGAAGT TTCCTCTTCA GGCAGGAACA ATCTTCCGAA TATCCTGCCG 300 

50 ACGGAAGGTC ATATGAAGCA CCGGGGGTAC CTGAATATCG GTATCGGCCA TACGCTAAAC 360 

CAGCGAATGG ATGCCGGCTA TCGTCTGATA GATGCAGAGC AGGAGAGACT GAATCTTTTC 4 20 

CTCTCCTATC GTGGGATGAA ATCGGCTTTC AATACCGGTG ACTTCGACGG CGACAGAAAG 4 80 

GATAGACGAA TGATGGCAGG AGTGGACTAC GAGCAGCGCA GGCCTTCCTT TGTGCTTGCT 54 0 

ACCGGCTTGT ATTATTCGAA CCATTATTTC AATAACTACG GACGGGGAGC TACCACCAAT 600 

55 GTGGGCAGCA TCCCTCAGCT ATCGACACCT GTTACTCCTC AGATGGACAA CGGGACCCAC 660 

AACGTCCGTG TATACTTGGG TGCAAAAAAT GATGTGATCG ATGCCAGGAT CGACTATCGT 7 20 

TTCTTCCGTT CTATTCCCTA TCTGGGTACC GATCCGATGA AGGCTCTCAC AGAACATACG 7 80 

CCTGAACTGA ACGTGACGAT GAGTAATGAG TTGTCCGATG ATATTAAGCT CGGTGTCGAA 84 0 

GTTCGTACGG GAGGATTGTT TTTTGCCAAA AACAGCGAAA TGATTCAAAC GGGCGTTCTG 900 

60 TCCGAAACCG ACCGCAACCT GTATTATGTG GAGGGCGCGC CCACAATCGG ATTTGTCGGA 960 

GACTCGGACA ATAT-GCAATG GAACATACAG GCCGGAGTAG GGATTTCTTC CCATTTCGGA 102 0 

GCCAAAGGGA GGTTGTTTTT CTGGCCTAAA CTGGATGCTT CGCTTAGTAT CTTCCCTTCA 108 0 

TGGCGTGTGT ATGCGAAAGC CTTCGGCGGT GTGATTCGAA ATGGTCTCGC CGATGTTATG 114 0 

CAAGAGGAGA TGCCCTACCT GATGCCCMT ACGATTGTAC TCCCTTCGCG CAATGCTTTG 1200 

65 ACCGCCCAAT TAGGGGTGAA GGGGAATATA GCCGATGTGG TACGTATGGA GGTTTATGGC 1260 

GACTTCTCCA AGCTGACAGG TGTGCCTTTC TATACTCCGA CTCTACCCTT ATATAATCCA 132 0 

TCCGACTTGT ATCAGTATAA TGTGAGTTTC TTGCCGATAT ATGCCGACGG CAGCCGCTGG 1380 

CGCGCAGGTG GTAAGCTGGA ATACTCTTAT CGCGATATGC TCCGCTTTCT GGTAGACGCA 1440 

TCCTATGGCA AGTGGAATTT GGATGGAGGA CTTGTCGCCT CCATGCAGCC CGATCTTATA 1500 

70 TTGAAGGCAG AAGTAGGTGT TCATCCCATT GCCCCATTGG ATGTCAGACT CCGGTATACA 1560 

CAGCTGAACG GACGGTATCG GTATTCTTTC GGCTCGGCTG GCTCGG.^„-.GC CTTGGGTATC 162 0 

GGTAATGTAC ATCTTCTTAG TGCGGATGTT TCATACAA.GC TGAAAAP.G.^.A CTTGAGCCTT 1680 

TATCTCAAAJ\ TCGATAATAT GCTGGCGGAA ACGACAGAGC TTATCGGTTA TTATCCTATG 17 4 0 

CAGCCGTTCC ATTGTTTCGC CGGTTTTAGC TGGACTTTC 177 9 

75 
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(2) lUFORMATION FOR SEQ ID 110:20"? 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 17 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRAI-4DEDNESS: double 

(D) TOPOLOGY: circular 

10 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) AI IT I -SENSE: NO 



15 



55 



70 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 



( i>: ) FEATURE : 
20 (A) NAT IE /KEY: miscf eature 

(B) LOCATION 1 . . .17 67 

SEQUENCE DESCRIPTION: SEQ ID NO: 207 

25 ATG.i^.-JXTATC AATTATATAC GGCCGTCATA ATGGCTCTCT CTGTATCATC CGTTTGCGGT 60 

CAAA.CCCCAC GAAATACAGA AACCAAACGC CCCGACACGC TGCGCAGGGA GCTTACTATC 120 
GTTAATGACC AGACTGTGGA GATGGAGCAT GCGGATCCGC TTCCGGCTGC ATACAAGGCC 180 
ATCGAACCTC GATTAAAACC TTTCCGTCCG GAATATAACA AGCGTACATT C&GATTTGTC 2^0 
CCTG.AAGTTT CCTCTTCAGG CAGGAACAAT CTTCCGAATA TCCTGCCGAC GGAAGGTCAT 300 
30 ATGAAGCACC GGGGGTACCT GAATATCGGT ATCGGCCATA CGCTAAACCA GCGAATGGAT 360 
GCCGGCTATC GTCTGATAGA TGCAGAGCAG GAGAGACTGA ATCTTTTCCT CTCCTATCGT ^20 
GGGATG7W\T CGGCTTTCAA TACCGGTGAC TTCGACGGCG ACAGAAAGGA TAGACGAATG 4 80 

ATGGCAGGAG TGGACTACGA GCAGCGCAGG CCTTCCTTTG TGCTTGCTAC CGGCTTGTAT 5*5 0 

TATTCGAACC ATTATTTCAA TAACTACGGA CGGGGAGCTA CCACCAATGT GGGCAGCATC 600 
35 CCTCAGCTAT CGACACCTGT TACTCCTCAG ATGGACAACG GGACCCACAA CGTCCGTGTA 660 
TACTTGGGTG CAAAAAATGA TGTGATCGAT GCCAGGATCG ACTATCGTTT CTTCCGTTCT 72 0 

ATTCCCTATC TGGGTACCGA TCCGATG7y\G GCTCTCACAG AACATACGCC TGAACTGAAC 7 80 

GTGACGATGA GTAATGAGTT GTCCGATGAT ATT7VAGCTCG GTGTCGAAGT TCGTACGGGA 8^0 
GGATTGTTTT TTGCC7WW\ CAGCGAAATG ATTCAAACGG GCGTTCTGTC CGAAACCGAC 900 
40 CGCAACCTGT ATTATGTGGA GGGCGCGCCC ACAATCGGAT TTGTCGGAGA CTCGGACAAT 960 

ATGCAATGGA ACA.TACAGGC CGGAGTAGGG ATTTCTTCCC ATTTCGGAGC CAAAGGGAGG 102 0 

TTGTTTTTCT GGCCTAAACT GGATGCTTCG CTTAGTATCT TCCCTTCATG GCGTGTGTAT 108 0 

GCGAAAGCCT TCGGCGGTGT GATTCGAAAT GGTCTCGCCG ATGTTATGCA AGAGGAGATG 114 0 

CCCTACCTGA TGCCCAATAC GATTGTACTC CCTTCGCGCA ATGCTTTGAC CGCCCAATTA 1200 

45 GGGGTGAAGG GGAATATAGC CGATGTGGTA CGTATGGAGG TTTATGGCGA CTTCTCCAAG 1260 

CTGACAGGTG TGCCTTTCTA TACTCCGACT CTACCCTTAT ATAATCCATC CGACTTGTAT 1320 

CAGTATA-ATG TGAGTTTCTT GCCGATATAT GCCGACGGCA GCCGCTGGCG CGCAGGTGGT 1380 

AAGCTGGAAT ACTCTTATCG CGATATGCTC CGCTTTCTGG TAGACGCATC CTATGGCAAG 144 0 

TGGA-Z-.TTTGG ATGGAGGACT TGTCGCCTCC ATGCAGCCCG ATCTTATATT GAAGGCAGAA 1500 

50 GTAGGTGTTC ATCCCATTGC CCCATTGGAT GTCAGACTCC GGTATACACA GCTGAACGGA 1560 

CGGTATCGGT ATTCTTTCGG CTCGGCTGGC TCGGAAGCCT TGGGTATCGG TAATGTACAT 1620 

CTTCTTAGTG CGGATGTTTC ATACAAGCTG AAAAAGAACT TGAGCCTTTA TCTCAAAATC 1680 

GATAATATGC TGGCGGAAAC GACAGAGCTT ATCGGTTATT ATCCTATGCA GCCGTTCCAT 17 4 0 

TGTTTCGCCG GTTTTAGCTG GACTTTC 17 67 



(2) INFORMATION FOR SEQ ID NO: 208 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1038 base pairs 

(B| TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

65 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
iiv) ANTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



(i;<} FEATURE: 
75 (A) NAME/KEY: misc feature 
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(B ) LOCATION 1 . , . 1038 



(xl) SEQUENCE DESCRIPTIOIJ : S] 

5 ATGATGGAAA A.ATGTATCTT TGCTCACTAT 
CATTTCGGTA TCATTTTGGG ATTTCTTTCT 
AAGCAGGTGT TTCATTTTCT GAACCTTCCG 
AAAGCTATCA CCATCGTAGA CGACAATCCC 
GGATATGAAT CCGGTGGCCG CGCCTTTCTT 

10 ATGGGCAATG CCTGTTATGC CTCGTCCGTC 
CGTTTCCTGA ACTACGGGTC TATGCAAGGA 
TTTAGTGCTT CGGATATAGC TGTACAAGGA 
CGCGGTGGAG TCAGCCTAAA AGCATTGTAT 
CTTGGTGTGG ATGTCGGTAT CAGTTATTAC 

15 CTGTTCAAGA ACGTAGGGGC GCAACTGAAA 
TGGGATTTCC AGCTCGGCTT TTCCCGCAGT 
ACGTTGTTCA ATCTGAATCC GCACTATTTC 
ATGCAAAAGT TCCTCCGACA CTTCTCGATA 
TGGGTCGGGC TGGGATATAC GCCACAGATT 

20 AAATGGGGAG GTCTTTCGGC CGGCGTCGGT 
TCTGCTGCCA CCTATCATCC TGCAGCTCTT 
GACGATAAGA GCATCTTC 



:Q id 110:208 

CCACATAACC TTGTGTTCAT GATTCGCAAG 60 

CTTGTGTTTT CGGCAGGTGC TC7\ACAAGAG 12 0 

GCTACTGCAC AGGCTTTGGC TGCCGGAGGC 180 

GGACTGGCTT TTGAGAATCC GGCTCTGCTC 24 0 

TCCTATTTAT ATTATATGAG TGGTTCGCAT 300 

GGAGAGCGTG GCATGTGGGG TGTTGGCATG 360 

TACGATCAGA ATGCGATTGC CACCGGCTCT 4 20 

TTTTACAGCC ATGAACTGAG CAACCACTTC 4 80 

TCTTCTATCG AGACGTATAG TTCCTTTGGC 54 0 

GACGATGACA AAGGATATTC CGCTTCCGCT 600 

GGCTATAATG AAGAACGGGA ACCGCTCGAT 660 

TTTATCAATG CTCCGTTTCG CTTGCACATC 720 

AAGCGTCTTG TACCACGCGA TCTGTCCAAG 7 80 

GGAGCAGAAT TTACTCCTTC CGAGAGGTTT 84 0 

GCACAGGATT TCGAGGTGGA AGGCGGCAAC 900 

TTCACTTCAG GTGTAGTACG TGTAGGCGTA 9 60 

TCGTTCATGT GTTCGGTAGG TATCCGTTTG 1020 

1038 



25 (2) IlIFORMATIOM PGR SEQ ID HO: 209 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LEHGTH: 1035 base pairs 

(B ) TYPE : nucleic acid 
30 (C) STRAIIDEDMESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

35 (iii) HYPOTHETICAL: MO 

(iv) AMTI-SEMSE: HO 

(vi) ORIGINAL SOURCE: 
40 (A) ORG AM ISM: PORYPHYROMOIJAS GIMGIVALIS 

(ix) FEATURE: 

(A) UAIIE/KEY: misc_feature 

(B) LOCATION 1...1035 



45 



(xi) SEQUEMCE DESCRIPTIOM: SEQ ID MO: 2 09 



ATGGAAAAAT GTATCTTTGC TCACTATCCA CATAACCTTG TGTTCATGAT TCGCAAGCAT 6 0 

TTCGGTATCA TTTTGGGATT TCTTTCTCTT GTGTTTTCGG CAGGTGCTCA ACAAGAGAAG 120 

50 CAGGTGTTTC ATTTTCTGAA CCTTCCGGCT ACTGCACAGG CTTTGGCTGC CGGAGGCAAA 180 

GCTATCACCA TCGTAGACGA CAATCCCGGA CTGGCTTTTG AGAATCCGGC TCTGCTCGGA 24 0 

TATGAATCCG GTGGCCGCGC CTTTCTTTCC TATTTATATT ATATGAGTGG TTCGCATATG 300 

GGCAATGCCT GTTATGCCTC GTCCGTCGGA GAGCGTGGCA TGTGGGGTGT TGGCATGCGT 360 

TTCCTG/U\CT ACGGGTCTAT GCAAGGATAC GATCAGAATG CGATTGCCAC CGGCTCTTTT 4 20 

55 AGTGCTTCGG ATATAGCTGT ACAAGGATTT TACAGCCATG AACTGAGCAA CCACTTCCGC 4 80 

GGTGGAGTCA GCCTAAAAGC ATTGTATTCT TCTATCGAGA CGTATAGTTC CTTTGGCCTT 54 0 

GGTGTGGATG TCGGTATCAG TTATTACGAC GATGACAAAG GATATTCCGC TTCCGCTCTG 600 

TTCAAGAACG TAGGGGCGCA ACTGAAAGGC TATAATGAAG AACGGGAACC GCTCGATTGG 660 

GATTTCCAGC TCGGCTTTTC CCGCAGTTTT ATCAATGCTC CGTTTCGCTT GCACATCACG 7 20 

60 TTGTTCAATC TGAATCCGCA CTATTTCAAG CGTCTTGTAC CACGCGATCT GTCCAAGATG 780 

CAA/^AGTTCC TCCGACACTT CTCGATAGGA GCAGAATTTA CTCCTTCCGA GAGGTTTTGG 84 0 

GTCGGGCTGG GATATACGCC ACAGATTGCA CAGGATTTCG AGGTGGAAGG CGGCAACAAA 900 

TGGGGAGGTC TTTCGGCCGG CGTCGGTTTC ACTTCAGGTG TAGTACGTGT AGGCGTATCT 960 

GCTGCCACCT ATCATCCTGC AGCTCTTTCG TTCATGTGTT CGGTAGGTAT CCGTTTGGAC 1020 

65 GATAAGAGCA TCTTC 1035 



(2) lilFORriATIOIl FOR SEQ ID MO: 210 

70 (i) SEQUEMCE CHARACTERISTICS: 

(A) LEMGTH: 990 base pairs 

(B) TYPE: nucleic acid 
to STRAMDEDMESS: double 
(D) TOPOLOGY: circular 
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(ii) MOLECULE TYPE: DNA {genomic) 

( i i i ) HY POTHETI CAL : NO 

5 (iv) AHTI-SEHSE: NO 

(vi) ORIGIIIAL SOURCE: 

(A) ORGAHISn: PORYPHYROI 10NAS GIMGIVALIS 

10 (i>:) FEATURE: 

(A) MAJIE/KEY: misc_f eature 

(B) LOCATION 1 , . . 990 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210 



ATGATTCGCA AGCATTTCGG TATCATTTTG GGATTTCTTT CTCTTGTGTT TTCGGCAGGT 60 

GCTCAACAAG AGAAGCAGGT GTTTCATTTT CTGAACCTTC CGGCTACTGC ACAGGCTTTG 12 0 

GCTGCCGGAG GCAAAGCTAT CACCATCGTA GACGACAATC CCGGACTGGC TTTTGAGAAT 180 

CCGGCTCTGC TCGGATATGA ATCCGGTGGC CGCGCCTTTC TTTCCTATTT ATATTATATG 2^3 0 

20 AGTGGTTCGC ATATGGGCAA TGCCTGTTAT GCCTCGTCCG TCGGAGAGCG TGGCATGTGG 300 

GGTGTTGGCA TGCGTTTCCT GAACTACGGG TCTATGCAAG GATACGATCA GAATGCGATT 360 

GCCACCGGCT CTTTTAGTGC TTCGGATATA GCTGTACAAG GATTTTACAG CCATGAACTG 4 20 

AGCAACCACT TCCGCGGTGG AGTCAGCCTA AAAGCATTGT ATTCTTCTAT CGAGACGTAT 4 SO 

AGTTCCTTTG GCCTTGGTGT GGATGTCGGT ATCAGTTATT ACGACGATGA CAAAGGATAT 54 0 

25 TCCGCTTCCG CTCTGTTCAA GAACGThGGG GCGCAACTGA AAGGCTATAA TGAAGAACGG 600 

GAACCGCTCG ATTGGGATTT CCAGCTCGGC TTTTCCCGCA GTTTTATCAA TGCTCCGTTT 660 

CGCTTGCACA TCACGTTGTT CAATCTGAAT CCGCACTATT TCAAGCGTCT TGTACCACGC 7 20 

GATCTGTCCA AGATGCAAAA GTTCCTCCGA CACTTCTCGA TAGGAGCAGA ATTTACTCCT 7 80 

TCCGAGAGGT TTTGGGTCGG GCTGGGATAT ACGCCACAGA TTGCACAGGA TTTCGAGGTG 8^0 

30 GAAGGCGGCA ACAAATGGGG AGGTCTTTCG GCCGGCGTCG GTTTCACTTC AGGTGTAGTA 900 

CGTGTAGGCG TATCTGCTGC CACCTATCAT CCTGCAGCTC TTTCGTTCAT GTGTTCGGTA 960 

GGTATCCGTT TGGACGATAA GAGCATCTTC 990 



35 (2) IJIFORMATION FOR SEQ ID NO: 211 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 972 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRAHDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) IIOLECULE TYPE: DMA (genomic) 

45 (iii) HYPOTHETICAL: WO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



55 



(i:<) FEATURE: 

(A) NAIIE/KEY: misc_f ea ture 

(B) LOCATION 1 ... 972 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211 



ATGTGCCTCG AACCCATAAT TGCTCCGATT TCATCCGAGT TGCTCGAGCA GGAGCTGACT 60 

GCCGATCGTT TTCTGCGGAT GACAAACAAA GCCGGCAATG AGATCTATGT TTTTACGGCC 120 

60 GAAGAAGCTC CGCATTGCAT GAAAGAAGTA GGCCGACTGC GAGAAGAAGC CTTTCGGCAT 180 

TATGGCGGAG GTACTGGCAA GGCGATCGAT ATAGACGAGT TCGACACCAT GCCCGGGAGC 2*10 

TACAAACAGC TGATCGTATG GGATCCGCAA AACAAGGCTA TACTCGGAGG CTACCGCTTT 300 

ATCTATGGGC GGGACGTTGC TTTCGATACC GATGGCAAGC CTTTGCTGGC AACGGCAGAG 360 

ATGTTTCGCT TCAGTGATGC TTTTTTGCAC GATTATCTCC CCTACACAGT CGAATTGGGA ^20 

65 CGTTCGTTCG TGTCGCTCCA GTACCAATCG ACACGGATGG GCACAAAGGC CATTTTTGTG ^80 

CTGGACAATC TTTGGGACGG TATCGGAGCA CTCACTGTAG TCAATCCAGA GGCACTCTAT 540 

TTCTATGGCA AGGTGACCAT GTACAAAGAC TATGATCGGC GAGCTCGCAA TCTGATCCTG 600 

TATTTTCTTC GCAAGCACTT CTCCGATCCG GAAGGCTTGG TCAAGCCTAT TCATCCCCTA 660 

CCGATAGAGA TCAGTGCGGA GGACGAAGCC TTGTTCTCCT CATCCGACTT TGACACCAAT 7 20 

70 TACAAGACTC TCAATATAGA AGTGCGCAAG CTGGGTATCA ATATCCCTCC TCTCGTGAGT 7 80 

GCATATATAG CTTTGTCTCC GGAGATGCGT GTTTTCGGCA CTGCAGTGAA TGAGTCTTTC B40 

GGAGAGGTGG AGGAAACCGG CATATTCATT GCTGTGGGTA AGATCCTGGA AGAGAAAAAA 900 

CAACGGCACA TAGAGAGCTT CATCCTCAGC CGGAACGAAA AAAAAGGTCT CGACAGTAGC 960 

AATGGCCGAT CA 979 

75 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1J_> 



wo 99/29870 
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(2) JIf FORMAT I OM FOR SEQ ID MO: 212 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LEIIGTH: 16^1 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: circular 

10 (ii) nOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) AIITI-SEMSE: NO 



15 



(vi) ORIGINAL SOURCE: 

(A) ORGANISII: PORYPHYROMOI IAS GIIJGIVALIS 



(i::) FEATURE: 
20 (A) NAl-lE/KEY: mi scf ea t u re 

{ B) LOCATION X . . .1641 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212 

25 ATGAAAACCA TTGTAAGATA CAGCCGCCTT CCGGTCGCTC TCTTCTTTTG CCTTTTGGGA 60 

GCTGTGCACT TGTCTGTCGA GGCGCAGATG CTCAATACTC CTTTCGAGCT GTCGGATCAG 120 

ATCGTCCTGT CTCCCACCGA AAGGCAATAC AGGGAGATTT GTGTGCAAAC GTWKGAAAAA 180 

AGGGGGGCCG ATCTTTTCCC GTTGAGCGAT AAGCTGCGCG ATTCGGCCTA TGTTCGTTTC 24 0 

GGCTCGGCCT ATGGCGATAT TGCGGGCGAC TATCTTCCGT ACAACGGCAA TAACTACTCC 300 

30 TCGCTCTCGC TCGAATCGGG TGGTCGCATC AGTGTCCGTA ACTATGGCAC ATTGCAGGGC 360 

AGTGCTTCCT ACTCACGTGG CATGCACAAA CGCATCGGCT GGAATGCTCT GCGCAACGCC 4 20 

GAAGCCTACT ATCCCTATTT GGTGTCCGAT TCGACCGGCG GAGACTATCA TTTCGAAGAC ABO 

TATCGGCTTG CCGGCTACTA TTCTTTTCGC GCCGGCCGCT TGCCCCTCGG TATAGGCTTC 54 0 

TCATACAGGG GCGAAGTTGC TTATCGGCTG ACCGATCCGC GTACGACCAA TACGACCGGT 600 

35 GCATTGGAGC TTTCTTGTGC TACCTCTTTG ACGCTGCCTC GAGAGAACAG GCTATCGCTT 660 

TCGGCTGCGT ATCTCTATCA TAGACAACAC CTCACACAGT ACAACTGGCG TCCCGGGCAG 72 0 

CAGGACAAAT TCTTCGTCAG CTACGGTTTC GGTCAGGTGG ATGTCAGCAA CAGCCCTATC 7 80 

TGGTTCGGTA TCTCCAGAAT GAACTACGTC AACGGATGGA AGCTTAGCTC CCGTCTGGAT 84 0 

ACCCGTAGGG GCGATGCCAT CGGTCTCGAC TACAGCGGCT ACTTCCTCGA TACCGAaGAG 900 

40 AGGTCGTCCA TCAATCTCTT TGCTTTGCTT TACAATCGCC TGCGACTCTA TGGTAGCTGG 960 

CATCTGTCGG ACTTCGATTT TTCATTTTCA GCCGACTATG CTCTGCGCCA AGGGATAGAG 1020 

CGGATATACG AAGACTACAA GCCGGATGAT AATTATCATA TCTACGACCT CCGTATCTTG 1080 

GCCATTCGCC GCTGGTATAT GCTCAATGAG TTTTCTGCCC AAGCCC7\AGC CTCCTACCGT 114 0 

ATTCGCACGG ATAGAGGTTG TGCCCTGAGA GTGAGTGCCG GTAGTGATTT CTACGGCTAT 1200 

45 GATGAGACGT ATCGCAAGCA TGGACATCAT ACCATGAGCG GAATGCTACG TCCTTTTGCC 1260 

GGTATAGCCT ATGACCATGC CGGATCCAAA TTGGATTTTG GACTTTCGCT TTCGGCTGCT 1320 

TATCGAATGG TGCTGACGCA TTCGTATAAG ATTCGTACCA TCCAGAAAGA GCAGCTCGAC 136 0 

TATCAGCTGG CCTATTTGCC CTATGCCTAT CGTAATAGAG AAGGCGTGGA GGTGCGTTCC 144 0 

TCTCTGTACG TCTCGATTCC GATGCAGAAT ACCCACCGCC TGATGACAGA GCTGCGGTTG 1500 

50 TATGGCGACC TGATGAAAAG AAAGGACGGT ATAGCCTATG GCAAAACGCC CGGTGTCATC 1560 

TCACATATCC TGTCCGATCC GCAAGCCGAA CGAACGTCCG GCCATACCAT CGGGGCTATC 1620 

TGCAATATCT CCTACCTCTT C 1641 

55 (2) INFORMATION FOR SEQ ID HO: 213 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2250 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRAHDEDHESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

65 (iii) HYPOTHETICAL: !JO 

(iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 
70 (A) ORGANISM: PORYPHYROMONAS GIIJGIVALIS 

(ix) FEATURE: 

(A) NAIIE/KEY: misc_feature 

(E) LOCATION 1 ... 2250 

75 
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(xi) SEQUEHCE DESCRIPTION: SI 

ATGAAAAAAC TTCACATGAT TGCCGCCTTA 
GCACCCGTCT CCAACAGCGA GATAGATAGT 
5 GTCGTAGCTA CTCGCGCCAC GGCGAAAACC 
GAACTTTCCA AGTCCAATTA TGGTCGTGAC 
GTGGTAGCCA CCAGCGATGC CGGTACGGGT 
ACCGATGCCA ATCGCATCAA CATAACTACC 
CAGTCCGTCT TTTGGGTGAA TATGCCCGAC 

10 CAGCGAGGTG TGGGTACTTC CACCAATGGT 
CGTACGGATA ATTTGGGACT GGCTCCTTAT 
GGCACATTCC GCCGATCGGT CAAACTCGGT 
GATGCCCGCC TGTCCA7\AAT CGGTTCGGAC 
AAATCCTATT TCGCACAGGT GGGCTATTTC 

15 TTCGGAGGAA AAGAAGTTAC GGGTATCGCA 
AAATATGGCC GCCGATACAA CAGTGCCGGT 
CACTACTACC ACAATACCGA CAATTACGAG 
AGCTTCTCTC CTTCCGTTAT CCTCAACCTC 
ACGGACGAAT ATCGTACCGG ACGT7\AACTA 

20 AACAGTGTGA CCGTGAAGAA AACGGATCTC 
GGAGGACTCA TCGGTTCGCT TAACTGGCAC 
TCGGGCAATA TCTATAAAGG AGACCACTTC 
CAGCCCTTAG CTCCCGACTT CGAATATTAT 
GCCTTTGCCA 7\AGCC7*J\CTG GCAGATCACT 

25 TATCGTACCA TCGGCTACAC GATAAACGGC 

AGTATGCAGC ACATCGATTT GGACAAGACC 
ACCTATAGTT TCGACG AT GC TCATACTGCC 
CCTAACAGAA CCAATTACAC CGAAGCCGGA 
ATCGACTATG AGCTGGGCTA CCGCTATGCT 

30 TATTATATGC AATACAAGGA CCAACTCGTG 
ATGCTCACAA GCAACGTCCC CGACAGCTAC 
CAGATCCTTC CTCGTTTGCT GCGTTGGGAT 
GACCGCTACG TACAATATAC ATCCGTATAT 
GAGGAGACCC TCGAAAGCAC GG AT AT AGCC 

35 CTTACCCTCT CTCATGCCGG TTTCGAAATG 
TATCTGGACA ATACACAGCG CAGCGATCGC 
CGCCTCGGCT ATGTGCTGCC GGTTCACTTC 
AATAATCTCT TCAACCTCAT GTATGCGTCC 
CAGGCATCCG GAGAACTAAG TGCATATGCC 

40 AATGCACTGG GTAGTCTGAC AATCGATTTC 



:Q ID HO: 213 

GCCGTCCTGC CTTTCTGCCT GACGGCACAA 60 

CTTAGCAATG TGCAGCTCCA GACCGTACAG 120 

CCTGTCGCTT ACACCAACGT TCGCAAGGCC 180 

ATCCCCTATC TGCTGATGCT GACTCCCTCC 24 0 

ATCGGATATT CCGGCTTTCG CGTGCGTGGC 300 

AATGGAGTAC CCCTC7VACGA CTCCGAATCT 360 

TTCGCCTCTT CCATCGAAGA CCTTCAGGTG 4 20 

GCCGGAGCTT TTGGGGCAAG TGTCAATATG 4 80 

GGCCGTGTCG ATTTGAGCGG AGGTTCGTTC 54 0 

AGCGGACGCA TCGGTCGCCA TTGGGCAGTG 600 

GGCTACGTGG ATAGAGGAAG CGTGGATCTG 660 

GGTAGCAACA CGGCTCTCAG GTTCATCACT 7 20 

TGGAACGGTC TTTCCAAGGA GGATGAAGCC 7 80 

CTTATGTACG TGGACGCGCA AGGAGTACCG 84 0 

CAGCGTCACT ACCATGCCAT CATGACGCAC 900 

ACGGCACACT ACACGGCCGG ATATGGCTAT 960 

AAGGAATATG CACTGCAGCC CTATGTGGAA 102 0 

ATCCGTCAGA AGTATCTGGA CAATGACTTC 108 0 

ACCGGTGCAT GGGATTTGCA GTTCGGGGCC 114 0 

GGCCGTATCA CTTACATCAA AAAGTAC7U\T 12 00 

CGGAACAGGG CAGACAAAAG AGAAGGTGCA 1260 

CCGGAACTGA ACATGTATGC CGACCTCCAG 1320 

ATCACGGACG AATATGATGA GGTACAGGGA 1380 

TTCCGCTTCC TCAATCCGAA GGCCGGTCTT 14 40 

TATGCTTCTG TTGCGGTAGC ACACCGCGAG 1500 

ATAGGACAGT ATCCTACGCC TGAGCGACTG 1560 

TCGCCCCTCT TGTCGGCCGG AGTAGGTCTC 1620 

CTGGATGGCC GTTTGAGCGA TGTGGGACAG 1680 

CGTATGGGAC TGGAGCTGAC TCTCGGTTGG 174 0 

GCTTCTTTCA CTATGAGTCG CAACAA7\ATC 18 00 

GATGCGGACT ACAACTGGCT CGAACTCAAG 1860 

TACTCGCCCA ATGTCATTGC CGGCAGCATG 192 0 

GCTTGGACGA GCCGCTTCGT CAGCAAGCAA 198 0 

ATGCTTTCCT CCTATTGGGT GAACGACCTC 2 04 0 

GTTAAGAGAG TGGCACTGGG CGTACAGCTC 2100 

AATGCCTACA TCTACGATGC CGGTTACGTA 2160 

GATCTGCGTT ATTATCCTCA GGCCGGATTT 22 2 0 

2250 



(2) II I FORMAT lOM FOR SEQ ID NO: 21 4 

45 (i) SEQUEtlCE CHARACTERISTICS: 

(A) LENGTH: 14 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

50 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: HO 

55 (iv) ANTI- SENSE: NO 

ivi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVAL IS 

60 {i>0 FEATURE: 

(A) HAJiE/KEY: mi sc_f ea t u re 

(B) LOCATION 1 ... 1482 



65 



(xi) SEQUErJCE DESCRIPTION: SEQ ID IJO:214 



ATGAAAAGAA GGTTTCTATC GCTGTTACTG CTGTACATAC TCTCTTCCAT CAGCCTTTCT 60 

GCTCAGCGGT TTCCGATGGT GCAGGGAATC GAGTTGGATA CCGATTCGCT TTTCTCTCTG 120 

CCCAAGCGTC CTTGGCGCGC CATCGGTAAA ACGATAGGCG TCAATCTGGC CGTATGGGGC 180 

TTCGATCATT TCATCATGAA CGAGGACTTT GCAGACATCA GTTGGCAGAC TATCAAGAGC 24 0 

70 AA.TTTCCAAA CAGGCTTTGG CTGGGACAAT GACAAGTTTG TCACCAACCT CTTCGCACAT 300 

CCTTATCACG GATCGCTCTA TTTCAATGCA GCGAGGTCGA ACGGTTTGAG CTTCAGGCAC 360 

TCTGCTCCGT TTGCCTTCTT TGGCAGTCTC ATGTGGGAGC TGCTTATGGA AAACGAGCCA 4 20 

CCGAGTATCA ACGACCTCTG TGCCACCACC ATAGGCGGTA TAGCTTTGGG GGAGATGGGG 4 80 

CACAGGCTGT CGGACCTGCT CATCGACT^AT CGTACCACAG GGTGGGAACG TATGGGGCGC 54 0 

75 GAGGTGGCTA TCGCTCTGAT CAATCCGATG CGCTTTCTCA ACCGTCTGAC AGCAGGAGAG 600 
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10 



15 



GTGACTTCTG 
GTCGATGCCG 
CTGACCCTGA 
TACGATTTCT 
CAGATCAATC 
TTGGTGGGAG 
AATTCGGAGG 
GGTCTTATCT 
ACCTACCTGA 
CGGGACTATA 
GATCTGTGGA 
GAAGAGCCGC 
AAGGCGCGCC 
GCCATCGTCG 
TTCGATACCG 



TCGGGAGTCG 
GCTTTCGCTT 
ATCTGAGATT 
TCCAATTCAA 
TGATCGGAAT 
GTCTCTTTCA 
AGGTACTCGT 
TCCAGCACCA 
ATGTCGTCCC 
ACCTCGGCAG 
GCTGGCTCTT 
ACCAGAAAAA 
TACTGGTGAC 
CTCGCCGTTT 
GCGACATACA 



CAGCGGACAG 
TTTGGCAGAC 
CGACTACGGC 
AGCCGGATTG 
CCTAAGCGGA 
GCACTTCGAC 
CACCCCATAC 
CGGAAAATTT 
GATGGGAGCC 
CGGATTGAGC 
GGGAGTCGAA 
TACCGATGTC 
GAGTTCCGAG 
CATCCGCAAA 
GCTGCGTGTC 



ATATTTCAGT 
AAGCGGCATG 
GATCCATTCC 
AGTTTCTCCG 
TGCCAACTGC 
TACTACAATT 
CGTATCTCGC 
CGACGACGTC 
AGTCTGTCGG 
GGCAAGCTAT 
AGCTATCGGC 
AGCTCTTTTA 
TTCGCATTTC 
ACAGCCTATC 
GGATTTCACT 



CTGTCCCCAT 
CCCGAACCGG 
GAAGCGAGAC 
AATCGCAACC 
TCGCACACGA 
CGGAAAAACG 
AAGTGGCAGC 
CTCTGGAGCT 
ATCACTACAA 
ACCTTGGTGC 
TCTACACATG 
TGGTGCAGGG 
ATCCTGGCCC 
AATTCTACCC 
TC 



AA^CATAGTC 
TGCCACGGCT 
TTTCTCTCCA 
TCTGCTGAGC 
ACGAACGGTT 
AAT7\AGCAAA 
TCTGGGAGGC 
ATATGCCGAG 
CGTGGACAAT 
TACGTACAAT 
GATCGGGTAT 
GGACGAAAGC 
CTGGCATGTA 
TAACGTATCA 



660 
720 
780 

900 
960 
1020 
1080 
1 1 '] 0 
1200 
1260 
1320 
1380 
I'l^O 
1 ^82 



(2) INFORMATION FOR SEQ ID MO: 215 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 882 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

25 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
30 { iv) ANTI-SENSE: NO 

(vi) ORIGIimL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVAL IS 

35 (ix) FEATURE: 

{A} NAI^IE/KEY: mi£c_f eature 

( B) LOCATION 1 . . . 882 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215 



ATGAAACGAC TGATTGTTTT TCTGGCAATG GGTGGCTTGC TGTTCACCCT TGCGAACGCA 60 

CAAGAAGCAA ACACTGCATC TGACACTCCC AAAAAGGACT GGACTATAAA AGGTGTGACC 120 

GGACTAAATG CCTCTCAGAC TTCTCTGACC AACTGGGCTG CCGGTGGAGA AAACACGGTG 180 

GCAGGTAACC TCTATTTGAA CATAGATGCC AACTACCTGA AAGATAAATG GAGTTGGGAC 24 0 

45 AACGGTTTGC GTACAGACTT CGGTCTGACC TACACAACAG CCAACAAGTG GAACAAAAGT 300 

GTAGACAAGA TCGAACTCTT CACGAAGGCC GGCTATGAGA TCGGCAAACA TTGGTACGGA 360 

AGTGCGCTTT TCACTTTCCT CTCACAGTAT GCCAAAGGAT ATGAGAAGCC CTCGGATCAC 4 20 

TTGACAGGAG TCAAGCATAT CTCTAATTTC TTCGCTCCTG CATATCTCAC TCTCGGTATT ^80 

GGTGCGGACT ATAAGCCCAA TGAGAAGTTC TCTCTCTACC TCTCTCCTAC AACGGGCAAG 54 0 

50 CTGACTGTAG TAGCAGACGA CTACCTCTCA AGTTTGGGAG CCTTCGGGGT GAAAGTTGGT 600 

GAAAAGACAA TGTTCGAACT TGGTGCTTTG GTAGTGGGTT CGGCCAATAT AAATCTGATG 660 

GAGAATGTCA ATTTGATAAC CAAGGCTTCA TTCTTCTCGG CTTATACGCA CGACTTTGGC 7 20 

AACATTGACA TCAATTGGGA GGCTATGCTG GCCATGAAGA TCAACAAGTT CCTCACGGCT 7 80 

ACGATAGCCA CCAATCTTAT CTACGACGAT GATGTGAAGA TCAACGATGG CCCGAAAATC 84 0 

55 CAGTTCAAAG AAGTTGTGGG CGTGGGTGTT GCGTACACTT TC 8 82 



(2) INFORMATION FOR SEQ ID NO: 216 

60 (i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 612 base pairs 

(B) TYPE: nuclsic acid 

(C) STRANDEDNESS: double 
_ (D) TOPOLOGY: circular 

65 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

70 (iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAIIISM: PORY PHYROMONAS GINGIVALIS 

75 (i-) FEATURE: 
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(A) NAIIE/KEV: misc_feature 
{ B) LOCATION 1 ... 612 

(xi) SEQUENCE DESCRIPTIOM: SEQ ID tlO:216 

5 

ATGAAGAAAA TGATTTTGGC AGCTACTATG CTGCTCGCAA CAATCGGTTT TGCAAATGCT 60 

CAGAGTCGTC CTGCTCTTAG ACTGGATGCT AACTTTGTCG GTAGTAACTT AATGCAAAAA 120 

GTCGCAAACA CGAGCGTGAA CAATAAGATG ATCGTAGGCT TACGTGTTGG TGCTGCTGCT 180 

GAGTTCGCTC TTAGCAATGA TGGATTCTAT CTCGCCCCCG GATTGGCCTA TACGATGAGA 2 40 

10 GGTGCTAAGA TGGAATCACT AAGTGAAACG ACAACTCGCT TGCATTATCT GCA7VATACCG 300 

GTGAATGCCG GTATGAGATT TAGCTTTGCT GACAACATGG CTATTTCATT GGAAGCAGGT 360 

CCCTATTTCG CATATGGTGT CGCCGGAACG ATTAAGACTA AAGTTGCAGG CGTTACGGCT 4 20 

TCTGTAGATG CCTTTGGTGA TAACGGATAT AACCGTTTCG ACTTGGGCTT GGGCTTGTCT 4 80 

GCTGCCTTGA GCTACGACCG TTATTACGTA CAAATTGGAT ATGAGCATGG ATTGCTTAAT 54 0 

15 ATGTTGAAGG ATGCTCCGGA TAAGACTTCT TTGCGTAATC ATGACTTCTT TGTGGGTCTC 600 

GGTGTTCGCT TC 612 

(2) IliFORMATIOH FOR SEQ ID MO: 217 



(i) SEQUEJICE CHARACTERISTICS: 

(A) LENGTH: 729 ba^e pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDMESS: double 
25 (D) TOPOLOGY: circular 



30 



35 



55 



60 



70 



75 



(ii) MOLECULE TYPE: DIJA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

{ A ) ORGAJ J ISM : PORY PHYROI-IONAS GI NGI VALI S 



(ix) FEATURE: 

(A) NAllE/KEY! misc_feature 

(B) LOCATION 1 . . .72 9 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217 

ATGAAAAGGA TTTTTACTGT AGCCCTTGTG CTACTTGCTT CGGTCACTAT GGCCATCGGA 60 

CAAAGCCGCC CGGCACTTCG CGTAGATGCC AACTTCGTAG GCAGCAATCA GAGCATGAAA 120 

AGAGACGGAT ATGTGTGGGA CACCAAAATG AATGTCGGCC TGCGGGTCGG TGCCGCTGCC 180 

45 GAATTCATGA TCGGATCAAG AGGATTCTAC TTGGCTCCGG GTCTGAACTA TACGATGAAG 24 0 

GGCTCCAAAA CCGAATGGGA TATACCCGAA ATGGTTCCTG GTACCTATAT TACGATGGTT 300 

TCCACTCGCT TGCACTATCT GCAACTGCCG ATCAATGCCG GCATGCGGTT CGACCTGATG 360 

AATGACATGG CGGTTTCGAT CGAAGCGGGT CCTTTCCTTG CATACGGTAT ATATGGTACA 4 20 

TATCGGCAGA AGTTGGAAGG ATGGAAGCCG AACAACTACA GCACAGAGTT TTTTGGCCCA 4 80 

5U ACGCTTGGTG GCCCAACAAA TATCCGCTGG GACATCGGGG CAAACATAAT AGCCGCATTC 54 0 

CACTATAAGC GTTATTATAT ACAGATAGGC TATGAACATG GATTTGTGGA TATTGTGTCA 600 

GGTGGAGGTT CTGATATTCC CCGACTGAAC GACAATAGGC AATCCTCTTC GACGACCGCT 660 

CTAAGAGAAA AGGGAAATAA CGAATACGCT TATAATCGTG ACTTCTTCGT GGGCATAGGT 7 20 
TACCGCTTT 



{2} IHFORi'IATION FOR SEQ ID NO: 218 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

65 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANT I -SENSE: MO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROHONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAItE/KEY: miser feature 



729 
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(B) LOCATION 1 . , . 621 



(xi) SEQUEI/CE DESCRI PTIOl I : SEQ ID IIO:218 



10 



15 



ATGAAAAGAA 
GCACAAGACG 
AA.CATCCCGG 
GAAGTGGGGC 
GCATCGAACG 
TCATTCTTCG 
CATCTGTTTT 
AATAAGCACA 
AATTCGAATA 
AACGTGGGGG 
GCCACGCTCT 



TGCTGCTGCT 
TTATCAGACC 
GAGGATTCAC 
TGTCCATGTA 
AAGGAGACGG 
ATGCAGGCAG 
GGCAGAATAG 
ATATTCATTT 
CCTACATCGG 
CGGGTGCCGC 
CCACTCATTT 



TCTCGTTGTA 
ATGGTCATTG 
CTATGGTTTC 
CAACTCCACA 
ATCTTTTCAG 
TGCCAACTGC 
CCGGCACT^AT 
CATCTATGGA 
TTACGGAGCA 
TGTAATGTAC 
T 



TTATTATATG 
CAGGTCGGAG 
TATTTGGGAA 
CGTCAAACAG 
GTAAATATGT 
TATATGATCG 
TTGTTTCTGG 
GACAAGGGAG 
CGTGTAGCCT 
GACCACGGCA 



GAATTGCAGG 
CGGGATACTC 
AGCGTATGGG 
CCAACAATGC 
CTTCTCCGAA 
TCGTCGGAGT 
CAGTACAAGC 
CCAAAGTCAG 
ACGAATATCA 
ATAAGATGCT 



CCGATTGGCT 
CGATACGGAG 
GAGCTTTCTG 
AGACTCCTTT 
TGAGAAGTGG 
CAATCCTCTC 
CGGCCTGTCC 
TATCTACACC 
AATTCATAAA 
TACGGCCATG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
621 



(2) INFORMATIOH FOR SEQ ID NO: 21 9 

20 {i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2853 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDIIESS : double 

(D) TOPOLOGY: circular 

25 

(:^i) nOLECULE TYPE: DMA {genomic} 
(iii) HYPOTHETICAL: NO 
30 {iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGAJIIStI: PORYPHYROMONAS GINGIVALIS 

35 (ix) FEATURE: 

(A} NAJIE/KEY: misc^f eature 

(B) LOCATION 1...2853 



(xi) SEQUENCE DESCRIPTION: SI 

40 

ATGAGAGTAT CCGATCTCTG TTCCAGACTT 
TTGCTCTGTG CTACTTTGGT CGCTGCGGAA 
CACCGTCGGC ATGCTGCGCT GTCTGATTCT 
CCTATTCCTG ACAGTGCTTT TCGAGATTCC 

45 CAAGATAGCG TGTATGACGA TGAATTCGAA 
GATTCCATCG TTTTGCTCGG ACAGAATCGT 
TATCAAA.AGA GTCGCTTGGA GGCAAACTTC 
TATACTCGCT ATGTCCTCGA TACGGCCGGT 
GGAGAGCAGT CGTTCGAAGC CAAGAACTTT 

50 ATCAGCGGAG TGATCACGCA GCAGGGCGAA 
ATGCCCGACA ATATCATGTT TATGCAAGGA 
CATCCTCACT TCTATATCAA TCTTTCCAAG 
ACAGGTCCGG TCAATCTGGT TATCGCCGAT 
TATTTTCCCT TTTCCAACAA ATACTCTTCC 

55 AATCGCTATG GATTTTATTT GAGGAATGGT 
GATTTGGCAT TGCGTGGGGA GATCTTTTCC 
AAATATAAGA AGAGGTATAA GTACAACGGC 
TCCGGCGACA AATACGTGCC CGGAGACTAC 
ACACACAGTC AGGATCCGAA GGCCAATCCT 

60 GCCACCGGGA GCTATTTCCA GAATTCGCTG 
GCTACGACAC GAAGTTCGGC CGTGAGCTAT 
ATTACGGGTA GCATGGATAT CAGCCAGAAC 
CCGAATCTTT CGATTAATAT GTCCACGCGT 
CCGGAGCGAT GGTACGAGAA GTTGAGTGTG 

65 TTGACAAAAG AGAAAGATTT GCTCCAGAGC 
CGTCATTCCG TACCGATCAG TTTGACTGTC 
GGGGTTAACT ACAATGAGTG GTGGTACACG 
AAGAAAACAT TCCTGCCTTC GGACACGACC 
CTGTCGGCAG GCTTATCTAC CACATTGTAC 

70 TTCGGAGGCA ATCTCATTAT GATCCGTCAT 
ATGCCGGACT TCACGAAACG CCGATATGGC 
AACGGCAAGC TGCATACGCT GCTCTACTCT 
TCCATGGGCA ATGCAGGATC TGTCAATTTC 
AAATCCAAAT CGGATTCGAC AGGGATCAAG 

75 TCTACATCCT ATAATATGTT TGCCGATTCG 



;Q id NO: 219 

TCATGGTTAT TACCCGTAAT CCTTGTCGGA 60 

CGTCCTATGG CCGGAGCAGT CGGATTGCAC 120 

ACAGCGAAAG ACACGGTGCC TCTCGCAAAA 180 

CTTCCTGCCG ATTCCACCGG ATCGATGCGG 24 0 

TTGGAAGATA TAGTGGAGTA CGAAGCTGCC 300 

GCCTATCTTT TCGGCAAGAG CTATGTGAGC 360 

ATGTATCTCA ATACCGACAG CAGTACGGTT 4 20 

TATCCGATGG CCTTTCCTGT TTTCAAGGAT 4 80 

ACCTACAACT TCCGCACGGA GAAGGGGATT 54 0 

GGCTATCTGA CTGCCGGTAA GACCAAGAAG 600 

GGGCGTTATA CGACCTGCGA CAATCACGAT 660 

GCAAAGGTGC ATCCGGAGAA AGACATCGTC 7 20 

ATGCCGCTGC CGATAGGTCT TCCTTTCGGC 7 80 

GGTATATTGA TGCCCACGTA CGGAGAGGAC 84 0 

GGATATTATT TTGCCTTCAG CG ACT AT ATC 900 

AAAGGGTCAT GGGGCATTTC AGCCCAATCG 960 

TCGTTCGAAG CCAATTATCT GGTATCGAAG 1020 

AGCAAGACCA CCAGTCTGAA TATCCGATGG 1080 

TTGCAAACGT TGTCGGCCAA TGTCAATTTT 114 0 

AATACCACCT ATGATGTCAA TGCCCGTACT 12 00 

TCGCGCAAGT TTCCGGGTAC TCCTTTTTCG 12 60 

ATGCGCGATA CGACGGTGAG CCTTACCTTG 1320 

TATCCTTTCA AGCGGAAGAC CCGTGTAGGA 13 80 

GGCTATTCCG GTCAGCTTCG CAATAGTATC 14 40 

AATCTCGTGC GCGATTGGAA GAATGGTATG 1500 

CCTTTGTTGG ATTATATCAA TCTGACTATG 1560 

AAAGGCATAC GGAAGTCGTG GAATGAGGAT 1620 

TATAAATTCC GCAGACTGTA CGATTACAGT 168 0 

GGTATGTTCA AGCCTTGGAA ACCTTTTTCC 17 4 0 

CGCTTCACGC CCACTGTCAG TTTCTCCTAT 18 00 

TTTTGGGAGC TTCTTGAGCA TACGGATCAG 1860 

CCTTATTTCG AGCAGATATT CGGTGCTCCC 1920 

TCTTTTGACA ACAACTTAGA GGCCAAGATC 19 80 

AAGATCAGCC TGATAGATCA GTTCACATGG 204 0 

ATCCGATGGA GCAATATCTC GGCTTCGCTG 2100 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO. 
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10 



15 



GCACTTCGCC 
ACGAAGTATT 
ATTTTTAACG 
CTCAACAAAG 
GATGAAAAGA 
GAGGGAGGAA 
GGAGCAGTGG 
TATAGTTGGA 
TACCGGGTAA 
TTCGGATTCA 
GTCACTCGCG 
AAGTCCTATA 
CAGAGCAATC 



TCTCCAAGAG 
ATGAGGGAGA 
GCAAGGGATT 
AGTCGCTCAG 
AAAACACAGG 
GACCGCAAAA 
ATCAGGATGG 
ATATTGCTAC 
CGCAGAATCT 
ATGCGAACTA 
ACATGCACTG 
ATTTCGTCAT 
GTCCCATCAC 



CTTTACCTTG 
AGATGGGAAG 
GGCACGCCTG 
CGGATTGATA 
GGCTACTCCT 
TGAAAGTGGG 
TTACTTCGCA 
CGACTACAAT 
GAGCTTTCGC 
CAATTTCGAC 
CTGGGCTATC 
ATCGGTGAAG 
GAATACTTGG 



CGCTTGTCCG 
ATCATTCCCT 
ATCAGTACGG 
GCTCTTTTCA 
CATGAAGGAG 
GGGTCGCTCC 
TATTCGATCC 
AGGTACAATG 
GGCAATATCC 
TTGAAGAAAA 
TCGGCCAGTT 
AGTTCACTCT 
TAT 



GTCTGTTCGA 
ATAAGAGCAA 
GTACTTCTTT 
GTGGCAAAAA 
ACGATGCTGC 
TCGAGCGCAA 
CATGGAGCCT 
TCAATAAGAT 
AGCCTACACC 
TAACATCGCT 
TCATCCCTAT 
TGCAGGATCT 



TCCCTATTTG 
CGACCTGCGC 
CAGCTATACG 
GGAGCGGAGA 
CGATATACTT 
CCGTCAGGGC 
GTCCTTCGAC 
GGAGCACTAC 
GAACTGGAGC 
TACCTGCAAC 
AG GAG CAT AC 
GAAGTATCAG 



2160 
2220 
2280 
2340 
2400 
2 4 60 
2520 
2580 
2640 
2700 
2760 
2820 
2853 



(2) IN FORMAT I on FOR SEQ ID NO: 220 



(i) SEQ'JEHCE CHARACTERISTICS: 

(A) LENGTH: 3678 ba^e pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDHESS : double 

(D) TOPOLOGY: circular 



25 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: NO 

30 (v-i) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



(ix) FEATURE: 

(A) NAME/ KEY: mi 5c_ feature 
35 (B) LOCATION 1...3678 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220 



ATGATGAAAC GATATACAAT AATTCTTGCA 

40 TTTCTVAATAA AAGCTCGCCC TTATGAAAGA 
AAACATTCAA TGGATTCTAA ATTGGTGCCT 
ATTGTATACC AATCTGTTTC TGAACATAGT 
AGGCCTGCAA ATCGTTTCCC TTCGCATAGG 
TCTCCCCCCG TAGTTCCCGT TGCCGTCGAC 

45 GATCCTGAAA ATCCCAATGC CTGGGATGTG 
GTACCTGTCG ATGTGGTGAT GGTTATCGAC 
GCCAGATTAA AGTCTGCCAT TGCATCGGGA 
GGGACGGCTA CAGAAGGGGT GCGTATCGCT 
TTATCTGATT TTACCAAAGA CACTGCTTTT 

50 ATTTGGGGAA CACATACCCA GGGGGGGCTT 
ACTGCTGTGG ATAAGCATAT CATATTGATG 
GTTA/U\AATG TAACTACTGC AGACTTCATT 
GATTTGGTTA TACAAGGAGC AATTAATTTC 
ACACCTCTTA CCCCAAATTA TCCAACTCAT 

55 GAATCCAAAT TCGATTATAG TAATCTGAGT 
GCATTGGTCT ATGAACCGAG GTTTCCTCAT 
GCTATCAATG AGGCTCAGTT TGCGAAAAAC 
GACCTGGGAG ATTTTGCCTT GGCCAACAAT 
CACTTCTTTA CGGCGACACC GGCCAATTTA 

60 ATTAATATAG GTATACAGAG GGGGGAGGTG 
AAAAATCTGA CGCAATCGGG AGATGTTACT 
CACTATGATG TCTCTACTAA AAAACTGACA 
GAAGCTACCA TAACTTAT CG TATTTATGCC 
CCGGTAAATA CTACTTCTGC TATCGGCCCG 

65 GCAAAATTGA CCTATACCAA TTCCAATGGC 
CCGACGGTTA AGTTAGGTTA TGGTGTTATT 
GGTCAACCCA TACAGGCAAA TGGAACAGTT 
CAGTCACAAG ATTTCTTTTT GCCCTCAGGT 
TTGGACAAAA CGACCGAAGC ATTACAGTAC 

70 ACTACTGCCG ATGGTAAACG TTATCGTTTT 
GGCCAAATCG GTATCAGTTG GAAAAAACCG 
CTCAATTATT GGATGGGAGG AACAACAGAC 
TGGACAGGAG CCCAAGTACC GCTCACAGGA 
TTCGGTTCTC CGGCGGTAGC CGATTTGCAT 

75 AACCTTATCA ATAATTCCGA CAAGGATTTA 



GTTTTTCTTT TATTCTGCAC GGTATTTACC 60 

TTTGCAGATG TAGAG7\AGCC TTGGATTCAG 120 

GCAAAT7\AGG GTAACTTAAT TCAAGCTGAA 180 

GACTTAGTTA TTTCACCTGT GAACGAAATA 240 

AAGTCTTTTT TTGCAGAAAA TCTACGGGCA 300 

AAGTATGCGG TACCGGTTGC CAATCCAATG 360 

ACGCTAAAAA TCACTACTAA AGCGGTAACA 4 20 

CAGTCTTCGT CAATGGGAGG GCAAAACATT 4 80 

CAGCGTTTTG TGAAAAAAAT GTTGCCTAAG 54 0 

CTTGTGAGTT ATGACCATGA GCCTCATCGC 600 

CTCTGTCAAA AAATCCGGGC TTTGACTCCT 660 

AAAATGGCGA GAAACATTAT GGCCACTTCT 7 20 

TCTGACGGGT TAGCGACGGA GCAGTATCCT 7 80 

GGCAAAACTG GAAATGCGAA TGATCCCATT 84 0 

CCTACAAATT ATGTTTCCAA CAATCCATCT 900 

TCTTCTAAAG TTGGACGGAG AAATCTGCCG 960 

GCAAGGATTA CTTTTGATGG TGTTGCTGGC 1020 

CCCTATTATT ATTATTTCCC TTGTAACGCT 108 0 

TCTGGTTATA CAATCCATAC TATTGGCTAT 114 0 

TCGTTGAT^C TAACCGCTAC AGACGAGAAT 1200 

GCTGCAGCGT TTGATAATAT TGCCCAAACT 12 60 

ACGGACTTTG TAGCTCCTGG TTTCATCGTT 1320 

CATTTGCTAA ATGTTTCAAA TGGAACGGTG 138 0 

TGGACTACTG GTACTATCCT GAGCTCATCA 144 0 

GATTTGGATT ATATACAGAA CAATGATATT 1500 

GATCTTGGTG GATTCGATAC CAATACCGAG 1560 

GAACCGAATC AGCAGTTAAT TTTCCCACGT 162 0 

AAGCGGCACT ATGTATTGGT AAATAAAGAC 1680 

GTCAGTTCCC TAAGCGAGGC TCATGTTCTA 17 4 0 

GGAGGTCATA TTGTTCCCAA ATGGATAAAG 1800 

TATTCCGTAC CGCCGACTAA CACGGTCATC 1860 

GTCGAAGTCC CAGGCTCCAC GCCGAATCCG 1920 

GCAGGAAACG CTTACTTCGC TTACAAGCTC 1980 

CAACAGAGTG AATGGGATGT GACGTCCAAT 204 0 

GAAGATGTAG AGTTTGCAAC GACAGAAAAT 2100 

GTCCCGACAA CCAACCCCAA AATTATCGGT 2160 

GTTGTTACCA CAAGCAGTCA ATTGACGATC 222 0 
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AACGGCGTGG TTGAGGATAA CAATCCGAAT 
GACAATCCTA CGGGGACATT GCTTTTTGCC 
ACCGTCGAGT TTTACAATCA GGGATATGAT 
TGGCAGTATT TCGGTATCCC TGTCAATGAA 
5 AACGAGACCG TCAACCAATG GGTTGAGCCT 

TATGCACCTG ATACAGAGCT TCAAAAATTC 
GCACAGCCTA CGGGAGTTTA CAGCTTCAAG 
AATCTGACAC GCACGTCCGG TGTCAACTAC 
ACTGGAGCCA TCGACATCAA GCAGGGTATT 

10 TATCTGTTCA ACACGGGAAC ACGCGACCAG 
GGCTATCGAG CCGGTCAGTA CCTCTCTGTA 
CCGGATCGTA TTCCATCGAT GCATTCCTTC 
ACGTTGCAllA TCTTGTACGA TAAGCTGCTC 
ACGCAGATCA CATGGCGATC CGGCAACTCC 

15 GATGTTCTTG GTAACGAGTC GGCCGACCGT 
TTCGGATTCG ACAACGGCTG GGATGGTCGC 
TATGCGATGT CTGACATCGG TAATGATAAA 
AACCTGCTGA TCGGCTTCGA TGCGGATAAG 
TCGGATCATT TTGCGAAAGG GGCTGTTTAC 

20 CGTATTACGA ATTCTACGTC GTATTCATTC 
TTCCGCTTGT CATATGGATG TGATGAGAAC 
GGCCGTGAAA TTATAATTCT GAATCAAGAT 
ATAGTUVGGTA AGCTTCTTCG CCGCTTGAAA 
GTGCAGACCG GAGGGGCCTA TATTGTGCAT 

25 AAGGTGCTTG TTGAGTAT 



GTCGGTACGA TCGTCGTGAA GTCGTCGAAA 2280 

AATCCGGGCT ATAATCAAAA TGTAGGGGGG 2 34 0 

TGTGCCGATT GTGGTATGTA TCGCAGGAGC 2^00 

TCAGGTTTTC CAATTAATGA TGTGGGCGGA 2*360 

TTCAATGGCG ATAAGTGGCG GCCAGCACCT 2520 

AAGGGCTACC AGATCACGAA TGACGTGCAG 2580 

GGTATGATTT GTGTGTGCGA TGCCTTCCTG 2 64 0 

TCGGGCGCCA ACTTGATCGG CAACTCATAC 27 00 

GTCTTCCCGC CGGAAGTCGA GCAGACGGTG 27 60 

TGGCGTAAGC TTAATGGAAG CACGGTTTCA 2 820 

CCTAAGAATA CAGCGGGTCA GGACAATCTT 2880 

TTGGTGAAGA TGCAGAACGG AGCGTCTTGT 2 94 0 

AAGAACACGA CTGTAAACAA CGGTAATGGT 3000 

GGATCGGCGA ATATGCCGTC ACTTGTGATG 3060 

TTGTGGATCT TTACCGATGG GGGTCTTTCT 3120 

AAGCTGACTG AA7W\GGTTT GTCACAACTT 3180 

TTCCAGGTTG CAGGGGTTCC GGAGTTGAAT 32 4 0 

GATGGTCAAT ACACGTTGGA GTTTGCTCTT 3300 

CTGCACGATC TTCAGTCAGG AGCCAAACAC 3360 

GATGCCAAGC GGGGAGATTC CGGGGCTCGT 34 20 

GTAGATGATT CGCATGTCGT GAGTACAAAT 34 80 

GCTCTTGACT GCACTGTAAC CTTATTCACA 354 0 

GTATTAGCTG GTCATAGAGA AGTCATGAAA 3600 

CTTCAAAATG CTTTCACTAA TGATGTGCAT 3660 

3678 



(2) IIJFORIIATIOIJ FOR SEQ ID MO: 221 

30 (i) SEQUEIICE CHARACTERISTICS: 

(A) LEMGTH: 3675 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS: double 

(D) TOPOLOGY: circular 

35 

(ii) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: NO 
40 (iv) AHTI-SEHSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGAN I SH: PORYPHYROMOI fAS GINGIVALIS 

45 (iM) FEATURE: 

(A) NA1)E/KEY: misc_feature 
(Bl LOCATION 1 . . .3675 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221 

50 

ATGAAACGAT ATACAATAAT TCTTGCAGTT TTTCTTTTAT TCTGCACGGT ATTTACCTTT 60 

C7VAATAAAAG CTCGCCCTTA TGAAAGATTT GCAGATGTAG AGAAGCCTT6 GATTCAGAA^ 120 

CATTCAATGG ATTCTAT^TT GGTGCCTGCA AATAAGGGTA ACTTAATTCA AGCTGAAATT 180 

GTATACCAAT CTGTTTCTGA ACATAGTGAC TTAGTTATTT CACCTGTGAA CGAAATAAGG 2 40 

55 CCTGCAAATC GTTTCCCTTC GCATAGGAAG TCTTTTTTTG CAGAAAATCT ACGGGCATCT 300 

CCCCCCGTAG TTCCCGTTGC CGTCGACAAG TATGCGGTAC CGGTTGCCAA TCCAATGGAT 360 

CCTGAAAATC CCAATGCCTG GGATGTGACG CT7\AAAATCA CTACTAAAGC GGTAACAGTA 4 20 

CCTGTCGATG TGGTGATGGT TATCGACCAG TCTTCGTCAA TGGGAGGGCA 7\AACATTGCC 4 80 

AGATTAAAGT CTGCCATTGC ATCGGGACAG CGTTTTGTGA AAAAAATGTT GCCTAAGGGG 54 0 

60 ACGGCTACAG AAGGGGTGCG TATCGCTCTT GTGAGTTATG ACCATGAGCC TCATCGCTTA 600 

TCTGATTTTA CCAAAGACAC TGCTTTTCTC TGTCAAAAAA TCCGGGCTTT GACTCCTATT 660 

TGGGGAACAC ATACCCAGGG GGGGCTTAAA ATGGCGAGAA ACATTATGGC CACTTCTACT 7 20 

GCTGTGGATA AGCATATCAT ATTGATGTCT GACGGGTTAG CGACGGAGCA GTATCCTGTT 7 80 

AAAAATGTAA CTACTGCAGA CTTCATTGGC AAAACTGGAA ATGCGAATGA TCCCATTGAT 84 0 

65 TTGGTTATAC AAGGAGCAAT TAATTTCCCT ACAAATTATG TTTCCAACAA TCCATCTACA 900 

CCTCTTACCC CAAATTATCC AACTCATTCT TCTAAAGTTG GACGGAGAAA TCTGCCGGAA 960 

TCCAAATTCG ATTATAGTAA TCTGAGTGCA AGGATTACTT TTGATGGTGT TGCTGGCGCA 1020 

TTGGTCTATG AACCGAGGTT TCCTCATCCC TATTATTATT ATTTCCCTTG TAACGCTGCT 1080 

ATCAATGAGG CTCAGTTTGC GAAAAACTCT GGTTATACAA TCCATACTAT TGGCTATGAC 114 0 

70 CTGGGAGATT TTGCCTTGGC CAACAATTCG TTGAAACTAA CCGCTACAGA CGAGAATCAC 1200 

TTCTTTACGG CGACACCGGC CAATTTAGCT GCAGCGTTTG ATAATATTGC CCAAACTATT 1260 

AATATAGGTA TACAGAGGGG GGAGGTGACG GACTTTGTAG CTCCTGGTTT CATCGTTAAA 1320 

AATCTGACGC AATCGGGAGA TGTTACTCAT TTGCTAAATG TTTCAAATGG AACGGTGCAC 1380 

TATGATGTCT CTACTAAAAA ACTGACATGG ACTACTGGTA CTATCCTGAG CTCATCAGAA 14 4 0 

75 GCTACCATAA CTTATCGTAT TTATGCCGAT TTGGATTATA TACAGAACAA TGATATTCCG 1500 
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GTAAATACTA CTTCTGCTAT CGGCCCGGAT 
AAATTGACCT ATACCAATTC CAATGGCGAA 
ACGGTTAAGT TAGGTTATGG TGTTATTAAG 
CAACCCATAC AGGCAAATGG AACAGTTGTC 
5 TCACAAGATT TCTTTTTGCC CTCAGGTGGA 
GACAAAACGA CCGAAGCATT ACAGTACTAT 
ACTGCCGATG GTAAACGTTA TCGTTTTGTC 
CAAATCGGTA TCAGTTGGAA AAAACCGGCA 
AATTATTGGA TGGGAGGAAC AACAGACCAA 
10 ACAGGAGCCC AAGTACCGCT CACAGGAGAA 
GGTTCTCCGG CGGTAGCCGA TTTGCATGTC 
CTTATCAATA ATTCCGACAA GGATTTAGTT 
GGCGTGGTTG AGGATAACAA TCCGAATGTC 
AATCCTACGG GGACATTGCT TTTTGCCAAT 
15 GTCGAGTTTT ACAATCAGGG ATATGATTGT 
CAGTATTTCG GTATCCCTGT CAATGAATCA 
GAGACCGTCA ACCAATGGGT TGAGCCTTTC 
GCACCTGATA CAGAGCTTCA AAA^^.TTCAAG 
CAGCCTACGG GAGTTTACAG CTTCAAGGGT 
20 CTGACACGCA CGTCCGGTGT CAACTACTCG 
GG AGCCATCG ACATCAAGCA GGGTATTGTC 
CTGTTCAACA CGGGAACACG CGACCAGTGG 
TATCGAGCCG GTCAGTACCT CTCTGTACCT 
GATCGTATTC CATCGATGCA TTCCTTCTTG 
25 TTGCAIfATCT TGTACGATAA GCTGCTCAAG 
CAGATCACAT GGCGATCCGG CAACTCCGGA 
GTTCTTGGTA ACGAGTCGGC CGACCGTTTG 
GGATTCGACA ACGGCTGGGA TGGTCGCAAG 
GCGATGTCTG ACATCGGTAA TGATAAATTC 
30 CTGCTGATCG GCTTCGATGC GGATAAGGAT 
GATCATTTTG CGAAAGGGGC TGTTTACCTG 
ATTACGAATT CTACGTCGTA TTCATTCGAT 
CGCTTGTCAT ATGGATGTGA TGAGAACGTA 
CGTGAAATTA TAATTCTGAA TCAAGATGCT 
35 GAAGGTAAGC TTCTTCGCCG CTTGAAAGTA 

CAGACCGGAG GGGCCTATAT TGTGCATCTT 
GTGCTTGTTG AGTAT 



CTTGGTGGAT TCGATACCAA TACCGAGGCA 1560 

CCG;iATCAGC AGTTAATTTT CCCACGTCCG 162 0 

CGGCACTATG TATTGGTAAA TAAAGACGGT 1680 

AGTTCCCTAA GCGAGGCTCA TGTTCTACAG 174 0 

GGTCATATTG TTCCCAAATG GATAAAGTTG 1800 

TCCGTACCGC CGACTAACAC GGTCATCACT 18 60 

GAAGTCCCAG GCTCCACGCC GAATCCGGGC 1920 

GGAAACGCTT ACTTCGCTTA CJ^JKGCTCCTC 1980 

CAGAGTGAAT GGGATGTGAC GTCCAATTGG 204 0 

GATGTAGAGT TTGCAACGAC AGAAAATTTC 2100 

CCGACAACCA ACCCCAAAAT TATCGGTAAC 2160 

GTTACCACAA GCAGTCAATT GACGATCAAC 2220 

GGTACGATCG TCGTGAAGTC GTCGAAAGAC 22 8 0 

CCGGGCT ATA ATCAAAATGT AGGGGGGACC 23 4 0 

GCCGATTGTG GTATGTATCG CAGGAGCTGG 24 00 

GGTTTTCCA^\ TTAATGATGT GGGCGGAAAC 2 4 60 

AATGGCGATA AGTGGCGGCC AGCACCTTAT 2 52 0 

GGCTACCAGA TCACGAATGA CGTGCAGGCA 25 80 

ATGATTTGTG TGTGCGATGC CTTCCTGAAT 2 64 0 

GGCGCCAACT TGATCGGCAA CTCATACACT 27 00 

TTCCCGCCGG AAGTCGAGCA GACGGTGTAT 2 7 60 

CGTAAGCTTA ATGGAAGCAC GGTTTCAGGC 2 82 0 

AAGAATACAG CGGGTCAGGA CAATCTTCCG 28 8 0 

GTG7VAGATGC AGAACGGAGC GTCTTGTACG 2 94 0 

AACACGACTG TAAACAACGG TAATGGTACG 3 000 

TCGGCGAATA TGCCGTCACT TGTGATGGAT 3060 

TGGATCTTTA CCGATGGGGG TCTTTCTTTC 312 0 

CTGACTGAAA AAGGTTTGTC ACAACTTTAT 3180 

CAGGTTGCAG GGGTTCCGGA GTTGAATAAC 32 4 0 

GGTCAATACA CGTTGGAGTT TGCTCTTTCG 3300 

CACGATCTTC AGTCAGGAGC CAAACACCGT 3360 

GCCAAGCGGG GAGATTCCGG GGCTCGTTTC 34 20 

GATGATTCGC ATGTCGTGAG TACAAATGGC 34 80 

CTTGACTGCA CTGTAACCTT ATTCACAATA 354 0 

TTAGCTGGTC ATAGAGAAGT CATGAAAGTG 3600 

CAAAATGCTT TCACTAATGA TGTGCATAAG 3660 

3675 



40 (2) 111 FORMATION FOR SEQ ID MO: 2 22 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 1275 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRAIfDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
50 (iii) HYPOTHETICAL: NO 

(iv) AMTI- SENSE: NO 



55 



60 



(vi) ORIGINAL SOURCE: 

(A) ORGAIJISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. , . 127 5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 2 



ATGGAAGTGA AGAAAAACAC AGTGGTGCTA CGCCTTCTGA TTTGGTTCGT GGC^ATT^TT 60 

CTCTTCCACT CCTCACGGCT GTGGGGACAG GAAGGGGAGG GGAGTGCCCG ATACAGATTC l^n 

DD AAAGGATTCG TGGATACCTA CCATGCCGTA CGCAGCTCTT CTCCTTTTGA TTTCATGAGC 180 

TCGCGTACGA GAGTGAGAGG TGAGCTGGAG AGGTCGTTCG GTAATTCGAA AGTAGCCGTA 24 n 

TCGGTCAATG CCACCTACAA TGCTCTACTG AAAGACGAGA CCGGCTTACG TTTACGTGAA 300 

GCCTTCTTCG AGCATCAGGA AGAGCATTGG GGGTTGCGCC TCGGACGACA GATTGTCATT 36n 

TGGGGGGCTG CCGACGGTGT GCGCATCACG GATCTGATCT CCCCGATGGA TATGACCGAG 4 20 

/U TTTCTGGCAC AGGATTACGA TGATATTCGT ATGCCGGTCA ATGCATTGCG TTTCTCTGT*- 4 80 

TTCAACGAAT CGATG/^^GT GGAAGTCGTG GTACTGCCTG TATTCGAGGG GTA-^CGTCTG 54 o 

CCTGTGGATC CTCGCAATCC TTGGAATATC TTCTCCCTTT CGCCCATTGC TCAGGGGATG 600 

AATATCGTCT GGAAAGAAGA AGCCGGCAAA CCGGCCTTCA AGGTTGCCAA TATCGAGTAC 6r.0 

GGTGCGCGAT GGAGCACTAC GCTCTCCGGT ATCGACTTCG CTTTGGCTGC ATTGCATA^A 7 2^ 

/t) TGGAACAAGA TGCCCGTCAT CGAAGTACAG GGCATTGTGC CGACGGAAAT CATCGTTAG'^ 



7 sn 
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CCTCGCTATT ATCGTATGGG ATTTGTCGGC GGCGACCTCT CCGTACCCGT CGGACAGTTT S-^O 

GTTTTCAGGG GAGAGGCTGC GTTCAATATC GACAAACACT TCACCTATAA GAGTCATGCC 900 

GAGCAAGAGG GTTTCCAAAC AATCAATTGG TTGGCCGGAG CCGATTGGTA TGCTCCCGGT 960 

GAATGGATGA TCTCAGGACA ATTCTCAATG GAAAGCATAT TCAGGTATAG GGATTTCATC 1020 

TCCCAAAGAC AACATTCTAC CCTGATTACT CTCAATGTTT CCAAGAAATT CTTCGGCAGT 1080 

ACACTCCAAC TTTCGGACTT CACCTACTAC GACCTTACGG GCAAAGGATG GTTCAGTCGC 11^0 

TTTGCAGCTG ACTATGCCTT GAACGATCAG ATACATCTGA TGGCCGGATA TGACTGGTTC 1200 

AGTAGT7V\GG GCAGCGGTAT ATTCGATCGC TACAAAGACA ATTCCGAACT CTGGTTCAAA 1260 

GCCCGCTACA GCTTC 1275 

(2) IN FORMAT I OM FOR SEQ ID NO: 22 3 



(i) SEQUEHCE CHARACTERISTICS: 
15 {Aj LENGTH: 1212 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDUESS: double 

(D) TOPOLOGV: circular 

20 (ii) HOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: MO 
(iv) ANTI-SEIISE: MO 



(vi) ORIGIIJAL SOURCE: 

(A) ORGAIIIS!!: PORYPHYROI^IOMAS GIIIGIVALIS 



(ix) FEATURE: 
30 (A) NAI'lE/KEY: misc_feature 

(B) LOCATION 1 . . . 1212 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 223 

35 ATGTCCTCCT GTGAGGTGGC TTATTTTTCA CTAAAGCCGA TCGATCTGCA GAACATCCGC 50 

GAACGGAATC ACTCTTCCGA CATCGCGCTT TCCAATTTAT TAGACAATTC GAATCAGCTA 120 

TTAGCTACTA TTCTGATCGG GAATAATGTG ATTAATGTAG CCATCGTTAT CCTTTCCAAT 180 

TATGCCATCG AGCAGACATT CGTTTTCTCT TCTCCGATCA TTGGATTTCT GATCCAGACG 24 0 

ATACTCCTGA CCACTGTTCT TTTGCTGTTC GGAGAGATTC TGCCGAAAGT GTATGCGCGG 300 

40 AAGAATCCGC TGCAATACTC GCGCTTTTCT GCTGCAGCTA TGTCCGTTAT CTATAAGATA 360 

TTGTCACCGT TTTCAAAATT GCTGGTCAAA AGTACCGGCA TCGTTACCAG AGGTATCAGC 4 20 
AAGAAGAAAT ACGATATGTC CGTGGATGAG CTCTCGAAAG CGGTAGCCCT CACCACTACG 4 80 

GAGGGAGAGC CGGAGGAGAA AGAAATGATT AACGAAATCA TCAAATTCTA TAATAAGACA SAO 
GCCTGCGAAA TCATGGTTCC GCGTATCGAT ATTGTGGATG TGGATCTGAG CTGGCCATTT 600 

45 CGTAAGATGC TTGACTTCGT TGTTTCGTCG GGTTATTCCA GACTTCCCGT TTCAGAGGGG 660 

TCAGAAGACA ATATCAAAGG GGTGATTTAC ATCAAAGATC TAATCCCACA CATGGATAAA 720 

GGCGATGAAT TCGACTGGCA TCCTCTGATT CGTAAAGCAT ATTTTGTCCC CGAAAACAAG 7 80 
CGCATAGATG ATTTGCTCGA GGAGTTCAGA GCCAATAAGG TGCATGTCTC CATCGTTGTG 8 AO 

GATGAGTTCG GTGGCACTTG CGGACTGATC ACAATGGAGG ACATATTGGA AGAGATCGTC 900 

50 GGCGAGATTA CGGACGAGTA CGATGAGGAA GAACTCCCCT TTAAGGTTTT GGGGGATGGC 960 

AGTTATCTTT TCGAAGGAAA AACGTCTCTC TCCGATGTTC GACACTATCT TGACCTTCCG 1020 

GAAAATGCTT TCGGTGAATT GGGGGACGAG GTAGATACGC TAAGTGGGCT CTTCTTGGAA 1080 

ATCAAGCAGG AACTCCCCCA TGTGGGCGAT ACAGCAGTGT ACGAGCCATT CCGCTTTC7VA 114 0 

GTGACCCAAA TGGACAAGCG CCGAATCATC GAAATCAAGA TTTTCCCTTT CGAGCGCACT 1200 

55 TGGGAGGTCG AA 1212 



(2) INFORI IATIOIJ FOR SEQ ID MO: 224 

60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDUESS: double 
(D) TOPOLOGY: circular 

65 

(ii) t'lOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

70 (iv) ANTI-SEMSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGAMISIl: PORY PHYROHONAS GIUGIVALIS 

75 (ix) FEATURE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1J_> 



wo 99/29870 



PCT/AU98/01023 



154/ 490 

(A) NAJ-IE/KEY: misc_feature 

(B) LOCATION .1 ... 780 

SEQUEHCE DESCRIPTIOM: SEQ ID MO: 224 

5 

ATGAAACTAT TACTTTATCT CCTATTGGTC TTGTCGACTC TATCCCCGAT GTATTCGCAA 60 

ATGCTCTTCT CAGAGAATCT CACAATGAAT ATAGACAGCA CCAAAACCAT ACAAGGAACG 120 

ATATTGCCCG TACTGGATTT CAAAACCGAA AAGGAAAATG TGTTCACCTT CAAAAATACT 180 

GCCAATCTCA ATCTGCTGAT AAAGCACGGT CAAGTAATCA ACTTAATTAA TAAGCTTGAG 24 0 

iU TTTTCTACCT ATGGCAATAA AGTAACCGTA AGTGGAGGAT ATGTACACAC CGAATACCGC 300 

TATTTGTTGC ATCATGTTTT TGAGGTTTAT CCTTATGTCG AGTCGCAATG GGCAGAAAGT 36 0 

AGAGGAATGA AATATAAGGT TTCTACGGGA TTACAGTCGC GTTATCGGCT GGTAAATAGT 4 20 

GATAACTGTC TCATGTTTGC AACATTGGGG GTATTTTTCG AATTCGAAAA GTGGGAACAG ^80 

CCAGCCACTA GCCTCTTTGC AGGAACGTAT GCATACAGCC GAAGTATCAA AAGCCACCTG 5*^0 

lb TCTATCAGTT TCAGACATCG GTTGGGTGAA CATTGGGAAT TTACAACTAC GGCTATTCAC 600 

CAGGGAAAGC CTGACAGTTA TTTTAAGAAG GCACGTTTTG GAGGAGCTAT CGACCTCAAA 660 

TACCATATCA CACCTACGAT AGGAATACGC GGGGCCT ATC GGATCATCTA CGATACTGCC 720 

CCTATTGTAC CTGTGCGGAA AGATTACAAC ACCGTTGATG TTGGTATCGA TATTTCGTTT 7 80 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(2) inF-QRIIATIOfl FOR SEQ ID 110:225 

{ i ) SEQUEI JCE CHARACTERISTICS : 

(A) LENGTH: 2502 base pairs 

(B) TYPE: nucleic acid 

(C) STRAtlDEDMESS: double 

(D) TOPOLOGY: circulai" 

iii) MOLECULE TYPE: DIIA (genomic) 

{iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROl lOI IAS GINGIVALIS 

(i:-:) FEATURE: 

(A) NAIIE/KEY: misc_feature 

(B) LOCATION 1...2 502 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:225 



ATGAAACGAA TCGTTTTATC ATCTTTCCTG TTCGTTCTGT CCATACTTTC TTTGATGGCA 6 0 

CAGAACAATA CCCTCGATGT ACACATATCC GGTACGATCA AGGATGCCTC CTCCGGCGAA 120 

CCAGTGCCCT ATGCCACTGT AAGCATCCGG CTGACAGGAG CAGATACCAC ACAGGTGTTC 180 

CGACAAGTGA CTGACGGCAA CGGCTACTTC GTCATAGGCC TGCCGGCAGC TCCCTCCTAT 24 0 

CACCTGACAG CTTCGTTCGT AGGTATGAAA ACCCATACCA TGCAGATTAG TCGGGGAAAT 300 

GGACAGCACG ACATCAAATC CATCGACATT TCTCTCGAAT CCGAGGACAA ACAACTCTCC 360 

ACCGTCACCG TATCGGCAGC ACGACCACTG GTGAAGATGG AGATAGACCG CCTGTCCTAT 4 20 

AATATGAAAG ATGACCCCGC AGCCAAGACG AACAACCTGC TCGAAATGCT GCGCAACGTT 4 80 

CCTTTGGTAA CGGTGGATGG TCAGGGCAAT ATCCAGGTGA AAGGATCTTC CAACTTCAAA 54 0 

ATCCACCTCA ATGGCAGGCC CTCGACCATG GTGAGCAGCA ACCCGAAGGA GGTCTTTCGC 600 

TCCATTCCTG CCCATACGAT CAAACGGGTG GAGGTCATCA CCGATCCGGG TGTAAAGTAC 6^0 

GATGCGGAAG GCACAAGTGC CATCCTGGAC ATCGTCACGG AAGAAGGTAA GAAGCTGGAA 7 20 

GGATATTCAG GTTCCATCAC GGCCAGTGTC AGCAACAATC CCACAGCCAA CGGTAGTATC 7 8 0 

TTTCTGACGG CAAAGTCCGG CAAAGTCGGG CTGACTACCA ACTATAACTA CTACGGTGGC 84 0 

AAAAACAAGG GCTCTCGCTA CTTTACCGAA CGTACTACAT CCATGCTCCA AACGATAGAA 900 

GAAGGCAAAG GGCAAGAAAC CTTTGGCGGA CACTTCGGCA ATGCCCTCCT CTCATTCGAG <^<^0 

ATAGATTCGC TCAATCTCTT TACGGTGGGC GGCAATGTAC GCCTTTGGGA GATGACCACC 102 0 

GACCGGAACA GCGTAGAAAA AAGCTTTGCC GGCAGCAACC TCATGTCCTA CATAGACAGA 1080 

AAACTCAAAA CACAGATGGA TGCCGGATCA TACGAGCTCA ATGCCGACTA TCAGCACAGC 114 0 

ACTCGCCTGC CGGGCGA.ATT GCTCACCGTT TCCTACCGCT TCACTCACAA TCCTAATAAT 12 0O 

AGCGAGACCT TCATTGACCA ATGGAAGCGC GATCCGCTCA ACACAGCTAA TACGATCCAG 1^60 

Dt> TACGCCGGCC AGCACTCCAA ATCCGATGCG GGCATGGACG AACATACGGC ACAAGTGGAC 132 0 

TATACACGTC CCTTAGGACA AGCACATTCT TTGGAAGCAG GGCTGAAGTA CATCTATCGT 1380 

CATGCCACGA GCGATCCTCT CTATGAGATA CGACCATCCG AAGATGCTCC GTGGCAGCCC 14 40 

GGCTCTCTAT ATGCACAGAA TCCGTCGAAC GGAAAGTTCC GCCACGATCA ATACATCGGA 1500 

GCAGCCTATG CCGGCTACAA CTATCGTAAG GATCAGTATT CTTTGCAAAC CGGCCTCCGA 1560 

/U GTGGAAAGCA GCAGGCTGAA AGCACTCTTT CCCGAAAACG CAGCAGCAGA TTTCTCCCAC 162 0 

AACTCGTTCG ACTGGGTGCC ACAGCTCACG CTCGGCTATA CCCCCTCGCC CATGAAGCAG 1600 

CTTAAGCTGG CCTATAACTT CCGAATCCAA CGTCCTGCAJV TCGGCCAACT GAATCCCTA^ 174 0 

CGGCTACAGA CCAACGATTA TCAAGTACAG TATGGTAATC CCGACCTAAA GTCGGAGAAG 18 00 

CGTCACCACG TCGGTCTCTC CTATAATCAA TACGGAGCCA AGGTCATGCT TACAGCAT^G I8 60 

/O CTCGACTACG ACTTCTGCAA CAACGCCATC CAG.AATTACA CCTTCTCCGA CCCGG^CAAT 1^2 0 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 



PCT/AU98/01023 



155/490 



10 



CCCAATCTGT 
ACCTATGCCA 
CGCACATTCC 
TACTCAGGCC 
TATCATGGGG 
ATAGCCAA-AC 
GCGAAGTATT 
AATGCCGGTA 
ACACAAGTGC 
CAAGGACAGC 



TCCACCAGAC 
TGTACACGCC 
AAAJ\GAGCGA 
TGATGTTCAC 
GAAGAAGCTA 
AGCTTTTCGA 
CG AC AT GGAA 
TACAACGGAG 
GCAAGGTAGA 
AGGGTGGCGG 



CTATGGCAAT 
GGCCGTATGG 
AGCACTCGGC 
CCTGCCGAAG 
CCAGACGAAG 
CAAAAAATTG 
GAGCCGGACC 
TGTTTCCCTC 
GCGTACGATC 
ACAAGGAAAT 



ATCGGACGAG 
GTCAGGATTA 
ATTGATGTCA 
GATTGGACTG 
TATGATGGCA 
AGAGTCTCGC 
ATCGGCAATG 
AGCCTCACCT 
GTCAACGACG 
CCTACCGGCA 



AGCATTCTTT 
TGCTCAACGG 
ATTCATGGTC 
TGAATCTCTT 
ATGTATTCAA 
TGAGCGCATU^ 
GATTTACTAT 
ACAGCT'l'CGG 
ACCTCT^AGCA 
AT 



CAGCTTGAAT 
AAAT AT CG AT 
CGGCATGGTA 
CGGAGGTTAT 
CAATATCGGT 
CAACATTCAT 
TTATTCGGAA 
TAAGATGAAT 
AACCTCATCC 



1980 
20^0 
2100 
2160 
2220 
2280 
23-30 
2400 
2460 
2502 



(2) ItlFORMATIOM FOR SEQ ID HO: 226 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH: 1197 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDUESS: double 

(D) TOPOLOGY: circular 

20 

(ii) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: tlO 
25 (iv) AMTI-SEIISE: MO 



(vi) ORIGINAL SOURCE: 

(A) ORGAMISIl: PORYPHYROMOMAS GINGIVALIS 

30 {ix} FEATURE: 

(A) MAIIE/KEY: misc_feature 

(B) LOCATIOM 1 ... 1197 



(xi) SEQUENCE DESCRIPTION: SI 

35 

ATGAGACTCT CTGCCATTCT TATCGCTTTG 
CAGCATTATT ATTCCATGGC GGGAGAGCGA 
CTCTCGGCAT CGATCCGAAG TGCGCTTTTC 
GTCAAAGGTT ATACGTTGCC GGGTGCACGG 

40 GCAGCACATG GTGTGAAGCT TTCGCTCGGA 
CGCTATCCGG CCGGTATCGC TTATTCCGAT 
GTACGCTTGC GTATCCTGCC TTATGTACAG 
CTCATGCTGG GCAATATAGC CGGTGGTACG 
CCTGAGTTGG ATTTGACGGC TGATCCTGAA 

45 CGTTTCCGAA TGGATGTTTG GGTCAATTGG 
CAGGAGTCGT TTGTCTTTGG CTTGTCCACT 
TGGCGACTCG AACTGCCCTT GCAGGCTATT 
GCGCAGCAGG ATACCGTGCA TACATGGGTC 
CGCCCTCGTA CCGACAAACC CATGCAGATT 

50 TCAAGCGGAG GATACTTCCC TTACGAAAGA 
GACTTGGAGC ACTTCGCTTT TCGTACCGAC 
TTTGCTGCAC CTTTCGCCAA TTCCCTGACG 
GGCGATTATA TTCGTCTCTA TGCCGACTAT 
GCGGCTGTTG CTCGGGTATG GTTCCAGCCT 

55 GAACTGACGA TGCGTATCGA TCCCAAATTC 



;Q id MO: 226 

ATTGTGATGC TGCCTGCTGT GCTTAGCGGG 60 

CTGGAGACGG ACAGCATTCG TCCGAACGAA 120 

TTTCGGAACA ATGAATACAA TGCACGTTCG 180 

GTTTCCGCTT TTGCCTCTTA CTCGCTGCCG 24 0 

GTATCTACCC TGAACTACTG GGGGGCAAGT 300 

TTACCTTATT GGACGGACTA TAACGACTAT 360 

GCCATGCTGA AGCCGACGGC CACGACTGCT 420 

GCTCACGGAC TGATCGAACC GATCTACAAT 4 80 

GCCGGTGTGC AATTTCGGGG TGATTGGACA 54 0 

ATGAGCATGA TTTTCAAAAA TGACAATCAT 600 

ACTTCGAAAT TGTTATCGGG TGAAGGCAAA 660 

GCCACGCATC GCGGCGGGGA ATACT^CTGG 7 20 

AATGGAGCTG TCGGACTTAA GCTTTCGTAT 780 

TGGGGATCTG CTTATGGTGT GGCAGCCTTG 84 0 

GGGTGGGGCG GTTATCTTTC TCTCGGAATG 900 

TATTGGTACG GCAGGCATTA CGTTTCTCCC 960 

TATGACAAAC AGCCTCTTAC GAACGGTTGG 1020 

TCGTGGCGGA TGGCACGAAG TGTTTCGTTG 108 0 

TCGGATCGTT TTGCGATGAG CCACGCCTTG 114 0 

CCAATAGCTT TTCTGAAAGG CAATCAT 1197 



(2) INFORMATION FOR SEQ ID MO: 227 

60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1146 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIIDEDMESS: double 

(D) TOPOLOGY: circular 

65 

(ii) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: MO 
70 (iv) AMTI -SENSE: MO 



(vi) ORIGINAL SOURCE: 

(A) ORGAMISI'i: P0RYPHYR0I10NAS GINGIVALIS 



75 (ix) FEATURE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870AV l_> 



wo 99/29870 



156/490 



PCT/AU98/01023 



(A) IIAIIE/KEY: misc_feature 
(B ) LOCATIOrJ 1 ... 1146 

(xi) SEQUENCE DESCRIPTIOH: SEQ ID 110:227 



10 



15 



20 



25 



ATGAACAAAT 
CAACAGACAG 
CTGGAGCAGA 
CCGCTTGTCG 
GCTCATGGCG 
GCCGACGGCA 
TGGCACCCGA 
ATAGAATACG 
ACCTGCTATA 
CTACAGGTAC 
GTCGCTACCG 
GGAGGAGCAC 
ATGTGGTACT 
GACCTCACCT 
AAGGGCAAGA 
ATTTTCGATC 
CAATACGGCG 
TACCCTACGG 
AGTGCGACCG 
TTCTTA 



CGCTATTATC 
GACCGGCCGA 
AGCAGAAAAA 
ATGACGAATC 
AACTGAACCG 
CTTCGTTTGC 
ACGACAAGGT 
ACATGAACCC 
CTTCGGGCGT 
TCAATGCTTA 
CCACGAGCTA 
TGCATATGCG 
TCACTGCGGG 
ACTCGATCGA 
CCCTCACGGA 
AGGTCAATCT 
AGAGCAGCCA 
AGACCAACTT 
AGACGGAAAG 



ATTGGCATGC 
ACGCAGCGGC 
GCTGAAGGTG 
CCATATCGGA 
CCACCTCAGC 
CGACAATCTC 
GTCTTTCTTC 
CGTAGAGATC 
GAACTTCGCA 
CAACAACCGC 
CCCGCTCCTC 
TTACGCCGTG 
CAACCTGTTC 
GGGATTGGAC 
CGTCAAGTAC 
CTTCCTCAAA 
CACGCGCCAC 
CCGTCTGTTC 
CACCAATGCT 



CTCATCCTGT 
GAGCCTTCTC 
TACTTAGGCA 
CACTTCAAGG 
TTCGACTGGC 
TCCAATGCCA 
TTCGGACGTC 
TACCAGTACA 
TGGAACTTCC 
TTCGCCGACC 
TACTCGGCAC 
TCGATGGCTC 
AATCCGGGCA 
GACAAAGGCA 
TATGCTCTGG 
GGCATGTACG 
TCCTACGGCT 
GTCACCTACA 
CTTCGCGCCG 



GCGGTATGCC 
TGGCCGAACG 
TACAGTCGTT 
TACAGGAGCT 
GACAACGTCT 
TCGACATCGC 
AGTACGCGCG 
GCGACCTTGT 
ACCCCGAACA 
GCTACCACGT 
AGTGGAACGG 
ATCAGGCCCA 
AACGGATCAA 
TTATGACTGC 
TATCGAAGTG 
AGAACGGCTA 
ATATGGGAGG 
TAGGACGGCA 
GTCTGATCTA 



GGCCATCGCC 
TGTATTCGGT 
CTACGACCAG 
GCGGATGTCT 
CAACCGTGCC 
AGGTGTGGAC 
TTTCGGAGGG 
GGATTACATG 
GCAGCTGCAG 
GACACCCGAT 
TACCCTCCTC 
AGAGCGTAAT 
CGGATACCTC 
TCGCTACGGC 
GAACTTCCGC 
TGCGCCTGCC 
GGTGGAATAT 
TTACCGGTAC 
TCAGATACCT 



60 
120 
180 
24 0 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
114 0 
1146 



30 



35 



40 



45 



50 



55 



60 



(2) IIIFORMATION FOR SEQ ID NO: 228 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 666 base pair^ 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAl'flSH: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAT IE/KEY: misc_feature 

(B) LOCATION 1 . . . 666 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228 



ATGAAACATT 
TATACCTTTG 
GGACAGGGAT 
GATAGACTTT 
ACCGGCGTAG 
AAAGAAAATA 
CCTATACGCC 
ATGAGTCGTC 
TCGAGGTCGG 
GTTTACTTTA 
TACCAGCAAA 
AACCTC 



TGTTTAAGTC 
CACAAGAAAA 
ATATTGCAGG 
TGTCTGGTGC 
CATTTCGTTT 
CCTTGAATTA 
CTATCAAAAA 
ATGAGCAAAT 
GACTGGATTT 
TGGCAGGAAC 
AGGATCGAAC 



GACATTAGTA 
TAATACAGAA 
TTCAACTACC 
AATTTATCTG 
CTTATCNCCC 
TGCGTTTTAC 
TTTTAATTTC 
TTACAATTCT 
TGGTCTCGGG 
CGATCTTACG 
CTTCATTGCA 



CTTCTTTGTG 
AAGTCACGAT 
AACCTGATGT 
GGCTTGACAC 
TCTCCGGGTT 
GTTGTCGGAG 
ATCTTCTCTA 
ACTTCGCAGA 
ATGCATCTGC 
TCTTGCATGT 
CTTATCGACA 



CTCTTTCTTT 
TTGATTTTTC 
ATGGGTATAC 
CAAGT/yVGAA 
ATTATGTCGA 
CATATAATAG 
CAGAAGTCGG 
CTTGGGATAA 
AATNCCACAT 
TCGGAA7\AAG 
ACAGTATTGG 



TTCCGGTACC 
TGTTAGGCTG 
ATCTGCTAAC 
AGAAAATGCA 
TATATCCGGC 
AATAGCCATT 
AATGGCTTGG 
GCAGCGCAAG 
TAATAAGACC 
GATCAATGAC 
CATAGGATTA 



60 
120 
180 
240 
300 
360 
4 20 
480 
540 
600 
660 
666 



65 



70 



75 



(2) INFORMATION FOR SEQ ID NO: 22 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

tii) MOLECULE TYPE: DNA (genomic) 
(iii ) HYPOTHETICAL: NO 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929B70A1_I_> 



wo 99/29870 



PCT/AU98/01023 



157/490 



10 



55 



(iv) ANTI -SENSE: NO 

(vi) ORIGIIIAL SOURCE: 

(A) ORGAIIISM: PORYPHYROMOMAS GIHGIVALIS 

(ix) FEATURE: 

(A) IJAI-IE/KEY: inisc_f ea ture 

(B) LOCATIOM 1 . . . 1173 

(xi) SEQUENCE DESCRI PTIOtJ : SEQ ID NO: 229 



ATGATTATCA AGATWVTGCT GAAAAATAAA TTGGCCCCCT TGGCCATACT GTTCCTTTTT 60 

GCTCCAAAGG CTATGAAGGC TCAGGAGCAA CTGAATGTGG TACACACCTC TGTGCCATCG 120 

15 CTGAATATCA GTCCGGATGC ACGTGCGGCC GGTATGGGGG ATATAGGTGT GGCAACGACG 180 

CCGGATGCGT ATTCACAGTA TTGGAATCCG AGTAAATATG CTTTCATGGA TACGAAAGCC 24 0 

GGTATTAGCT TCTCATATAC ACCCTGGCTG TCCAAGCTGG TCAATGATAT TGCCCTGATG 300 

CAGATGACCG GTTTCTACAA ATTGGGAACA GACGAGAATC AGGCTATTAG TGCTTCTCTG 360 

CGTTATTTCA CATTAGGAAA GTTGGAGACT TTCGACGAAT TGGGCGAATC CATGGGAGAG 4 20 

20 GCCCATCCCA ATGAATTTGC TGTCGATTTG GGCTATAGCC GCCAGTTGTC GGAGAACTTC 4 80 

TCCATGGCTG TTGCACTGCG TTACATCCGC TCAGACCAAA GCACTCACAA CACCGGAGAG 5^0 

AATCAGGCCG GAAATGCCTT TGCGGCGGAT ATAGCCGGTT ATTTGCAGAA GTATGTGCTA 6 00 

CTGGGTAATG CGGAGAGCTT GTGGTCGTTG GGTTTCAACG TAAAGAATAT CGGAACGAAG 660 

ATCTCCTATG ACGGAGGTGT CACGAGTTTT TTCATCCCTA CTTCGTTGAA TCTCGGGACG 7 20 

25 GGGCTGTTGT ATCCGATCGA TGACTATAAC AGCATCAATT TCAACCTTGA ACTTAGCAAG 7 80 

CTGCTTGTAC CCACTCCTCC TATCATGGAT CAAAACGATC AGGCCGGGTA TGAGGCTGCA 840 

CTCAAGAAAT ATCAGGAAAC TTCTTCGATC AGCGGTATAT TCTCTTCTTT CGGTGATGCG 900 

CCGGGAGGAC TCAAGGAAGA ATTCCGTGAG ATTACATGGG GACTTGGGGC TGAATATAGC 960 

TATGACGATA AATTTTTTGT TCGTGCCGGA TATTCATACC TGCACCCCAC CAAAGGCAAT 1020 

30 TTGCAGTACT TCACGGCCGG TGCCGGCTTC AAAATGAACA TATTCCGTAT CGATGCTTCC 1030 

TACCTGTTGT CTACGATCCA GAGTAATCCG TTGGATCAGA CTCTGCGGTT TACGCTTGCT 114 0 

TTCGATATGG ATGGATTGCG CAATTTGTTC CAC 1173 

35 (2) INFORMATION FOR SEQ ID NO: 230 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1338 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

45 (iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(vi) ORIGIIJAL SOURCE: 
50 (A) ORGAN I SH: PORYPHYROMONAS GINGIVAL IS 

(ix) FEATURE: 

(A) NAl'IE/KEY: misc^feature 

( B ) LOCATI ON 1 . ..1338 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230 



ATGAAAACAA CAGTTCAACA AATTATTCTG TGCCTGGCTT T7U\TGATGTC AGGTGTATTG 60 

GGCGGAAACG CACAGAGCTT TTGGGAAGAA ATAGCTCCTC CTTTTATCAG TAATGAGCCT 120 

60 AACGTCAAGT ATATAATTCC CAATATGGGG ATTGATTCAA AGGGAACAAT CTATGTAACC 180 

GTGACAAAAA GGATTCAGCA GGGAGCAAAT TATACTTCTG AGCAATTGGG TATGTACTAT 24 0 

CGACCATTAG GTGATAATGA ACAGTGGTGG AAACATGATC CGTATTTTGA TGACAAGATA 300 

GTTGCGGATA TTCAGACAGA TGCATATGGC AGAGTTTATG TATGTACGAC TTCTTCTCGA 360 

GATCAAGAGT ATCAACTTTA TATAAACGAG CAGAACGAAT GGAGGTGTAT ATTCAAAACT 4 20 

65 TCTGTGTCTA CATATGAGCA TGGTATGGCT GTTTTTCGCT CTTCGACAGG GGTGACTTAT 4 80 

ATAGGTACCA GGCATCACAT CTTCGCATCA GGTGTAAATG ATTTCGAGTT CAACACTATC 54 0 

TATGAAGACT CTACACCTAT GAGCTGTCGC TTTGCAGAGG CTACGAATAG TGGCACCATC 600 

TATCTGGCAT TGATGCATGA AACCACAATG TCTACGACTA TCCTTACTTA TCAAAACGGT 660 

GAGTTCGTCG ATATCTCGGA AAGTGAATTG AGTAACTCGA TTATTGCATC CATGTGCTCT 720 

70 AATAAAGAAG GTGATATAAT AGCTCTTGTT ACTTCATATA CAGGATTTAT GAGTGGAACC 7 80 

CTTGCGATCA GAAAAGCAGA TGAAGGCAAA TGGCAACTTG TTGGCGGAGA TATACAGAAT 84 0 

GCGATCGTTC AAAATATATG CATGATGGAC GACAACAAGA TTGCTTGTGA AGTCTTCGGG 900 

ACTCCTAACG GAGTAGATGG TCGGACAAGG GTTTGTGTTT CTGACGCATC TGTCTTTGAT 9 60 

TTTGAGTGGT ATGAAGATGA AATATACGGA GGCCTGATAT TTGACACTTT CTTCTATAGC 1020 

75 CCTTGGGACA AACTTCTTTA TGCGAAATTT GGTGGGATTA TGCTCAGGAG TAAAGAGTCT 1080 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 



158/490 



TTTATAACCT CTTTCATTTC TCCGACAGTT GTACAAGGAG TGGATGTCTA TACTTTGGCC 
GGGAAGATAA GGATCGAAAG TGAAACTCCG GTGTCTGAGG TGTTGCTTTT CGACCTGGCT 
GGCAGGATGG TACTTCGGCA AACCATTGAT AATAAAATCT ATTCGGACAT AGATACTAAC 
GGACTAAAGC GAAGCGGTAT TTACGTAGTC TCGGTGCGGC TCTCTTCCGG ACAGGTATTC 
AGTCATAAGG TGCAGGTA 



(2} IMFORMATIOJJ FOR SEQ ID MO: 2 31 

(i) SEQUEriCE CHARACTERISTICS: 

(A) LEMGTH: 924 base pairs 

(B) TYPE: nucleic acid 

(C) STRAUDEDHESS: double 

( D) TOPOLOGY : circular 

(ii) MOLECLILE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: HO 
(iv) AHTI-SEHSE: NO 

{ V i. ) OR I G I HAL S OU RCE : 

(A) ORGAHISM: PORYPHYROMOHAS GIHGIVALIS 

(i::) FEATURE: 

tA) HAIIE/KEY: misc_feature 

(B) LOCATION 1 ... 924 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 231 

ATGATAATCC GGTGTCTTAT CCGTCGTCCG AGAACCGTCC TGTTCGGGTT GATATTCGTG 
GTAGGTCTTT TCTCTGCGAT GGCGCAAGAG AAAAAGGATA GTCTCTCTAC GGTTCAGCCA 
GTGCCGAATA GCAGCATGGT GGAGCAGACC CCTCTTCTCT CCATTGATCA CCCCGTCCTG 
CCCGCTTCTT TTCAGAATAC CCGTACACTG AAAAGGTTTA GAGACAAACA TCTTTCCGAT 
GCTTTGCTCA ATGGATTGAA GCCTCATCGC TCATCTTTGC AATTGAATGA GGAACTCAAC 
TTCGCGGCAG AGCGTCGGGA TTTCGTTTCT CCCCTCTTGC AAACTCGCCA CGCTGCCGGT 
GTCCTTTCAT GGCGACCGAC CGATAGGATG CATTTTTATA CATCGGGCAA TATCGGTCTT 
GGCCATGATT TATTGACCGG TGTGCGCAAG GACTTCGGAT GGAATGCTGG TGCCGA-^TTC 
TTGCTGAGTC AAAATCTTAC GGCACATGTC CAAGGCGGTT GGCAGCAGAA TTTCGGCTTT 
ATACCTATGA cggctgtca;^. TGGCCAACTG CGTTGGCAAG CCACCGAGAG ATTGAGTTTT 
ACCACCGGTA TCGATTATCG ACAGGTACAG TGGAATGCTT TCGATMTAG AACGTTCTCG 
CTTAAAGGAA GTGCTCGATA CGAAGTGATG GACAATGTCT TTGTCAATGG ATTTGGCAGC 
TATCCTCTCT ACAGCAGTAC GCGCTCAGGA CTCAATATGG CTGTTCCGAT GCATGGATTC 
GGCCCTCAGT ACGGTGGATC GCTTGAGCTG AAAGTCTCCG AGCGATTCGG CTTTGCCGTC 
GGTATGGAGC GCGAATACAA TATCTGGACT CGTCGGTGGG AAACGCATTA CTTTGCTTAT 
CCTGTATTCT ATGGCGATAA gaag 



(2) INFORMATION FOR SEQ ID HO: 2 32 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 945 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA {genomic) 

(iii) HYPOTHETICAL: HO 

(iv) AMTI-SEHSE: MO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOHAS GIHGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 ... 945 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232 

ATGAAAACGA ATAGACGATA CGCATTTGTT TTGCCGCTTC TGCTACTCAC CGGATTGTTG 
GCATGGGGGC AGGATTCTTC CCACGGTAGC AATACAGCGT TTGCAACTGA TTCTTCGAGT 
AGAGAGTTGC CCACGGAGCA GTCCGCCTAC CGCATTCATT CTGCCTATAT GGTCGGTGGT 
GGCGGAAGCA TAACGCGCGA CACCTATTTG TCACCCCTTC GTTATGGAGG ATGGACACTG 
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AATTTGTTGG GAGAGAAGAC GTTCCCTCTC AAAGCCTCCG ATTCCCGTTG GATGATCCGT 300 

ACCGGGCATG AGCTGGATTT TGCCCTGATG GACAATCCGG CCAATAATGC TCATTTCTAT 360 

TCCCTGCTGT ATAACGGTTC CGCTGCGGCT CTTTACCGCC TTGGCGCTAA GCATCTGCGA 4 20 

GCCGCGTGGA TGGACAATCT GCGCTTGGCA TTCGGCCCGG GCTTGGAAAT CGGGCTTGGA 4 00 

5 GGAATTTATA GTACACGCAA CGGCAATAAT CCTGCGACAT TGAAGCTCTA CACCAATGCC 54 0 

ATCGCCCAAG CCTCGATAGG ATACTACGTC CCCTCCGAAA CTTTTCCCCT GTATTTTCGG 600 

TTGCTCTCCC AGATCAATCT CTTCGGTATA GCCTATGGAA ATGGTTTTGG TGAGAGCTAT 6 60 

TACGAGAATT TTTTGCTCAA TAACGGCATT GCAGGCTCCC TGCATTTCAC TTATCCGGGC 7 20 

AAGTTTACTC GGTTCACGAC ACTCATAACG GCGGATATTC CCATTCGGAA CTTCTGTACG 7 80 

10 CTTCGTGTCG GTTATCGCTA TTCCCATTTG GGCTCTTCGC TTAACGCATT GGATACTCGA 8 40 

ATCCACAGTC ATACGGCTTT TATCGGTTTC GTCACGGAGT TTTACCGATT CCGTGGGCGC 900 

7\AAGCCATGA ATACCGGTCG GAGAACCAGT CTTTACTATC ATGAT 94 5 

15 {2} IMFORMATIOII FOR SEQ ID 110:233 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 55 base pairs 
(E) TYPE: nucleic acid 
20 {C) STRAIIDEDKESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA (genomic} 

25 (iii) HYPOTHETICAL: HO 

(iv) AI IT I -SENSE: 110 

(vi) ORIGINAL SOURCE: 
30 (A) ORGAHIStl: PORYPHYROMONAS GIHGIVALIS 



35 



( ix ) FEATURE : 

(A) NAI4E/KEY: misc_feature 

(B) LOCATION 1 ... 855 

(>;i) SEQUENCE DESCRIPTION: SEQ ID NO: 233 



ATGGTAGTGA CGCTGCTCGT AATTGTCGGT ATTGTGGTAG TAGTGCGGTA CTCCTTGCGC 60 

GTCCACGTAC ATAAGACCGG CACTGTTGTA TCGGCGGCCA TATTTGGCTT CATCCTCCTT 120 

40 GGAAAGACCG TTCCATGCGA TACCCGTAAC TTCTTTTCCT CCGAAAGTGA TGAACCTGAG 180 

AGCCGTGTTG CTACCGAAAT AGCCCACCTG TGCGAAATAG GATTTCAGAT CCACGCTTCC 24 0 

TCTATCCACG TAGCCGTCCG AACCGATTTT GGACAGGCGG GCATCCACTG CCCAATGGCG 300 

ACCGATGCGT CCGCTACCGA GTTTGACCGA TCGGCGGAAT GTGCCGAACG AACCTCCGCT 3 60 

CAAATCGACA CGGCCATAAG GAGCCAGTCC CAAATTATCC GTACGCATAT TGACACTTGC 4 20 

45 CCCAAAAGCT CCGGCACCAT TGGTGGAAGT ACCCACACCT CGCTGCACCT GAAGGTCTTC 4 80 

GATGGAAGAG GCGAAGTCGG GCATATTCAC CCAAAAGACG GACTGAGATT CGGAGTCGTT 54 0 

GAGGGGTACT CCATTGGTAG TTATGTTGAT GCGATTGGCA TCGGTGCCAC GCACGCGAAA 600 

GCCGGAATAT CCGATACCCG TACCGGCATC GCTGGTGGCT ACCACGGAGG GAGTCAGCAT 660 

CAGCAGATAG GGGATGTCAC GACCATAATT GGACTTGGAA AGTTCGGCCT TGCGAACGTT 7 20 

50 GGTGTAAGCG ACAGGGGTTT TCGCCGTGGC GCGAGTAGCT ACGACCTGTA CGGTCTGGAG 7 80 

CTGCACATTG CTAAGACTAT CTATCTCGCT GTTGGAGACG GGTGCTTGTG CCGTCAGGCA 84 0 

GAAAGGCAGG ACGGC 8 55 

55 (2) INFORMATION FOR SEQ ID HO: 234 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1797 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA (genomic) 

65 (iii) HYPOTHETICAL: HO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 
70 (A) ORGANISM: PORY PHYROHONAS GIHGIVALIS 

(ix) FEATURE: 

(A) NAl-lE/KEY: misc_feature 

(B) LOCATION 1. . .1797 

75 
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[:<!) SEQUEIJCE DESCRIPTIOM: SEQ ID lJO:23^1 



ATGGAAAACT TAAAGAACAT TCAGCCCAGA 
GGTGGCGTCC ATGCTGCCGT GAGTCGTCAG 
5 AATACCATCA AGGAAAAGGA AGTGGTAATG 

GTGGTTATCA ATGTAGGGTA CAAATCGGAA 
AATCCCGAAC TCAAAGTGGG AGACGAAGTG 
AAGGGCCAGC TCGTCTTGTC TCACCGCAAG 
AACGAGGCTC TCGAAAAAGA CGAAATCGTA 

10 GGTATGATCG TCGATGTATT CGGTATCGAG 
CGCCCCATTC GCGACTACGA TGCATTCGTT 
ATCAATCAAG AATATAAGAA TGTAGTTGTT 
GAACAACAGA AGAAAGAAAT CATCGGCAAG 
GTCAAGAATA TTACTTCTTA CGGAGTATTT 

15 CATATCACTG ACCTTTCATG GGGTCGTGTG 

CAGAAGATCA ATGTCGTTAT CCTCG ACTTT 
CTC7\AACAGC TGATGCCTCA TCCTTGGGAT 
AAGGTG/VAGG GT7\AAGTTGT GGTGATGGCA 
GGCGTTGAGG GTCTTATCCA CGTAAGCGAA 

20 CAGGACTTCC TGCATGTAGG CGACGAAGTG 
GAACGCAAAA TGTCGCTCGG TCTGAAGCAA 
ACTCGTTTCC CTGTAGGCTC TCGTCACCAT 
GTATTCGTTG AGATCGAAGA GGGCGTAGAT 
ACGAAGAAGA TCAAACACCC CAGCGAGTTT 

25 GTAATCGAGA TCGACAAGGA AAACCGTCGT 

AATCCTTGGG ATGTATTCGA GACGGTATTC 
ATCGAAGTGA TGGACAAGGG TGCTGTCGTT 
ACTCCGAAGC ACATGGTGAA GGAAGATGGC 
TTCAAGGTTA TTGAGTTCAA TAAGGATGCC 

30 TTCGAAGATG AGCAGAAAAT GGCTCAGCGT 

7WVGCGGCTC AGAAAGAAGC TGCTGCCGAA 
GCCACTCTCG GAGACCTCGG CGAGCTGGCC 



GAGGATTTCA ACTGGGAAGA GTTTGAGGCC 60 

GAGCAGGAAG CTGCTTATGA CAAAACGCTC 120 

GGTAGGGTAA CTGCTATCAA CAAGCGTGAA 180 

GGTGTGGTAC CTGCAACAGA ATTCCGCTAC 24 0 

GTVAGTTTATA TCGAGAATCA GGAAGAT/^G 300 

GGTCGTGCCG CTCGCTCTTG GGAGCGCGTG 360 

AAGGGCTATG TGAAGTGTCG TACCAAGGGT 4 20 

GCTTTCCTCC CGGGATCACA GATCGACGTG 4 80 

GAGAAGACGA TGGAGTTCAA GATTGTGAAA 54 0 

TCCCACAAGG TGCTCATCGA AGCAGAGCTC 600 

CTCGAAAAAG GGCAGGTACT CGAAGGTATC 660 

ATCGACCTCG GTGGAGTGGA TGGTCTTATC 7 20 

GCTCATCCGG AAGAAATCGT ACAGCTGGAT 7 80 

GATGAAGATC GCAAGCGTAT CGCTCTCGGA 84 0 

GCTCTCGACA GCGAGCTTAA GGTAGGCGAT 900 

GATTACGGTG CTTTCGTTGA GATTGCACAG 960 

ATGTCATGGA CACAGCACTT GCGTTCTGCT 102 0 

GAAGCCGTGA TCCTGACGCT CGACCGCGAA 108 0 

CTCAAGCCGG ATCCTTGGGC TGATATCGAA 1140 

GCTCGTGTTC GCAACTTCAC CAATTTCGGT 1200 

GGCCTTATCC ATATTTCCGA CCTTTCTTGG 1260 

ACGGAAGTAG GTGCTGATAT CGAAGTTCAG 132 0 

CTCAGCTTGG GTCACAAACA GTTGGAAGAG 1380 

ACTGTAGGAT CTATCCACGA AGGAACGGTA 14 40 

TCTCTGCCTT ACGGTGTGGA AGGTTTTGCC 1500 

TCACAGGCTG TACTCGAAGA GAAGTTACCT 1560 

AAGCGAATCA TTGTATCTCA TAGCCGTGTA 162 0 

GAAGCCAATG CAGAGCGTAA GGCTGAAGCC L680 

GCTGCCAATC CTGCACAGGC TGTAGAGAAA 17 4 0 

GCTTTGAAAG AAAAGCTTTC AGAAAAC 17 97 



35 



(2) INFORMATION FOR SEQ ID NO: 235 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1650 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS : double 

(D) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DNA {genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: NO 



50 



(vi) ORIGINAL SOURCE: 

( A ) ORGANI SI 1 : PORY PHYROMOHAS GINGI VALI S 



55 



(ix) 



FEATURE : 

(A) NAME /KEY: 

(B) LOCATION 



misc_f ea ture 
1- . .1550 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:235 



60 



65 



70 



75 



ATGGGTAGGG 
GAAGGTGTGG 
GTGGAAGTTT 
AAGGGTCGTG 
GTAAAGGGCT 
GAGGCTTTCC 
GTTGAGAAGA 
GTTTCCCACA 
AAGCTCGAAA 
TTTATCGACC 
GTGGCTCATC 
TTTGATGAAG 
GATGCTCTCG 
GCAGATTACG 
GAAATGTCAT 
GTGGAAGCCG 
CAACTCAAGC 
CATGCTCGTG 



TAACTGCTAT 
TACCTGCAAC 
ATATCGAGAA 
CCGCTCGCTC 
ATGTGAAGTG 
TCCCGGGATC 
CGA'J'GGAGTT 
AGGTGCTCAT 
AAGGGCAGGT 
TCGGTGGAGT 
CGGAAGAAAT 
ATCGCAAGCG 
ACAGCGAGCT 
GTGCTTTCGT 
GGACACAGCA 
TGATCCTGAC 
CGGATCCTTG 
TTCGCAACTT 



CAACAAGCGT 
AGAATTCCGC 
TCAGGAAGAT 
TTGGGAGCGC 
TCGTACCAAG 
ACAGATCGAC 
CAAGATTGTG 
CGAAGCAGAG 
ACTCGAAGGT 
GGATGGTCTT 
CGTACAGCTG 
TATCGCTCTC 
TAAGGTAGGC 
TGAGATTGCA 
CTTGCGTTCT 
GCTCGACCGC 
GGCTGATATC 
CACCAATTTC 



GAAGTGGTTA 
TACAATCCCG 
AAGAAGGGCC 
GTGTVACGAGG 
GGTGGTATGA 
GTGCGCCCCA 
TWVATCAATC 
CTCGAACAAC 
ATCGTCAAGA 
ATCCATATCA 
GATCAGAAGA 
GGACTCAAAC 
GATAAGGTGA 
CAGGGCGTTG 
GCTCAGGACT 
GAAGAACGCA 
GAAACTCGTT 
GGTGTATTCG 



TCAATGTAGG 
AACTCAAAGT 
AGCTCGTCTT 
CTCTCGAAA^\ 
TCGTCGATGT 
TTCGCGACTA 
AAGAATATAA 
AGAAGAAAGA 
ATATTACTTC 
CTGACCTTTC 
TCAATGTCGT 
AGCTGATGCC 
AGGGTAAAGT 
AGGGTCTTAT 
TCCTGCATGT 
AAATGTCGCT 
TCCCTGTAGG 
TTGAGATCGA 



GTACAAATCG 
GGGAGACGAA 
GTCTCACCGC 
AGACGAAATC 
ATTCGGTATC 
CGATGCATTC 
GAATGTAGTT 
AATCATCGGC 
TTACGGAGTA 
ATGGGGTCGT 
TATCCTCGAC 
TCATCCTTGG 
TGTGGTGATG 
CCACGTAAGC 
AGGCGACGAA 
CGGTCTGAAG 
CTCTCGTCAC 
AGAGGGCGTA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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GATGGCCTTA TCCATATTTC CGACCTTTCT TGGACGAAGA AGATCAAACA CCCCAGCGAG 1140 

TTTACGGAAG TAGGTGCTGA TATCGAAGTT CAGGTAATCG AGATCGACAA GGAAAACCGT 12 00 

CGTCTCAGCT TGGGTCACAA ACAGTTGGAA GAGAATCCTT GGGATGTATT CGAGACGGTA 1260 

TTCACTGTAG GATCTATCCA CGAAGGAACG GTAATCGAAG TGATGGAC.AA GGGTGCTGTC 132 0 

5 GTTTCTCTGC CTTACGGTGT GGAAGGTTTT GCCACTCCGA AGCACATGGT GAAGGAAGAT 138 0 

GGCTCACAGG CTGTACTCGA AGAGAAGTTA CCTTTCAAGG TTATTGAGTT CAATAAGGAT 14 4 0 

GCCAAGCGAA TCATTGTATC TCATAGCCGT GTATTCGAAG ATGAGCAGAA AATGGCTCAG 1500 

CGTGAAGCCA ATGCAGAGCG TAAGGCTGAA GCCAAAGCGG CTCAGAAAGA AGCTGCTGCC 1560 

GAAGCTGCCA ATCCTGCACA GGCTGTAGAG AAAGCCACTC TCGGAGACCT CGGCGAGCTG 162 0 

10 GCCGCTTTGA AAGAAAAGCT TTCAGAAAAC 1650 

(2) IHFORMATION FOR SEQ ID !IO:236 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1374 base pairs 

(B) TYPE; nucleic acid 

(C) STRAHDEDMESS : double 

(D) TOPOLOGY: circular 

20 

(ii) MOLECULE TYPE: DWA (genomic) 

(iii) HYPOTHETICAL: NO 

25 (iv) AUTI- SENSE: NO 

(vij ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMOt IAS GINGIVALIS 

30 (ix) FEATURE: 

(A) NAI'IE/KEY: mi sc_f ea tu re 

(B) LOCATION 1. . . 1374 



35 



60 



70 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 236 



ATGATCGTCG ATGTATTCGG TATCGAGGCT TTCCTCCCGG GATCACAGAT CGACGTGCGC 60 

CCCATTCGCG ACTACGATGC ATTCGTTGAG AAGACGATGG AGTTCAAGAT TGTGAAAATC 120 

AATCAAGAAT ATAAGAATGT AGTTGTTTCC CACAAGGTGC TCATCGAAGC AGAGCTCGAA 180 

CAACAGAAGA AAGAAATCAT CGGCAAGCTC GAAAAAGGGC AGGTACTCGA AGGTATCGTC 24 0 

40 AAGAATATTA CTTCTTACGG AGTATTTATC GACCTCGGTG GAGTGGATGG TCTTATCCAT 300 

ATCACTGACC TTTCATGGGG TCGTGTGGCT CATCCGGAAG AAATCGTACA GCTGGATCAG 360 

AAGATCAATG TCGTTATCCT CGACTTTGAT GAAGATCGCA AGCGTATCGC TCTCGGACTC 4 20 

AAACAGCTGA TGCCTCATCC TTGGGATGCT CTCGACAGCG AGCTTAAGGT AGGCGATAAG 4 80 

GTGAAGGGTA AAGTTGTGGT GATGGCAGAT TACGGTGCTT TCGTTGAGAT TGCACAGGGC 54 0 

45 GTTGAGGGTC TTATCCACGT AAGCGAAATG TCATGGACAC AGCACTTGCG TTCTGCTCAG 600 

GACTTCCTGC ATGTAGGCGA CGAAGTGGAA GCCGTGATCC TGACGCTCGA CCGCGAAGAA 660 

CGCAAAATGT CGCTCGGTCT GAAGCAACTC AAGCCGGATC CTTGGGCTGA TATCGAAACT 7 20 

CGTTTCCCTG TAGGCTCTCG TCACCATGCT CGTGTTCGCA ACTTCACCAA TTTCGGTGTA 7 80 

TTCGTTGAGA TCGAAGAGGG CGTAGATGGC CTTATCCATA TTTCCGACCT TTCTTGGACG 84 0 

50 AAGAAGATCA AACACCCCAG CGAGTTTACG GAAGTAGGTG CTGATATCGA AGTTCAGGTA 900 

ATCGAGATCG ACAAGGAAAA CCGTCGTCTC AGCTTGGGTC ACAAACAGTT GGAAGAGAAT 960 

CCTTGGGATG TATTCGAGAC GGTATTCACT GTAGGATCTA TCCACGAAGG AACGGTAATC 1020 

GAAGTGATGG ACAAGGGTGC TGTCGTTTCT CTGCCTTACG GTGTGGAAGG TTTTGCCACT 1080 

CCGAAGCACA TGGTGAAGGA AGATGGCTCA CAGGCTGTAC TCGAAGAGAA GTTACCTTTC 1140 

55 AAGGTTATTG AGTTCAATAA GGATGCCAAG CGAATCATTG TATCTCATAG CCGTGTATTC 1200 

GAAGATGAGC AGAAAATGGC TCAGCGTGAA GCCAATGCAG AGCGTAAGGC TGAAGCCAAA 12 60 

GCGGCTCAGA AAGAAGCTGC TGCCGAAGCT GCCAATCCTG CACAGGCTGT AGAGAAAGCC 1320 

ACTCTCGGAG ACCTCGGCGA GCTGGCCGCT TTGAAAGAAA AGCTTTCAGA AAAC 137 4 



(2) INFORMATION FOR SEQ ID NO: 2 37 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1278 base pairs 
65 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AUTI -SENSE: NO 

75 (vi) ORIGINAL SOURCE: 
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(A) ORGAMISn: PORVPH i ROHOIJAS GINGIVALIS 



(i::) FEATURE: 

(A) {lAflE/KEY: misc_f ea tur- 
5 (B) LOCATION 1...127 8 



(xi) SEQUEIICE DESCRIPTIOU: SEQ ID 1)0:23"? 



ATGGAGTTCA AGATTGTGAA T^ATCAATCAA 

10 GTGCTCATCG AAGCAGAGCT CGAACAACAG 
GGGCAGGTAC TCGAAGGTAT CGTCAAGAAT 
GGTGGAGTGG ATGGTCTTAT CCATATCACT 
GAAGAAATCG TACAGCTGGA TCAGAAGATC 
CGCAAGCGTA TCGCTCTCGG ACTCAAACAG 

15 AGCGAGCTTA AGGTAGGCGA TAAGGTGA-^G 

GCTTTCGTTG AGATTGCACA GGGCGTTGAG 
ACACAGCACT TGCGTTCTGC TCAGGACTTC 
ATCCTGACGC TCGACCGCGA AGAACGCAAA 
GATCCTTGGG CTGATATCGA AACTCGTTTC 

20 CGCAACTTCA CCAATTTCGG TGTATTCGTT 
CATATTTCCG ACCTTTCTTG GACGAAGAAG 
GGTGCTGATA TCGAAGTTCA GGTAATCGAG 
GGTCACAAAC AGTTGGAAGA GAATCCTTGG 
TCTATCCACG AAGGAACGGT AATCGAAGTG 

25 TACGGTGTGG AAGGTTTTGC CACTCCGAAG 

GTACTCG7y\G AGAAGTTACC TTTCAAGGTT 
ATTGTATCTC. ATAGCCGTGT ATTCGAAGAT 
GCAGAGCGTA AGGCTGAAGC CAAAGCGGCT 
CCTGCACAGG CTGTAGAGAA AGCCACTCTC 

30 GAAAAGCTTT CAGAAAAC 



GAATATAAGA ATGTAGTTGT TTCCCACAAG 60 

AAGAAAG7W\ TCATCGGCAA GCTCGAAAAA 120 

ATTACTTCTT ACGGAGTATT TATCGACCTC 180 

GACCTTTCAT GGGGTCGTGT GGCTCATCCG 24 0 

AATGTCGTTA TCCTCGACTT TGATGAAGAT 300 

CTGATGCCTC ATCCTTGGGA TGCTCTCGAC 360 

GGTAAAGTTG TGGTGATGGC AGATTACGGT 4 20 

GGTCTTATCC ACGTAAGCGA AATGTCATGG 4 80 

CTGCATGTAG GCGACGAAGT GGAAGCCGTG 54 0 

ATGTCGCTCG GTCTGAAGCA ACTCAAGCCG 600 

CCTGTAGGCT CTCGTCACCA TGCTCGTGTT 660 

GAGATCGAAG AGGGCGTAGA TGGCCTTATC 7 20 

ATC.^W\CACC CCAGCGAGTT TACGGAAGTA 7 80 

ATCGACAAGG AAAACCGTCG TCTCAGCTTG 84 0 

GATGTATTCG AGACGGTATT CACTGTAGGA 900 

ATGGACAAGG GTGCTGTCGT TTCTCTGCCT 960 

CACATGGTGA AGGAAGATGG CTCACAGGCT 102 0 

ATTGAGTTCA ATAAGGATGC CAAGCGAATC 108 0 

GAGCAGAAAA TGGCTCAGCG TGAAGCCAAT 114 0 

CAGhAAGPAG CTGCTGCCGA AGCTGCCAAT 1200 

GGAGACCTCG GCGAGCTGGC GGCTTTGAAA 1260 

1278 



(2) HI FORMAT I on FOR SEQ ID HO: 238 



35 



40 



45 



50 



55 



60 



65 



(i) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS: double 

(D) TOPOLOGY: circular 

{ii) liOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: HO 

(iv} AHTI-SEHSE: t(0 

{"i) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOMAS GINGIVALIS 

fix) FEATURE: 

(A) NAilE/KEY: misc_feature 

(B) LOCATION 1 . . .720 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238 



ATGAAAAAAG 
AACGTGCAGT 
AAACTGACTA 
ATCGACATGG 
CTG/^AGTTTT 
AGCTTTACTT 
TTTACACGTG 
ACCTATCAGA 
GG C T T C CT CG 
GAAGGGGATA 
AAAGGCATCG 
TTCGCTCGCA 



CTATTCTTTC 
TGCACTACGA 
CCACAGTGGA 
ATTACACGGG 
GGCAAGCTCC 
TCGGACACGA 
GATTTACGAT 
TCACCGGCAC 
ATCTTTGGGG 
AGTTCGTATT 
ACAAGGATTT 
TGGACAAATT 



CGGAGCGGCC 
TTTCGGTCAT 
AAACTTCACA 
CAAGGGTATC 
CGTTTCCATT 
TGCTCTAATC 
CACGCCCATG 
TTGGTACATG 
TTTCCCCCAA 
CCTGTCCGAA 
CAATCTCAGC 
CTCCTGCATC 



TTGCTCCTCG 
TCCATCTACG 
CCCGACAAAT 
CAGTCGGCCT 
CATTTGGAGT 
GGTGCCACCT 
TACAAGCATC 
CACTTTCTGG 
GAGAACCCm 
CCGCAGTTCT 
ATAGGGACAG 
CCTACTCTTG 



GCCTATGTGC 
ACGAACTAGA 
GGGGAAGCAC 
ATTGGGAGAT 
ACAACGGAGG 
ACACCTACAA 
TGGGTGCGCA 
ACGGTCTGCT 
TCGGGGGCCC 
GGATCAACCT 
AGATGGAAAT 
CGGTCAAATG 



C7\ACGCACAA 
TGGACGTCCC 
CTTCTTCTTC 
TTCGCGCGAA 
CCTCTCCACA 
CAACCCCTCC 
CGACTTCCAC 
TACCTTCAAC 
TGTGCTCAAA 
CAATCGCATC 
CAGCAGGAAC 
GACTTTCAAC 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 



70 (2) IN FORMATION FOR SEQ ID NO: 239 

(1) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 1302 base pairs 

(B) TYPE: nucleic acid 
75 (C) STRANDEDNESS: double 
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(D) TOPOLOGY: circular 
(ii) HOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) AUTI-SEHSE: HO 



10 



(vi) ORIGIllAL SOURCE: 

(A) ORGANISH: PORYPHYROMONAS GIUGIVALIS 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(ix) FEATURE: 

(A) HAI IE/KEY: misc_feature 

(B) LOCATIOII 1 . . .1302 

(xi) SEQUEIJCE DESCRI PTI014 : SEQ ID MO: 23 9 



ATGTATAAAG 
TTGCTATTTG 
ACTCGCTTCG 
GGACTCGGCG 
TCGGCTGTGG 
TATGCCGAGA 
CTTTTTCCTA 
GGGTACCAGT 
TACTTGGGGA 
AACTTCTCAA 
GTAATCTTCT 
GCTGCCAAGT 
GTTATCGGTG 
AATCAGGTTC 
GACTATTATA 
CTTCTCTTAG 
GATTGCAAAT 
AATGCCGTTG 
TACCTGAAAG 
TTCGGTATAG 
GACCGATTGA 
CTCACGTTCA 



ACTATAAGGG 
CAGCAGACGG 
GCCTTGGCCG 
TCGGTCTGCG 
ATTCGATGAC 
ACGGGAAAAJ\ 
TTTCCAAATC 
TCGGATCCGT 
CAGGCAATCT 
TAGGAGCCAA 
CCACGGAGGC 
TCGACTTCGG 
CCGTCTATTC 
AGAACGGTGT 
CCCTGCCTCA 
GAGCAGACGT 
TCCAGGACAG 
GGATGTGGCC 
TGCCGACTAA 
GAATCCCGCT 
TCCCGAAGGA 
ACGAGTCATG 



TTTGTATGCG 
TATACAGGCT 
TCTCGGAGAA 
TCAGGGCACA 
GTTTATCTTC 
GGACAATAGG 
CATTGCTATG 
TGATCAAGTG 
GAACGATCTC 
TGCTTCATCC 
TCCTTACAAT 
TATGCAGTAT 
TCCGCGGGTG 
AGTAGTGGAG 
TACATTGGGG 
CCAATATAGT 
AATACGGGTA 
TAAAGTTCGC 
AGGCGGTGTA 
CAATGACAGA 
GGGTATGATC 
GTTTAAAAAG 



TCGCTTCGGT 
CAGAACAACA 
CGGACGACTA 
TACGTCAATG 
GATTTCGGTG 
AAAATGGGAA 
AGTGCGGGAG 
GAAGGAGGCA 
TATGTCGGTA 
CTTTTTGGGC 
CCCGTACATC 
CACCTTCTTC 
AAGATGCATA 
AGCGAAACCC 
ATAGGTTTTT 
AAATGGAAAG 
TCTCTCGGCG 
TATCGCTTCG 
TATCAAGGAT 
CGTTCGTTCG 
AAAGAAAATG 
CTGAAACTGA 



GGTATGCCCT 
ACTTTACCGA 
TTAGTGGGCA 
CCGTCAATCC 
CATCTACCGG 
ACATTGAGTA 
TGCTTCCTTA 
GCGTCCAGTA 
TAGGTGCAAC 
GATTCACACA 
TCTCGACGCT 
TCAAATCAGA 
GCGAGCTGAC 
AAGAATATAT 
CTTATGAAAA 
GCGAGAAATT 
GAGAGATCAT 
GTTTACATGG 
ACCATATCGT 
TAAATGTCTC 
CTCTGAAATT 
AC 



GATCATTGGG 
GTCGCCTTAC 
TTCCATGGGA 
TGCTTCATAC 
AATTACGTGG 
TTTCGCCATG 
CTCCGCATCC 
CACCCGTA.AA 
CCCGTTCAAA 
CAGCAGGCAG 
GTACTTGAAG 
TCGTTCGCTC 
TCAGATAAAG 
CAAGGGAATG 
GAAAGATAAA 
TTATAAATCC 
ACCGGATATA 
TGAAA.ATTCT 
AGGTGCTGTA 
TCTTGAATAT 
GACCTTCGGC 



(2) IIIPORHATIOIJ FOR SEQ ID HO: 24 0 

(i) SEQUEHCE CHARACTERISTICS; 

(A) LENGTH: 2778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) tlOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: 110 

(vi) ORIGINAL SOURCE: 

(A) ORGANISH: PORYPHYROMONAS GINGIVAL IS 



(ix) FEATURE: 

(A) NAIIE/KEY: 

(B) LOCATION 



misc_f eature 
L . . . 2770 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2^0 



ATGCGATCGA 
CTGGAAGCCC 
ATTCAAGCCA 
ACTAATGAAA 
GTTTCATATG 
ATTACGCTCA 
CAGGCTAAAG 
TATACCGTAG 
ATCGGATCCG 
GGCAAAGAGT 
GTCAATAAAG 



TTTATCAATT 
AACAAGCCGG 
ACGTACAGCT 
AAGGGTTGTT 
TAGGTTACAC 
AAGATATATC 
CGGCAGAGGT 
CACAGGGAGC 
ATGGGAAGAT 
TTTTCTCCAA 
TACAGGTACT 



ACTGTTGTCA 
AGTAGCAGGT 
TGTACAGAGT 
CAGCCTGAAA 
TACCCACGAC 
CATGAACGAA 
CGTGGTACGC 
TTCTATCGAG 
CACCATCAAC 
AGATCCACAG 
GAACAAACTG 



ATACTCCTTG 
AGAGTATTGG 
ACCGGCCAlAG 
ACGTCACAGG 
GAAAAAATAT 
GATGCCCGTC 
AACGATACGC 
G;iJ\CTGATCA 
GGCAAGGACA 
GTGGCAATAA 
AGCGAGCTGT 



CTTCTCTTGG 
ACGAAGT^AGG 
TAGCCGTTGC 
AGGGTGACTA 
CTCTTAGAAA 
TTCTACAGAG 
TCGAATTCAA 
AGAAGCTACC 
TTAGCAAGAT 
AGAATCTTCC 
CGCGGATGAG 



TTTCGTCGGG 
CAACCCCATG 
CGCAGGTGCC 
CATTCTGCGC 
CGGGCAAACC 
TGTGACGGTG 
TGCCGGATCC 
CGGAGCAGAG 
CCTTGTCGAT 
GGCCGATATG 
CGGTTTCGAT 



60 
120 
180 
240 
300 
360 
4 20 
4G0 
54 0 
600 
660 
720 
780 
84 0 
900 
960 
1020 
1080 
114 0 
1200 
1260 
1302 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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GATGGAGAAG AGGAGACCGT AATCAACCTG 
TTCGGAACGC TTCAGGCCGG CTACGGTACC 
AATCGGTTCG ATGGAAATAA GCAATGGACA 
ATGGGCTTTA GCGAGATGGA GAG CG AG ATG 
5 GGTGGTCGAC GCGGCTTCGG CAATAGTGGA 

AACTTCAGTG TCGAATTCTC CTCTGCCCTT 
AACGACAAGG CCATAGAGAC GACCA7\ACGC 
TATATGGACG AAAATATATT GGAACGCTCT 
ATGCAATGGA AACCGTCCGA ACGTACCGAA 

10 AAGATCGATG GGTTCTTTAA CGACACATAC 
AACAAAGGTT CTATCCACCA AACTACACAA 
GATATCAGTC ACAAGCTCAA CGACGAAGGC 
CTGACCGACG AAGACGGAGA TGGCATATAT 
CAAAAGCAAT TCAACGACAA CTCCAACCTG 

15 CCGTTGGGTA AA7\ACTACTT CGCACAAGCG 
TCGGATCGTG AGGTGTACCG ACTGGGCGAT 
TACGGACTCT CCTACAGTAA CGAGTTCACC 
ATTGCCAAAA CGTGGGACTA CACCGTAGGA 
TATCGGAGCG TAGCCGGAGT AGAGCAGGAC 

20 CCGATGCTCC GAATCAACTA CAAACCGAGC 
GGACGCACGA CACAACCATC CATCAATCAG 
CTATTCGTGA CGGAAGGCAA TCCCGGTCTG 
ATGTTCTCGG ACTTCGATGC CAAAAGTCAG 
TATACATTCG ACGACATCGT CCCCAATACG 

25 ACTCGTTACG AAAACGCCTC CGGTACGTGG 
CCACTCAAGA ACmGGGCATT TTCTTTCAGG 
CAAAGCTTCA TCAATGACGA TAAGAACAAA 
ACGCTGACCT ATCGCAACAA TTGGATCGAT 
ATGGCGAATA ATAGTCTGAG CGGACAGAAA 

30 TATCAAGTTG ccctaacgct tccctatgga 

ACGAACTCCG GTTACAGCGG AGGATTCAGT 
TCATACAGCT TCCTCCGTGA CAAGGCCGGT 
GGTCAGCGGT CAAGTATCAG CCGTTCTGCT 
AATACGATCG GACGCTACGT GATGGTGGAC 
35 GGTGGATCTC GCAGCGATCA TCAGCGTGGC 

GGTGGCAGAC GACCGTCC 



ACGGTGAAGC CCGAAAAAAA GAAAGGCCTC 7 20 

GACCAACGCT ATATGGCCGG AGGGAACGTC 7 80 

TTGATCGGTA GTGCGAAC7VA TACGAACAAT 8-^0 

GGATCCATGA CCTTCTTCTC TCCCCAAGGC 900 

GGTGTTACGT CTTCGTCGAT GCTGGGCGGC 960 

AATACAGGAG GCGATGCACG CTACGGATAC 102 0 

GTGGAAAATA TCCTCGCCGA AGGGAATACT 108 0 

TTCTCTCACA ATGGTCAGGC GCGATTTAGG 114 0 

GTGGTATTCG AGCCGGATCT TTCGATATCC 1200 

GAGACGAAAG ATGCCACCGG AATCTCTATC 12 60 

GGAAACAACT TCAGACTGAA CGGAGAATTG 132 0 

CGTACGATCA GTGCCTCCGT CAGTGGCGGT 138 0 

CAGGCTGTGC TCCAAAGCGT GGAGACGAAT 144 0 

CAATATCGGC TTCGCCTCTC GTATGTGGAA 1500 

ATTCTGAACA GACGTTTCTC CCGTCGCAAT 1560 

GACGGGCAAT ACTCCATATT AGACAGTCAG 162 0 

CAGTATCGCA TCGGACTCAA CCTCAAGAAG 168 0 

TTCAATGTGG ATCCCAACAG AACTGTCAGC 17 4 0 

AAACTGGCTT TCAATCGTGT CAATCTCTCC 1800 

AGGACTACCA ACCTCCGAGT GGACTACCGA 18 60 

ATCGCTCCCG TTCAGGACAT CACGAATCCG 192 0 

AAGCCGAGCT ATTCCAACAA TGTGATGGCC 198 0 

CGAGCTTTCA ACATTGTTTT CTTCGGCAAC 2 04 0 

CACTACGATC CGTCTACAGG GATCCGTACC 2100 

CAAGCGAATC TTCATGGGAC ACTATCGCTT 2160 

ATGTCCTTGT TCAACAGGTT GGCCGAAGGA 2220 

GCTCTCTCTT TCCGAACGAG GGAACGCCTG 22 80 

ACGAGTATCG GTGGCAATAT CGGATTCTAT 2 34 0 

GATTCTCGCA CATACGATTT TGGCGGCA?\T 2 4 00 

TTCCGTATCG ACAGCGATGT TGAATACAAT 2 4 60 

CTGGACGAAT GGCTTTGGAA TGCTTCGCTT 2520 

ACACTGCGTG TCAATGGCTA TGACATCCTC 2 580 

TCGGCCATCA ATATAGAAGA GAGCATGTCC 2 64 0 

TTTATCTACC GATTCAACGC CTTCAGTGGT 27 00 

AATATGAATC GTCCGGGCCC ACCTTTCGGC 27 60 

2778 



40 



50 



(2) IIIFORIIATIOM FOR SEQ ID NO: 241 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 
45 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DlIA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) AllTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 



55 

{ ix) FEATURE: 



(A) NAtlE/KEY: mi£C_f eatur€ 

(B) LOCATION 1 . . .1200 



60 (>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 241 



ATGGTTATGA AGCTGATTAA AAGAAGTTTG 
CTTCCTGCGT ACTCGCAGAA TGATGACATC 
GAAATACGTA AAC7\AAACCA AGTTAAAGAC 

65 GATACGGAAT ATACAGTGGC TTCCGATCGG 
CAGTCCTACG ATGGGAAAAA GTTGTCCAAA 
GTTCCCGGTC GCTATAGTCG CCGCTTGGCT 
TCAGGTGCCG ACAATGTATA TGTAACTGAT 
TACTATGATG ACGCGTCGTC TGTAAACATT 

70 CCTTATACGT CATGGTATCC ATCTTTCTCC 
TGGTTCTACT ACGGTAGCCA TATCGGATGG 
TGGAGCTACT ACTATGATCC TTTCTACAAT 
CCTTATGGCT GGGGCAGCTA TTACGGTTGG 
CACCACTACC CCAAGAAGAC CTATTCCAAT 

75 GGCCGACCGA ATCGTATCAA AGGTGGAACG 



CTCCTGCTTG GAGCGGTACT GCTGATTACG 60 

TTCGAAGATG ACATCTATAC ATCGCGAAAA 120 

TGGCAAAACC AAGAGGACGG ATACGGCGAC 180 

GACATTGACG CCTACAATCG TAGAGATGGC 240 

GACAAGAAAA GAGACTCCAC TCGTTCTTCT 3 00 

CGATTCTATA AGCCGAATAC GATCGTCATT 3 60 

GATGGTGAGT ATTTCGTCTA TGGAGACGAA 4 20 

TACATCAACA GTCCTTGGTG CGATCCGTTC 4 80 

GGCTGGTACA ACTATACGTG GAACTATCCA 540 

GGCGGTTATT ACCCCGGATA TAATTGGTAT 600 

CCCTATGGAA TCGGTATGGG TTGGGGATAT 660 

GGTGGCTATC CGGGAGTGAT ACATCACTAC 7 20 

GGTCAGCATT CCGGAGCTTA CTATTCTTAT 7 80 

TCCGGTGCCA AACTTGGGAC AGGACGCTAC 84 0 
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GATAGAATTC 
AATAATAATC 
ATAGAAACGG 
CAGTCCGGCA 
AACGATAGGA 
TCTCGCTCTT 



AAAATTCGTC 
TGCAAAATGT 
TA.ACTCCAAA 
ATGACCGACC 
CATTTTCGAC 
CTTCCGGCTC 



TTCGCAAA7\A 
CAAGTCGGGA 
CAACGGGCAA 
GACCGGACGG 
TCCTTCTCGT 
TATGAGCGGA 



AATAAGTTCG 
CGTACCGGCC 
AAGCAGAATC 
AATATCCGCA 
AGCAATAGTA 
GGTGGCGGAC 



GATTGCAGTC 
GAGCCAATAG 
GTCCCGTATT 
GCGAGAGACA 
ACGGTGGCTT 
GTAGTGGCCG 



GAACAAACCC 
AGACCGAAAT 
CCAGCAGAAT 
GGGGGAAAAT 
CTCCACGCCT 
GGGACGCAAT 



900 
960 
1020 
108 0 
1140 
1200 



10 



(2) IMFORIIATIOII FOR SEQ ID IIO:24 2 



(i) SEQUEIJCE CHARACTERISTICS: 

(A) LENGTH: 1194 base pairs 
{B) TYPE: nucleic acid 
(C) STRAMDEDtlESS: double 
15 {D} TOPOLOGY: circular 



20 



25 



55 



65 



70 



(ii) MOLECULE TYPE: DriA (genomic) 

(iii) HYPOTHETICAL: MO 

(iv) AIITI-SEUSE: IIO 

(vi) ORIGIIJAL SOURCE: 

(A) ORGAMISn: PORY PHYROf-IOI JAS GIIIGIVALIS 



(i::) FEATURE: 

(A) tlAlIE/KEY: misc_featur- 

(B) LOCATIOII 1 . , , 1194 

30 (xi) SEQUEIICE DESCRI PTIOl I : SEQ ID lJO:242 

ATGAAGCTGA TTAAAAGAAG TTTGCTCCTG CTTGGAGCGG TACTGCTGAT TACGCTTCCT 60 

GCGTACTCGC AGAATGATGA CATCTTCGAA GATGACATCT ATACATCGCG AAAAGAAATA 120 

CGTAAACAAA ACC.AAGTTAA AGACTGGCAA AACCAAGAGG ACGGATACGG CGACGATACG 180 

35 GAATATACAG TGGCTTCCGA TCGGGACATT GACGCCTACA ATCGTAGAGA TGGCCAGTCC 24 0 

TACGATGGGA AAAAGTTGTC CAAAGACAAG AAAAGAGACT CCACTCGTTC TTCTGTTCCC 300 

GGTCGCTATA GTCGCCGCTT GGCTCGATTC TATAAGCCGA ATACGATCGT CATTTCAGGT 360 

GCCGACAATG TATATGTAAC TGATGATGGT GAGTATTTCG TCTATGGAGA CGAATACTAT 4 20 

GATGACGCGT CGTCTGTAAA CATTTACATC AACAGTCCTT GGTGCGATCC GTTCCCTTAT 4 80 

40 ACGTCATGGT ATCCATCTTT CTCCGGCTGG TACAACTATA CGTGGAACTA TCCATGGTTC 54 0 

TACTACGGTA GCCATATCGG ATGGGGCGGT TATTACCCCG GATATAATTG GTATTGGAGC 600 

TACTACTATG ATCCTTTCTA CAATCCCTAT GGAATCGGTA TGGGTTGGGG ATATCCTTAT 660 

GGCTGGGGCA GCTATTACGG TTGGGGTGGC TATCCGGGAG TGATACATCA CTACCACCAC 120 

TACCCCAAGA AGACCTATTC CAATGGTCAG CATTCCGGAG CTTACTATTC TTATGGCCGA 7 80 

45 CCGAATCGTA TCAAAGGTGG AACGTCCGGT GCCAAACTTG GGACAGGACG CTACGATAGA 84 0 

ATTCAAAATT CGTCTTCGCA AAAAAATAAG TTCGGATTGC AGTCGAACAA ACCCAATAAT 900 

AATCTGCAAA ATGTCAAGTC GGGACGTACC GGCCGAGCCA ATAGAGACCG AAATATAGAA 960 

ACGGTAACTC CAAACAACGG GCAAAAGCAG AJ\TCGTCCCG TATTCCAGCA GAATCAGTCC 102 0 

GGCAATGACC GACCGACCGG ACGGAATATC CGCAGCGAGA GACAGGGGGA AAATAACGAT 1080 

50 AGGACATTTT CGACTCCTTC TCGTAGCAAT AGTAACGGTG GCTTCTCCAC GCCTTCTCGC 114 0 

TCTTCTTCCG GCTCTATGAG CGGAGGTGGC GGACGTAGTG GCCGGGGACG CAAT 1194 



(2) INFORMATIOM FOR SEQ ID MO: 243 



(i) SEQUEUCE CHARACTERISTICS: 

(A) LEMGTH: 1743 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDMESS: double 
60 (D) TOPOLOGY: circular 



(ii) IIOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: MO 

(iv) AIITI-SEMSE: MO 

(vi) ORIGIHAL SOURCE: 

(A) ORGAIIISM: PORY PHYROMOMAS GIIIGIVALIS 



(i>:) FEATURE: 

(A) MAIIE/KEY: misc_feature 
(D) LOCATIOII 1 ... 1743 

75 (xi) SEQUEtlCE DESCRIPTIOM: SEQ ID MO: 243 
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ATGATCCGCA AGTTGATCTT ACTGCTTGCT CTGATGCCGG TAGCCTCTGT GGCTTTTGCG 60 

GTACCAACGG ACAGCACGGA ATCGAAAGAC AATCGTATCC TTACAAGCAT GCAATCCTCC 120 

TCTTTGAATA GGGATGATGC TCCGGATAAA TGGCAACCTA TGCATGCCAA TTTCAGTATT 180 

5 CAGAGCGATA TGCTGCTTTC TACTGCCCAA AAGTCCAAGA ACACCTGGTT CGGCAACTCC 2 -JO 

TATATCATGG GTATAATCAA GAACAATTAT CTGGAGTTTG GTGCCCGTTT CGAGGATCTC 300 

TATAAGCCCC TGCCCGGACA TGAACCCGAG ATGGGGCGTG GCGTTCCTCA CATGTATGTG 3 60 

AAGGGAAGCT ATCATTGGGC GGAGCTGACT ATGGGAGACT TCTACGATCA GTTCGGTAGC '1 2 0 

GGTATGGTAT TCCGCACCTA TGAAGAGCGC AACCTCGGTA TAGACAACGC GGTTCGCGGC 4 80 

10 GGACGTATAG TACTCACTCC TTTTGATGGA GTGCGTGTCA AGGGTATTGC AGGACAGCAG 54 0 

CGTAACTACT TCGACCGCAC GGGCAAGGTA TTCAATTCCG GCCGAGGCTA CCTACT6GGT 600 

TCTGATCTGG AGCTGAATGT AGAGCGTTGG AGCAGTGCCA TGCGCGACAA TGACTATCAT 660 

TTGGCTATCG GGGGATCGTT CGTTTCCAAA CACGAAGCAG ACGAAGATAT ATTTGTGGGT 7 20 

GTAGGCGAAG ATCGCAAGCG ACTCAACCTG CCGCTCAATG TCCCGATTAT GGGCCTGCGC 7 80 

15 ACCAACTTTC AAAAAGGAGG TCTCGCCCTC TACGCAGAGT ATGGATACAA ATACAACGAT 8^0 

CCCTCGGCAG ACAATGACTA TATCTACCAC GACGGACAGG CTGCACTCCT CTCTGCCTCA 900 

TACTCC7yW\ AAGGGATGAG TATCCTGTTG CAGGCCAAAC GTTGTGAGAA CTTTGCTTTC 960 

CGCAGCAAGC GAAGTGCCCA GCTCACACCG CTTATGATCA ACTATATGCC GGCTTTTACC 1020 

CAAGCTCACA CTTATACGCT GGCGGCCATC TACCCCTATG CTACTCAGCC TCAGGGAGAA 1080 

20 TGGGCTTTCC AAGGTGAACT GCGTTACAAC TTTGCTCGCC GGACAGCTCT CGGTGGACGC 114 0 

TACGGTACCG GCTTGCGTAT CAACGTTTCG CATGTGCGTG GTCTGGACAA AAAGATGCTC 1200 

AAAGAGTUKTC CCGACGAACT GATCGGAACG GATGGCTACA CCGTTTCTTT CTTCGGCATG 12 60 

GGCGACCTCT ATTATTCGGA TATAGATGTG GAGATTACTA AAAAGGTAAG CCCAGGATTC 132 0 

AACTTTACGC TCACCTACTT GAATCAGATC TACAATAACA AGGTACTGCA CGGTGCAGCC 138 0 

25 GGAGAGAAGC CTGAGAAGAT CTATGCCAAT ATCTTCGTCT ATGATGGTAA GTATAAGCTG 14 4 0 

AGTAATAAGG TAGCCCTCCG TACCGAACTG CAATATTTGC ACACGAAGCA GGATCAGGGT 1500 

GACTGGATCT ACGGCATGGC CGAGCTCTCT ATCCTGCCTT CTCTGATGCT TTCCCTCTCG 1560 

GAGCAGTATA ATATCGGAGA GACCAAGAAA CATTATGTCA TGGGGTCTGT CACCTATACT 162 0 

CACGGAGCAC ATCGAGTAGC TTTCTCTGCA GGCAAAACCC GTGCAGGGAT GAACTGCTCG 168 0 

30 GGAGGTGTAT GTCGTGTGGT CCCTGAGACT CAGGGATTCT ACCTTTCTTA TAGCACCAAT 17 4 0 

CTG 1743 



35 



45 



50 



70 



{2) IllFORMATIOM FOR SEQ ID NO: 24 4 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
40 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAN I SI 4: PORYPHYROMONAS GINGIVAL IS 



(ix) FEATURE: 

(A) NAI^IE/KEY: misc_feature 
{ B) LOCATION 1 . . . 717 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID MO:244 

ATGCGTTCTT TATTTTTGAG CGCGTTGCGC AGCTCCTCTC TCCATGGTTC AGAGCGACGC 60 

AGTCGGATAA GTTCTTCTGT AGTCATGTCA ATAAGGCAGA AAATAAGGTT ATTCCATCTC 120 

TCGGTATGCG CCCAAACGCA TGATCATCTC ATCGAAATCC ACTTGGTGTG CATCGAATTC 180 

60 GGGGCCATCG ACACAGACGA ATTTCGTCTG TCCTCCCACG CTTATACGAC AAGCCCCACA 24 0 

CATACCGGTG CCATCCACCA TAATTGTATT GAGAGAAGCT ATGGTCGGTA TCTCGTAACG 300 

TTTGGTCAGG AGAGAAACGA ACT TC AT CAT CACAGCCGGC CCGATCGTAA CGCAGAGGTC 360 

TACCGTTTCC CGTTTGATAA CGCTTTCCAC TCCATCCGTT ACGAGGCCTT TCGTCCCATA 4 20 

AGACCCATCG TCTGTCATGA TGATCACTTC ATCGCTATTG GCTCGCATTT GTTCTTCAAG 4 80 

65 GATAACCAGA TCTTTAGTTC TGGCAGCCAA TACGACAATT ACACGGTTGC CTGCTTTGTG 54 0 

G7W\GCCTCC ACGATCGGGA GCAAAGGAGC CACACCCACA CCGCCTCCGG CACAAACCAC 600 

TGTGCCGACC TTTTCGATAT GCGTACTCTG TCCCAGCGGA CCTACCACAT CCGTGATATA 660 

GTCGCCGACT TCGAGTTCGG CCAATTTCTT GGAAGATTTG CCCACGGCCT GAACCAC H"^ 



(2) INFORMATION FOR SEQ ID MO: 24 5 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 
75 IB) TYPE: nucleic acid 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



167/ 490 



10 



15 



20 



25 



30 



35 



40 



(C) STRAIIDEDUESS: double 

(D) TOPOLOGV : circular 

(ii) MOLECULE TYPE: DJIA (genomic) 

(iii) HYPOTHETICAL: MO 

(iv) AMTI-SEIISE: HO 

(vi) ORIGIMAL SOURCE: 

(A) ORGAN ISM: PORY PHYROMOMAS GIUGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATIOtI 1 . . . 633 

(:<i) SEQUEHCE DESCRIPTIOH: SEQ ID lJO:245 

ATGTCAATAA GGCAGAAAAT AAGGTTATTC CATCTCTCGG TATGCGCCCA AACGCATGAT 

CATCTCAT CG AA/^TCCACTT GGTGTGCATC GAATTCGGGG CCATCGACAC AGACGAATTT 

CGTCTGTCCT CCCACGCTTA TACGACAAGC CCCACACATA CCGGTGCCAT CCACCATAAT 

TGTATTGAGA GAAGCTATGG TCGGTATCTC GTAACGTTTG GTCAGGAGAG AAACGAACTT 

CATCATCACA GCCGGCCCGA TCGTAACGCA GAGGTCTACC GTTTCCCGTT TGATAACGCT 

TTCCACTCCA TCCGTTACGA GGCCTTTCGT CCCATAAGAC CCATCGTCTG TCATGATGAT 

CACTTCATCG CTATTGGCTC GCATTTGTTC TTCAAGGATA ACCAGATCTT TAGTTCTGGC 

AGCCAATACG ACA.ATYACAC GGTTGCCTGC TTTGTGGAAA GCCTCCACGA TCGGG AGO AA 

AGGAGCCACA CCCACACCGC CTCCGGCACA AACCACTGTG CCGACCTTTT CGATATGCGT 

ACTCTGTCCC AGCGGACCTA CCACATCCGT GATATAGTCG CCGACTTCGA GTTCGGCCAA 

TTTCTTGGAA GATTTGCCCA CGGCCTGAAC CAC 

(2) IHFORIIATIOII FOR SEQ ID MO: 2^6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23*13 base pairs 

(B) TYPE: nucleic acid 

(C) STRA14DEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: MO 



60 
120 
180 
240 
300 
360 
4 20 
480 
540 
600 
633 



45 



50 



55 



60 



65 



70 



75 



(iv) AMTI-SEMSE: HO 

(vi) ORIGIMAL SOURCE: 

(A) ORGAIJISfl: PORY PHYROMOMAS GINGIVALIS 

(ix) FEATURE: 

(A) MA14E/?:EY: misc_f eature 

(B) LOCATIOH 1 . . .2343 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 24 6 

ATGGATCGTC CTAAGCCTTC ATATATTGTT CGAATAGCAG CCATTCTCTG CTTGTTTGTC 60 

GGCAGGCCTT TGTTTGCGCA GAGCTATGTG GACTACGTCG ATCCGCTGAT CGGGACGCTA 12 0 

AGTTCTTTTG AGCTGAGTGC GGGCAATACC TATCCGGTGA TCGGTTTACC GTGGGGAATG 180 

AATAGCTGGA CACCGATGAC CGGTGTACCC GGTGACGGCT GGCAATATAC CTACTCGGCA 24 0 

CACAAGATTC GCGGATTCAA ACAGACCCAC CAACCCAGTC CTTGGATCAA CGACTACGGC 300 

CAATTCTCCC TTCTTCCCCT TACGGCACCG CAGAAGCCAT CATCGAACGA CTCCATAGCT 360 

CTGACTAAAT GGTGCAAGCA ACTCTTTTCG GACGAACAGA CCTCGTGGTT CTCGCACAAA. 4 20 

GCGGAGACGG CGACGCCATA CTATTATAGT GTCTATTTGG CCGATTACGA CACACGCGTG 4 GO 

GAGATGGCTC CGACCGAGCG TGCAGCTATC TTTCGCATAC GTTATTCCGG CAATACCGAA 54 0 

AGTGGCTCCG GTCGATGGCT TCGTCTTGAT GCCTTTACCG GTGGTTCGGA GATTAGCATC 600 

GTGGATCCTC ACACCGTAGT GGGCATATCT CGCAAGAATA GCGGAGGTGT GCCGGCTAAC 660 

TTCGCCTGTT ATTTCATCCT GCAGTCCGAT ACTCCTATGG CCGATGTCCT GCTTGAGACA 72 0 

GATACCGGCA AGTCAGACGA AGGCACAAGG GCATGGGCAG CCTGTCGCTT CGATTCGCA.A 7 80 

GAAGTTACCG TCCGGGTGGC ATCTTCTTTT ATCAGTGTCG AGCAGGCCGA AAGAAATCTT 84 0 

GCGGAAGTCA AJ\GGGCAGAG TTTCGACCGG ATCAGACTTG CCGGTCGCGA AGCTTGGAAT 900 

AAGGTGCTCG GACGCATACA TGTGGAAGGA GGAACGAAGG ATGAGCGCAC TACATTCTAT 960 

TCCGCACTCT ATCGCTGTCT GCTTTTTCCC CGTCGCTTCT ATGAGGAGGA TGCTTCCGGC 1020 

AATTTTGTGC ATTACAGCCC CTACAATGGA GAGGTACTTC CCGGTTATCT CTATACCGAT 1080 

ACCGGATTTT GGGACACTTT TCGAGCCCTT TTCCCCCTGC TCAATCTGCT GTATCCCGAT 114 0 

GAAAACATTA AAATTCAGGA AGGTCTGCTG AATGTATATC GCGAGAGTGG CTTTTTCCCC 1200 

GAATGGGCCA GTCCGGGCCA TCGGGATTGT ATGATAGGCA ACAACTCTGC TTCTGTTCTG 1260 
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GCGGATGCCT ACCTCAAGGG TGTTCGGGTA GAAGATACCC GTACACTGAT GAACGGACTC 1320 

TTGCATGCTA CGAAAGCCGT CCATCCGAAA ATCTCCTCCA CGGGTCGCAA AGGTTGGGAG 1380 

TGGTACAACT CCTTAGGTTA TGTTCCGGCT GATGCAGGCA TCGACGAAAG TGCTGCCCGT 14 40 

ACGCTCGAAT ATGCTTATAA CGATTGGTGC ATCCTCCGAC TGGGGCGCAC ATTGGGTTGG 1500 

5 GATAGAGCTG CATTGGACAC GTTGGCTCAT CGTTCGATGA ACTATCGTCA TCTGTTCGAT 1560 

CCGGAAACCA AACTCATGCG CGGTAGAAAT CAGGATGGTA GTTTCCGGAC ACCTTTTTCC 1620 

CCTTTCAAAT GGGGAGATGT ATTCACGGAG GGCAATGCCT GGCACTACAC TTGGTCGGTC 168 0 

TTTCATGATG TGCAGGGGCT TATCGACCTG ATGGGAGGAG ATCGCCCGTT CGTGTCTATG 17 4 0 

CTCGATTCGG TATTCAATAC TCCTCCTATG TTCGATGAGA GCTATTACGG ATTTGTCATC 1800 

10 CACGAAATCA GAGAGATGCA AATAGCGGAT ATGGGCAATT ATGCTCATGG CAATCAACCC 1860 

ATACAGCATA TGATATATCT GTATAATCAT GCCGGTCATC CATGGAAAGC TCAGGAGAGA 1920 

CTACGCGAAG TGATGGGGCG GCTCTATCGT CCTACTCCGG ATGGGTATTG CGGCGATGAA 1980 

GACAACGGAC AGACTTCGGC TTGGTACGTT TTCTCTGCTT TAGGCTTCTA TCCTGTTACA 20^0 

CCCGCTACGG ATCAGTATGT GCTCGGTTCG CCGATTTTTT CCAAGGTAAT ACTCTCTTTT 2100 

15 CCCGACGGAC ACAAAACGGT GTTGCATGCT CCGGCCAACA GTGCCGATAC GCCTTACATC 2160 

CGCTCGATCA GCGTAGAAGG AAAAGAATGG AGCTGCAATT ACCTGACTCA CGAACAGCTT 222 0 

CGCTCTTCTG CATCCATTCA ATGGATGATG GACACGAAAC CCAATTATAA TCGTGGTATG 2 2 80 

AAGGAAAGTG ACAGACCTTA TTCCTTCTCC ACGGAGCAAC AGCGTCGCGC TAATCACAGT 234 0 

AAT 23^13 



20 



35 



60 



70 



(2) lilFORIIATION FOR SEQ ID NO: 2^7 



( i ) SEQUEIJCE CHARACTERISTICS : 
25 (A) LEMGTH: 813 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDHESS: double 

(D) TOPOLOGY: circular 

30 (ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) AMTI-SEllSE: HO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOMAS GINGIVALIS 



(ix) FEATURE: 
40 {A) MAl'IE/KEY: misc_f eature 

(B) LOCATIOM 1 . . .813 

(xi) SEQUEMCE DESCRIPTION: SEQ ID NO: 2 47 

45 ATGATGAAAT CCATGCGCAG CGTGCTGTTG CTACTCTTTC CATTGTCTTT GATCACTGCT 60 

TTGGGCTGTA GCAATAACAA AGCTGCCGAA TCGAAGTCTG TCTCTTTCGA TTCGGCCTAT 12 0 

CTCGAACGCT ACATCCCTCT GCGGGCAGAC ATAGATACGC CATCGCTGCA TGTGATGATC 180 

AGCTACGTCT ATCCTTCGGG AGATGATATG CTCACAGAGA TTTTCAACGG TTTGCTCTTC 24 0 

GGCGACAGCC TGATGGATTC CTCTTCGCCG GAGAATGCCA TGGAAGGCTA TGCACAGATG 300 

50 CTGGGAGAAG ACTATCGCTC TAACAATGCC GAAGCC7\ATC TGC7KAGGGCT TCCTTCTGAC 360 

CTTTTGGACT ATATCTACAA GCAGGAAAAT ACCATCGCTT ATTGCGATAC GGGATTGATC 4 20 

TCCACGCGCA TCAATACATA TACTTACGAA GGCGGTGCAC ATACGGAGAA TACAGTCCGG 4 80 

TTTGCCAACA TCCTTCGCAC CACCGGCAAG GTGCTCGAAG AGCGAGATAT ATTCAAGATC 54 0 

GACTATGCGG AAAGGCTGTC CGCACTCATC ATAGGACAAT TGGTGCACGA TTTCGGCAAG 600 

55 ACCACACCTG CCGAATTGGA TGCAATAGGT TTCTTCAACG CAGAAGAAAT ACAGCCCAAT 660 

GGCAATTTTA TGATCGATGA CAAAGGTCTC ACATACTGTT TCAATGAGTA TCAGATAGCT 72 0 

GCTTATGCCA GAGGTGCTGT CTATGTCCGT CTCGGATATG ACGTATTGGC TCCTTTGCTA 7 80 

AGGGATGATT CCCCACTAAA GCGTTACTTG CCG 813 



(2) IMFORHATIOH FOR SEQ ID NO: 248 



(i) SEQUEIJCE CHARACTERISTICS: 

(A) LENGTH: 810 base pairs 
65 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

75 (vi) ORIGINAL SOURCE: 
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(A) ORGAMISM: PORY PHYROMONAS GIIIGIVALIS 

(ix) FEATURE; 

{A) IIAJIE/KEY: misc_feature 
5 (B) LOCATION 1 . . . 810 

(xi) SEQUENCE DESCRIPTIOM: SEQ ID IIO:2^8 

ATGAAATCCA TGCGCAGCGT GCTGTTGCTA CTCTTTCCAT TGTCTTTGAT CACTGCTTTG 50 

10 GGCTGTAGCA ATAAC7\AAGC TGCCGAATCG AAGTCTGTCT CTTTCGATTC GGCCTATCTC 120 

GAACGCTACA TCCCTCTGCG GGCAGACATA GATACGCCAT CGCTGCATGT GATGATCAGC 180 

TACGTCTATC CTTCGGGAGA TGATATGCTC ACAGAGATTT TCAACGGTTT GCTCTTCGGC 24 0 

GACAGCCTGA TGGATTCCTC TTCGCCGGAG AATGCCATGG AAGGCTATGC ACAGATGCTG 300 

GGAGAAGACT ATCGCTCTAA CAATGCCGAA GCCAATCTGC AAGGGCTTCC TTCTGACCTT 360 

15 TTGGACTATA TCTACAAGCA GGAAAATACC ATCGCTTATT GCGATACGGG ATTGATCTCC 42 0 

ACGCGCATCA ATACATATAC TTACGAAGGC GGTGCACATA CGGAGAATAC AGTCCGGTTT ^80 

GCCAACATCC TTCGCACCAC CGGCAAGGTG CTCGAAGAGC GAGATATATT CAAGATCGAC 54 0 

TATGCGGAAA GGCTGTCCGC ACTCATCATA GGACAATTGG TGCACGATTT CGGCAAGACC 600 

ACACCTGCCG AATTGGATGC AATAGGTTTC TTCAACGCAG AAGAAATACA GCCCAATGGC 66 0 

20 AATTTTATGA TCGATGACAA AGGTCTCACA TACTGTTTCA ATGAGTATCA GATAGCTGCT 72 0 

TATGCCAGAG GTGCTGTCTA TGTCCGTCTC GGATATGACG TATTGGCTCC TTTGCTAAGG 7 80 

GATGATTCCC CACTAAAGCG TTACTTGCCG 810 

25 (2) lUFORMATIOlI FOR SEQ ID 1J0:249 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 801 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRAMDEDMESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 
35 (iii} HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



40 



45 



65 



70 



75 



(vl) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) M.^'IE/KEY: mi sc_f eature 

(B) LOCATION 1 ... 801 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 9 



ATGCGCAGCG TGCTGTTGCT ACTCTTTCCA TTGTCTTTGA TCACTGCTTT GGGCTGTAGC 60 

AATAACAAAG CTGCCGAATC GAAGTCTGTC TCTTTCGATT CGGCCTATCT CGAACGCTAC 120 

50 ATCCCTCTGC GGGCAGACAT AGATACGCCA TCGCTGCATG TGATGATCAG CTACGTCTAT 180 

CCTTCGGGAG ATGATATGCT CACAGAGATT TTCAACGGTT TGCTCTTCGG CGACAGCCTG 24 0 

ATGGATTCCT CTTCGCCGGA GAATGCCATG GAAGGCTATG CACAGATGCT GGGAGAAGAC 300 

TATCGCTCTA ACAATGCCGA AGCCAATCTG CAAGGGCTTC CTTCTGACCT TTTGGACTAT 360 

ATCTACAAGC AGGAAAATAC CATCGCTTAT TGCGATACGG GATTGATCTC CACGCGCATC 4 20 

55 AATACATATA CTTACGAAGG CGGTGCACAT ACGGAGAATA CAGTCCGGTT TGCCAACATC 4 80 

CTTCGCACCA CCGGCAAGGT GCTCGAAGAG CGAGATATAT TCAAGATCGA CTATGCGGAA 54 0 

AGGCTGTCCG CACTCATCAT AGGACAATTG GTGCACGATT TCGGCAAGAC CACACCTGCC 600 

GAATTGGATG CAATAGGTTT CTTCAACGCA GAAGAAATAC AGCCC7\ATGG CAATTTTATG 660 

ATCGATGACA AAGGTCTCAC ATACTGTTTC AATGAGTATC AGATAGCTGC TTATGCCAGA 7 20 

60 GGTGCTGTCT ATGTCCGTCT CGGATATGAC GTATTGGCTC CTTTGCTAAG GGATGATTCC 7 80 

CCACTAAAGC GTTACTTGCC G 801 



(2) INFORI'lATIOil FOR SEQ ID NO: 2 50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii} MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: NO 
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(ivl AMTI-SEtlSE: HO 



(vi) ORIGIIfAL SOURCE: 

( A ) ORGAMISM : PORY PHYROl^OMAS GINGIVALI S 

5 

(ix) FEATURE: 

(A) MAI IE/KEY: misc_f eature 

(B) LOCATION 1 . . .77 7 

10 SEQUENCE DESCRI PTIor] : SEQ ID HO: 250 



ATGAAAAAAA CTACTTTGAC AGGATCGATA 
GCCAATGCCC AATCGAAGTT AAAGATCAAG 
GCCACGGCCG GAAATGGTTT TGGTGGCAAT 

15 AGGGTACACC ACAGCATTCT GCCCGAAGGG 
AGAGCACACT GGGAAGAGGC CGGAAGTCCG 
ATTGGTATTG TTGAAAAGGT AATAGTCTTC 
GCTCTCGGCC GCTTCCTCAT CAGTTTGGGG 
TGGGGATGGA CCAATGACGC CCATATTCTT 

20 TATGACACCT ACACAAGAGC TGGCAGTGAC 
GTGAGCAATG GCTTTTCACC GGGGATCGGA 
ATCAAGAGCA AATATGATTT TCGCCTCGGT 
CTATATCCGT ATCGTAATTT CAAGCTGGAT 
CGCATGAACC ACATCGGCCA TGTGGGCTTC 

25 



TGTGCTTTAC TCCTGTTTTT GGGTCTCTCG 60 

AGCATTGAGG CAGCTACCAC TTTCAGTTCG 120 

ATCTTCGGCA TGGACATGAG CATACGGATG 180 

TTGGATTTTT CGGTAGGAAT ACATGAAAGA 24 0 

AAGCTCATGT ATACGAATGT CCCAAGTATC 300 

GAAGACGCAG AAGACTTTTT TGACAAAAAA 360 

ATATCCTATA CCAAGCATCT GGGAGCGTAT 4 20 

TTCTCACCGA TACCCAAGAG CAAGGTCCAC 4 80 

CTTGTACTTC AGTCCGAAGA TGTTGCCACA 540 

CTCAAAAGTT CTATTTGGTG GAAAATGCCC 600 

TTCAGCCTGG GCTATGAGTA TCTGAACCTG 660 

GGAAATAAGC CGCTTTCAGC ACTATCTCCT 7 20 

AACTTTACCG TGGGTCTTTG GACTAAT 777 



(2) III FORMAT I OH FOR SEQ ID HO: 251 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 3798 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIIDEDNESS: double 

(D) TOPOLOGY: circular 

35 (ii) MOLECULE TYPE: DHA {genomic) 

( iii ) HYPOTHETICAL: HO 



(iv) AHTI-SEHSE: HO 

40 



(vi) ORIGINAL SOURCE; 

(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 



lix) FEATURE: 
45 (A) NAME/KEY: misc_feature 

(E) LOCATION 1 ... 3798 



(xi) SEQUENCE DESCRIPTION: SI 

50 ATGGGCAAGT ATAAAAGAGC TAAGTACCGC 

TATACCTTTG AGGGAGTTAC TTTTTTATGC 

CAAGCCATTT TAGACTTAGA GAAGGCCTAT 

GAAGGGATAA GCTGGTCGGT AAATAGATAT 

GAGCTTTGCT TGCGAGAATG TCAGATAGAA 

55 CTAAAAAAGC TTGATCTATC GTATAACCAA 

ACTTCGTTAA CAAAACTTCG TCTAAGAAGT 

AGTCTCACCT CGCTAACAAA ACTTTCTCTC 

CTGGAACGTC TCACCTCGTT AGCGGAGCTT 

GAGGGTCTGG AACGTCTCAC GTCCTTAGCA 

60 AAGCTGGAGG GTCTGGAACG TCTCACGTCC 

ATCCGTAAGC TAGAGGGTCT GGAACGTCTC 

AACCA^ATCA GTAAGCTAGA GGGTCTGGAA 

TCGGGTAACC AAATCCGTAA GCTGGAGGGT 

GAACTGTCGG GTAACCAAAT CAGTAAGCTA 

65 AAGCTTCGTC TAAGAAGTAA CCAGATCAGT 

CTAACAAAAC TTTCTCTCTC CGATAACCAA 

ACCTCGTTAG CGGAGCTTTA TCTTTTGGAT 

CGTCTCACCT CGTTAACAAA GCTTCGTCTA 

CTGGATAGTC TCACCTCGCT AACAAAACTT 

70 GAGGGCCTGG AACGTCTCAC GTCCTTAGCG 

AAGCTGGAGG GTCTTGATGG TCTTGCTTCC 

ATCAGTAAGC TGGAAGGACT AGACAGACTA 

AATGATATTC AATCTATTGA TGATATTAAG 

GAAAAACTGA GAATCCATGA CAATCCATTT 

75 TATGATAATC ATTTGCCGGA GATTAAAGCT 



:Q ID HO: 251 

TATTGGCTTT TTCCTTTCTG TTCGGATTAT 60 

GCATCTGACG ATATGACAAC CAAGAAACCC 120 

AACATTGAAA TTCCTGATCT CTCCTCACAA 180 

TTCAAGCAAG ATTCCTCCGG TGCAGTCGTT 24 0 

AGCATGACTT GGCTTATTGA TTTTCCTGCT 300 

ATCAGTAAGC TAGAGGGTCT AGAACGTCTT 360 

AACCAAATCC GTAAACTAGA GGGCCTGGAT 4 20 

TCCGATAACC AAATCAGTAA GCTAGAGGGT 4 80 

TATCTTTTGG ATAACCAAAT CAGTAAACTA 54 0 

ACGCTTGAAC TATCGGGTAA CCAAATCCGT 600 

TTAGCAACGC TTGAACTATC GGGTAACCAA 660 

ACTTCGTTA.A CAAAGCTTCG TCTAAGAAGT 7 20 

CGTCTCACGT CCTTAGCAAC GCTTGAACTA 7 80 

CTGGAACGTC TCACGTCCTT AGCAACGCTT 840 

GAGGGTCTGG AACGTCTCTC TTCGTTAACA 900 

AAACTAGAGG GCCTGGAACG TCTCACCTCG 960 

ATCAGTAAGC TAGAGGGTCT GGAACGTCTC 1020 

AACCAAATCC GTAAGCTGGA GGGCCTGGAA 1080 

AGAAGTAACC AAATCAGTAA ACTAGAGGGC 114 0 

TCTCTCTCCG ATAACCAAAT CAGTAAACTA 12 00 

GAGCTTTATC TTTTGGATAA CCAAATCCGT 1260 

TTAACAAGGC TTAGTCTAAG GCGCAACCAA 1320 

AJ\GGTTTTGA GAAAACTTGA TGTTTCGGGC 138 0 

CTATTGGCTC CGATTCTGGA GCAAACTTTA 14 40 

GTTGCATCAT CAGGCTTGAT ACTCTCTCCT 1500 

CTTCTTGAAA AAGAAAAAGA AAAACAGAAA 1560 
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AAGACTTCAG TTGAATATCA CCCATTTTGC 
GGTAAAACAA CATTTCTTAG TCAATACGAT 
GTGTTGTCGA TACATCGAAG CAATAACCCT 
GACTATTATC ATGGGATTTA CCAAGCCTTT 
5 TGGGATGCTA AGAAGGATCG AAACTTTGTG 
AATTTCAATC GCCCCTATTG GTTAGGACAG 
GTTGGAGGAA ATCCTGATGG CAAGGACACA 
CAGACTCATG CCGATGA.A.AC GGGCGCTAAG 
GGAGTATTGG AAGAAATCTA TGTATCCTTA 

10 CTCAACTATC TGAATGAGCG GGTGCGAGAA 

ATCACAGAAA AAGATAAGGG ATTGTACGAA 
CACATCCCTA TCTCTCTCGA AGCTCTTGCG 
GATCTTTACA CCATAGAGTA TCTACAGACC 
GTGCTTTACT ATCGTGAGAA TGAGAAGCTG 

15 TTTGTCCAAA TGATTCATGG AGAAATCCTC 

CCTAAAGACA TTTTTGAATG CAAACTGCAT 
GATGGCCAAA ATGGTAATAT GATCTTGCAG 
GATAAGGACT GCTATGTGAT ACCGGGCTAT 
AAATGGCTTA CTTTGGGATT CGAGAGGCCC 

20 CCCTTTGGCC TGATCAACCA GATTATAGCC 
CGGTATTGGC GAGATCAGGT CATCTTCACA 
GAGCAAGAAG AAGAGAAAGA GGGTTTGCCC 
ATCAAGCTCG ACTTTACCGA CTTGGCCATA 
TCAGCTAAGG ATATGCAGCG GAAAGAGGCT 

25 TGGAACAATA TCCCTCCGAG GGAGCAAATA 
ACTATTCGTG AAACAAACAG AAAGAAGAGA 
CAAGCGGATA AAGATTTGAC GGAGTCTCAT 
AGCAAGACTA CGGCGAGGAT TGCAGCCTAT 
CGGGTGCGAG AAGTATCGAC TCGTCCCTAC 

30 ACTGCAAAAC AGATCTTTAT TTCCTATTCC 
CTGCAATTTT TCAAACCCTT GGAGAAGAAT 
ACTAAGTTTG AAACACCTAT TCACCCTGAA 
ATAATCGCTT TGATCAGCCA ACGCTATCTG 
CCTGTATTTC GGGAGTATAA CAAGACCATA 

35 GAAGACGATG AGTTCCTTCG GGAGAAATAT 

GGAAAAGAGG GAAAAACCAT TAAAGCTTAT 
GAAAATTGGG TGGCAGTAGT CAGAGAGTTC 
GAGGTAAATA CAGATGAA 

40 

(2) IIIFORl-IATION FOR SEQ ID HO: 25! 



AAAGTAATGC TATTGGGAAA TCATTCTTCG 1620 

ACAAATTATA CGTATCAGAA AAATACACAT 168 0 

AATGCGATCT TTTACGACTT TGGGGGACAG 17 4 0 

TTTACCACCC AATCGTTATA CCTTCTCTTT 1800 

AGCGTAGATG ATAAAGAATA TCAGACTCTT 18 60 

ATAGCCTATG CCTGCAATCG TTGTATGTCC 192 0 

CCACAGACCA CAGACGATAC AATTATCATT 198 0 

CAGCAAACCT TAGGCTGTGC AGCCGAGAAT 20 '3 0 

GAGCCCAAGG CGAATAGTGC CGTACATGCG 2100 

GTTGTCGCAA GCAGGAGTAA ATCAATTCAG 2160 

GCTCTTCCCA CAATCGCCGG TGATAATAAA 2220 

GCTCAATTGA ATAAGGGAAG AGCTGAAAAT 22 80 

GAATTGAACC AGCTTAGTCT GCGAGGGGAG 2 34 0 

AACAATTATG TCTGGTTAGA TCCGGCAGCT 24 00 

CAAAAAGACA ACATCAATAG AGGAACAGTT 2 4 60 

AATCTAAGTT CCGGAAGTAT ATTTGAAGAA 252 0 

CTATTATTGG AAGAGCTGAT CGTATATGAA 2580 

CTCCCTTTGC ATTCCGATGA CGAAGCCTAT 2 64 0 

AATTTTGTCC TCAAATTCGA ACGTTTTATC 27 00 

TACTATGGCC GGGAAGAAGG TGCTCTAAAG 27 60 

GCAGGCCGTG AGATGGATAG GCAAACGCTT 2 82 0 

AAGACGAATG CCGAGGATTA TCAGATCTGG 28 80 

TCCGTATTCA TCAAAGAGCA GAGAAAGACA 2 94 0 

ACTATCCTCA GTGATATGTT GG AT ATGT AT 3000 

GGAGATAAGG ATACGGAGCA 7^ACGAGAAGC 306 0 

CCCATCCAGG ATCTCTACCT CTCCTGTGCC 3120 

TATATCCATT TGGGCACGCT GGACGATGAA 3180 

CCGTTGAAGA ACGGCGTTAT CGATAAAGAG 32 4 0 

AAACATCTTT CCGTCAATAA AAATCTGGCT 3300 

AAAGAGGATC AGACTGAACT GGAGACCTGT 3360 

GGTCAGATCG AGATCTACTA TGATAAGTTG 34 2 0 

ATAAGAAAGC GTATTGTCGA AGCCGACTGT 34 80 

GCCACGGATT ACATCCTGGA TCATGAGTTG 354 0 

GTGCCGATAT TGATCAAGCC TTGTACATTC 3600 

TTTGCTCAGA AAGCTCAAAT AATCAATCTT 3560 

GATAGTATTA CGGCATCAGC CCATCGTGAT 37 20 

AAAGAGAAGA TATTAAGT^AT AACAAT^ACAG 37 80 

3798 



(1) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 36 96 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA (genomic) 

50 

( i i i ) H Y POTHETI CAL ; MO 



{iv) ADTI-SENSE: MO 

55 (vi) ORIGINAL SOURCE: 

(A) ORG AU ISM: PORYPHYROMOHAS GIIIGIVALIS 



(ix) FEATURE: 

(A) NAME/KEY: misc__f eature 
60 (B) LOCATIOn 1...3696 



(xi) SEQUENCE DESCRIPTION: SEQ ID 110:252 



ATGACAACCA AGAAACCCCA AGCCATTTTA 

65 CCTGATCTCT CCTCACAAGA AGGGATAAGC 

TCCTCCGGTG CAGTCGTTGA GCTTTGCTTG 

CTTATTGATT TTCCTGCTCT AAAAAAGCTT 

GAGGGTCTAG AACGTCTTAC TTCGTTAACA 

AAACTAGAGG GCCTGGATAG TCTCACCTCG 

70 ATCAGTAAGC TAGAGGGTCT GGAACGTCTC 

AACCAAATCA GTAAACTAGA GGGTCTGGAA 

TCGGGTAACC AAATCCGTAA GCTGGAGGGT 

GAACTATCGG GTAACCAAAT CCGTAAGCTA 

AAGCTTCGTC TAAGAAGTAA CCAAATCAGT 

75 TTAGCAACGC TTGAACTATC GGGTAACCAA 



GACTTAGAGA AGGCCTATAA CATTGAAATT 60 

TGGTCGGTAA ATAGATATTT CAAGCAAGAT 120 

CGAGAATGTC AGATAGAAAG CATGACTTGG 180 

GATCTATCGT ATAACCAAAT CAGTAAGCTA 24 0 

AAACTTCGTC TAAGAAGTAA CCAAATCCGT 300 

CTAACAAAAC TTTCTCTCTC CGATAACCAA 360 

ACCTCGTTAG CGGAGCTTTA TCTTTTGGAT 4 20 

CGTCTCACGT CCTTAGCAAC GCTTGAACTA 4 80 

CTGG.AACGTC TCACGTCCTT AGCAACGCTT 54 0 

GAGGGTCTGG A.^\CGTCTCAC TTCGTTAACA 600 

AAGCTAGAGG GTCTGGAACG TCTCACGTCC 660 

ATCCGTAAGC TGGAGGGTCT GGAACGTCTC 7 20 
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ACGTCCTTAG CAACGCTTGA ACTGTCGGGT 
CGTCTCTCTT CGTTAACAAA GCTTCGTCTA 
CTGGAACGTC TCACCTCGCT AACAAAACTT 
GAGGGTCTGG AACGTCTCAC CTCGTTAGCG 
5 AAGCTGGAGG GCCTGGAACG TCTCACCTCG 
ATCAGTAAAC TAGAGGGCCT GGATAGTCTC 
AACCAAATCA GTAAACTAGA GGGCCTGGAA 
TTGGATAACC AAATCCGTAA GCTGGAGGGT 
AGTCTAAGGC GCAO^CCAAAT CAGTAAGCTG 

10 AAACTTGATG TTTCGGGCAA TGATATTCAA 

ATTCTGGAGC AAACTTTAGA AAAACTGAGA 
GGCTTGATAC TCTCTCCTTA TGATAATCAT 
GAAAAAGAAA AACAGAAAAA GACTTCAGTT 
TTGGGAAATC ATTCTTCGGG TAAAACAACA 

15 TATCAGAAAA ATACACATGT GTTGTCGATA 
TACGACTTTG GGGGACAGGA CTATTATCAT 
TCGTTATACC TTCTCTTTTG GGATGCTAAG 
AAAGAATATC AGACTCTTAA TTTCAATCGC 
TGCAATCGTT GTATGTCCGT TGGAGGAAAT 

20 GACGATACAA TTATCATTCA GACTCATGCC 
GGCTGTGCAG CCGAG7VATGG AGTATTGGAA 
AATAGTGCCG TACATGCGCT CAACTATCTG 
AGGAGT7W\T CAATTCAGAT CACAGAAAAA 
ATCGCCGGTG ATAATAAACA CATCCCTATC 

25 AAGGGAAGAG CTGAAAATGA TCTTTACACC 
CTTAGTCTGC GAGGGGAGGT GCTTTACTAT 
TGGTTAGATC CGGCAGCTTT TGTCCAAATG 
ATCAATAGAG GAACAGTTCC TAAAGACATT 
GGAAGTATAT TTGAAGAAGA TGGCCAAAAT 

30 GAGCTGATCG TATATGAAGA TAAGGACTGC 
TCCGATGACG AAGCCTATAA ATGGCTTACT 
AAATTCGAAC GTTTTATCCC CTTTGGCCTG 
GAAGAAGGTG CTCTAAAGCG GTATTGGCGA 
ATGGATAGGC AAACGCTTGA GCAAGAAGAA 

35 GAGGATTATC AGATCTGGAT CAAGCTCGAC 

AAAGAGCAGA G7W\GACATC AGCTAAGGAT 
GATATGTTGG ATATGTATTG GAACAATATC 
ACGGAGCAAA CGAGAAGCAC TATTCGTGAA 
CTCTACCTCT CCTGTGCCCA AGCGGATAAA 

40 GGCACGCTGG ACGATGAAAG CAAGACTACG 

GGCGTTATCG ATAAAGAGCG GGTGCGAGAA 
GTCAATAAAA ATCTGGCTAC TGCAAAACAG 
ACTGAACTGG AGACCTGTCT GCAATTTTTC 
ATCTACTATG ATAAGTTGAC TAAGTTTGAA 

45 ATTGTCGAAG CCGACTGTAT AATCGCTTTG 
ATCCTGGATC ATGAGTTGCC TGTATTTCGG 
ATCAAGCCTT GTACATTCGA AGACGATGAG 
GCTCAAATAA TCAATCTTGG AAAAGAGGGA 
GCATCAGCCC ATCGTGATGA AAATTGGGTG 

50 TTAAGAATAA CAAAACAGGA GGTAAATACA 



AACCAAATCA GTAAGCTAGA GGGTCTGGAA 7 80 

AGAAGTAACC AGATCAGTAA ACTAGAGGGC 84 0 

TCTCTCTCCG ATAACCAAAT CAGTAAGCTA 900 

GAGCTTTATC TTTTGGATAA CCAAATCCGT 960 

TTAACAAAGC TTCGTCTAAG AAGTAACCAA 102 0 

ACCTCGCTAA CAAAACTTTC TCTCTCCGAT 1080 

CGTCTCACGT CCTTAGCGGA GCTTTATCTT 114 0 

CTTGATGGTC TTGCTTCCTT AACAAGGCTT 12 00 

GAAGGACTAG ACAGACTAAA GGTTTTGAGA 12 60 

TCTATTGATG ATATTAAGCT ATTGGCTCCG 132 0 

ATCCATGACA ATCCATTTGT TGCATCATCA 1380 

TTGCCGGAGA TTAAAGCTCT TCTTGAAAAA 14 4 0 

GAATATCACC CATTTTGCAA AGTAATGCTA 1500 

TTTCTTAGTC AATACGATAC AAATTATACG 1560 

CATCGAAGCA ATAACCCTAA TGCGATCTTT 1620 

GGGATTTACC AAGCCTTTTT TACCACCCAA 1680 

AAGGATCGAA ACTTTGTGAG CGTAGATGAT 17 4 0 

CCCTATTGGT TAGGACAGAT AGCCTATGCC 1800 

CCTGATGGCA AGGACACACC ACAGACCACA 1860 

GATGAAACGG GCGCTAAGCA GCAAACCTTA 192 0 

GAAATCTATG TATCCTTAGA GCCCAAGGCG 198 0 

AATGAGCGGG TGCGAGAAGT TGTCGCAAGC 2 04 0 

GATAAGGGAT TGTACGAAGC TCTTCCCACA 2100 

TCTCTCGAAG CTCTTGCGGC TCAATTGAAT 2160 

ATAGAGTATC TACAGACCGA ATTGAACCAG 22 20 

CGTGAGAATG AGAAGCTGAA CAATTATGTC 2280 

ATTCATGGAG AAATCCTCCA AAAAGACT^C 2 34 0 

TTTGAATGCA AACTGCATAA TCTAAGTTCC 2 4 00 

GGTAATATGA TCTTGCAGCT KTTATTGGAA 2 4 60 

TATGTGATAC CGGGCTATCT CCCTTTGCAT 2 520 

TTGGGATTCG AGAGGCCCAA TTTTGTCCTC 2 580 

ATCAACCAGA TTATAGCCTA CTATGGCCGG 264 0 

GATCAGGTCA TCTTCACAGC AGGCCGTGAG 27 00 

GAG7\AAGAGG GTTTGCCCAA GACGAATGCC 27 60 

TTTACCGACT TGGCCATATC CGTATTCATC 2 82 0 

ATGCAGCGGA AAGAGGCTAC TATCCTCAGT 28 8 0 

CCTCCGAGGG AGCAAATAGG AGATAAGGAT 2 94 0 

ACAAACAGAA AG/WXGAGACC CATCCAGGAT 3000 

GATTTGACGG AGTCTCATTA TATCCATTTG 3060 

GCGAGGATTG CAGCCTATCC GTTGAAGAAC 3120 

GTATCGACTC GTCCCTACAA ACATCTTTCC 3180 

ATCTTTATTT CCTATTCCAA AGAGGATCAG 32 4 0 

AAACCCTTGG AGAAGAATGG TCAGATCGAG 3300 

ACACCTATTC ACCCTG7U\AT AAGAAAGCGT 3360 

ATCAGCCAAC GCTATCTGGC CACGGATTAC 34 20 

GAGTATAACA AGACCATAGT GCCGATATTG 34 80 

TTCCTTCGGG AGAAATATTT TGCTCAGAAA 354 0 

AATUXCCATTA A^GCTTATGA TAGTATTACG 3600 

GCAGTAGTCA GAGAGTTCAA AGAGAAGATA 3660 

GATGAA 36 96 



(2) inrORHATIOH FOR SEQ ID MO: 253 

55 (i) SEQUEUCE CHARACTERISTICS: 

(A) LENGTH: 3525 base pairs 
{B) TYPE: nucleic acid 

(C) STRAMDEDNESS : double 

(D) TOPOLOGY: circular 

60 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
65 (iv) AMTI-SEIJSE: NO 



(vi) ORIGIMAL SOURCE: 

(A) ORGAIIISM: PORYPHYROI lOMAS GTtlGIVALIS 

70 (ix) FEATURE: 

(A) 1-IAIIE/KEY : mi £c_ feature 

(B) LOCATION 1. . . 3525 

f'-i) SEQUENCE DESCRIPTION: SEQ ID NO: 253 

75 
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ATGACTTGGC TTATTGATTT TCCTGCTCTA AAAAAGCTTG ATCTATCGTA TAACCAAATC 60 

AGTAAGCTAG AGGGTCTAGA ACGTCTTACT TCGTTAAC7VA AACTTCGTCT AAGAAGTAAC 120 

CAAATCCGTA AACTAGAGGG CCTGGATAGT CTCACCTCGC TAACAAAACT TTCTCTCTCC 180 

GATAACCAAA TCAGTAAGCT AGAGGGTCTG GAACGTCTCA CCTCGTTAGC GGAGCTTTAT 24 0 

5 CTTTTGGATA ACCAAATCAG TAAACTAGAG GGTCTGGAAC GTCTCACGTC CTTAGCAACG 300 

CTTGAACTAT CGGGTAACCA AATCCGTAAG CTGGAGGGTC TGGAACGTCT CACGTCCTTA 360 

GCAACGCTTG AACTATCGGG TAACCAAATC CGTAAGCTAG AGGGTCTGGA ACGTCTCACT 4 20 

TCGTTAACAA AGCTTCGTCT AAGAAGTAAC CAAATCAGTA AGCTAGAGGG TCTGGAACGT 4 80 

CTCACGTCCT TAGCAACGCT TGAACTATCG GGTAACCAAA TCCGTAAGCT GGAGGGTCTG 54 0 

10 . GAACGTCTCA CGTCCTTAGC AACGCTTGAA CTGTCGGGTA ACCAAATCAG TAAGCTAGAG 600 

GGTCTGGAAC GTCTCTCTTC GTTAACAAAG CTTCGTCTAA GAAGTAACCA GATCAGTAAA 660 

CTAGAGGGCC TGGAACGTCT CACCTCGCTA ACAAAACTTT CTCTCTCCGA TAACCAAATC 7 20 

AGTAAGCTAG AGGGTCTGGA ACGTCTCACC TCGTTAGCGG AGCTTTATCT TTTGGATAAC 7 80 

CAAATCCGTA AGCTGGAGGG CCTGGAACGT CTCACCTCGT TAACTVAAGCT TCGTCTAAGA 84 0 

15 AGTAACCAAA TCAGTAAACT AGAGGGCCTG GATAGTCTCA CCTCGCTAAC AAAACTTTCT 900 

CTCTCCGATA ACCAAATCAG TAAACTAGAG GGCCTGGAAC GTCTCACGTC CTTAGCGGAG 960 

CTTTATCTTT TGGATAACCA AATCCGTAAG CTGGAGGGTC TTGATGGTCT TGCTTCCTTA 102 0 

ACAAGGCTTA GTCTAAGGCG CAACCAAATC AGTAAGCTGG AAGGACTAGA CAGACTAAAG 108 0 

GTTTTGAGAA AACTTGATGT TTCGGGCAAT GATATTCAAT CTATTGATGA TATTAAGCTA 114 0 

20 TTGGCTCCGA TTCTGGAGCA AACTTTAGAA AAACTGAGAA TCCATGACAA TCCATTTGTT 1200 

GCATCATCAG GCTTGATACT CTCTCCTTAT GATAATCATT TGCCGGAGAT TAAAGCTCTT 12 60 

CTTG7\AAAAG AAAAAGAAAA ACAGAAAAAG ACTTCAGTTG AATATCACCC ATTTTGCAAA 1320 

GTAATGCTAT TGGGAAATCA TTCTTCGGGT AAAACAACAT TTCTTAGTCA ATACGATACA 1380 

AATTATACGT ATCAGAAAAA TACACATGTG TTGTCGATAC ATCGAAGCAA TAACCCTAAT 144 0 

25 GCGATCTTTT ACGACTTTGG GGGACAGGAC TATTATCATG GGATTTACCA AGCCTTTTTT 1500 

ACCACCCAA.T CGTTATACCT TCTCTTTTGG GATGCTAAGA AGGATCGAAA CTTTGTGAGC 1560 

GTAGATGATA AAGAATATCA GACTCTTAAT TTCAATCGCC CCTATTGGTT AGGACAGATA 162 0 

GCCTATGCCT GCAATCGTTG TATGTCCGTT GGAGGAAATC CTGATGGCAA GGACACACCA 1680 

CAGACCACAG ACGATACAAT TATCATTCAG ACTCATGCCG ATGAAACGGG CGCTAAGCAG 17 40 

30 CAAACCTTAG GCTGTGCAGC CGAGAATGGA GTATTGGAAG AAATCTATGT ATCCTTAGAG 1800 

CCCAAGGCGA ATAGTGCCGT ACATGCGCTC AACTATCTGA ATGAGCGGGT GCGAGAAGTT 1860 

GTCGCAAGCA GGAGTAAATC AATTCAGATC ACAGAA7\AAG ATAAGGGATT GTACGAAGCT 1920 

CTTCCCACAA TCGCCGGTGA TAATAAACAC ATCCCTATCT CTCTCGAAGC TCTTGCGGCT 1980 

CAATTGAATA AGGGAAGAGC TGAAAATGAT CTTTACACCA TAGAGTATCT ACAGACCGAA 204 0 

35 TTGAACCAGC TTAGTCTGCG AGGGGAGGTG CTTTACTATC GTGAGAATGA GAAGCTGAAC 2100 

AATTATGTCT GGTTAGATCC GGCAGCTTTT GTCC7W\TGA TTCATGGAGA AATCCTCCAA 2160 

AAAGACAACA TCAATAGAGG AACAGTTCCT AAAGACATTT TTGAATGCAA ACTGCATAAT 2220 

CTAAGTTCCG GAAGTATATT TGAAGAAGAT GGCCAAAATG GTAATATGAT CTTGCAGCTA 22 60 

TTATTGGAAG AGCTGATCGT ATATGAAGAT AAGGACTGCT ATGTGATACC GGGCTATCTC 2 34 0 

40 CCTTTGCATT CCGATGACGA AGCCTATAAA TGGCTTACTT TGGGATTCGA GAGGCCCAAT 2 4 00 

TTTGTCCTCA AATTCGAACG TTTTATCCCC TTTGGCCTGA TCAACCAGAT TATAGCCTAC 24 60 

TATGGCCGGG AAGAAGGTGC TCTAAAGCGG TATTGGCGAG ATCAGGTCAT CTTCACAGCA 2520 

GGCCGTGAGA TGGATAGGCA AACGCTTGAG CAAGAAGAAG AGAAAGAGGG TTTGCCCAAG 2580 

ACGAATGCCG AGGATTATCA GATCTGGATC AAGCTCGACT TTACCGACTT GGCCATATCC 2 64 0 

45 GTATTCATCA AAGAGCAGAG AAAGACATCA GCTAAGGATA TGCAGCGGAA AGAGGCTACT 27 00 

ATCCTCAGTG ATATGTTGGA TATGTATTGG AACAATATCC CTCCGAGGGA GCAAATAGGA 2 7 60 

GATAAGGATA CGGAGCAAAC GAGAAGCACT ATTCGTGTW^A CAAACAGAAA GAAGAGACCC 2 8 20 

ATCCAGGATC TCTACCTCTC CTGTGCCCAA GCGGATAT^lAG ATTTGACGGA GTCTCATTAT 2 8 80 

ATCCATTTGG GCACGCTGGA CGATGAAAGC AAGACTACGG CGAGGATTGC AGCCTATCCG 2 94 0 

50 TTGAAGAACG GCGTTATCGA TAAAGAGCGG GTGCGAGAAG TATCGACTCG TCCCTACAAA 3000 

CATCTTTCCG TCAATAAAAA TCTGGCTACT GCAAAACAGA TCTTTATTTC CTATTCCAAA 3060 

GAGGATCAGA CTGAACTGGA GACCTGTCTG CAATTTTTCA AACCCTTGGA GAAG7\ATGGT 3120 

CAGATCGAGA TCTACTATGA TAAGTTGACT AAGTTTGAAA CACCTATTCA CCCTGAAATA 3180 

AGAAAGCGTA TTGTCGAAGC CGACTGTATA ATCGCTTTGA TCAGCCAACG CTATCTGGCC 32 4 0 

55 ACGGATTACA TCCTGGATCA TGAGTTGCCT GTATTTCGGG AGTATAACAA GACCATAGTG 3300 

CCGATATTGA TCAAGCCTTG TACATTCGAA GACGATGAGT TCCTTCGGGA GAAATATTTT 33 60 

GCTCAGAAAG CTCAAATAAT CAATCTTGGA AAAGAGGGAA AAACCATTAA AGCTTATGAT 34 20 

AGTATTACGG CATCAGCCCA TCGTGATGAA AATTGGGTGG CAGTAGTCAG AGAGTTCAAA 34 80 

GAGAAGATAT TAAGAATAAC AAAACAGGAG GTAAATACAG ATGAA 3525 



60 



(2) IIIE'ORMATIOII FOR SEQ ID MO: 254 



(i) SEQUEIJCE CHARACTERISTICS: 
65 (A) LENGTH: 687 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIJDEDHESS: double 

(D) TOPOLOGY: circular 

70 (ii) UOLECl'LE TYPE: DUA t genomic) 

{iii) HYPOTHETICAL: NO 

(iv) AHTI-SEJISE: NO 

75 
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10 



15 



20 



(vi) ORIGIMAL SOURCE: 

(A) ORGANISM: PORY PHY ROl JONAS GINGIVAL! S 

(ix) FEATURE: 

(A) NAJ IE/KEY: misc_feature 

(B) LOCATION 1 . . . 687 

i'Ai) SEQUENCE DESCRI PTIOII : SEQ ID NO: 2 54 



ATGATGAAAA 
TCCGCCCAAA 
AGTTTGAACT 
AACGGTCTCC 
GCTACCACTT 
CACCCGGGCG 
ACCTACAATG 
GTCTTGATAC 
CGAACCGAAG 
GATGGTAACG 
GTGGGGGACG 
TGTGCCCAAA 



AAGCATTTGT 
C7iJ\CGACGAA 
TAGGGGTCCC 
CTTTCGAAAT 
ATTACATAGC 
GCATCGTGAG 
CAGTCTGCAC 
CCAAGCCCAA 
CCGGGAGGGA 
AAGTAACGAT 
TATCTCAAAA 
CAGTATCACA 



TTTCGTACTA 
CAGTAGCCGG 
CCCCGTAAGC 
ACCTATCTCT 
GAATAGCGAG 
GGTAGAAGGT 
CCGGATTACA 
AGTCTCGCTG 
ATTTATCCTT 
AGGCGGTAAA 
GTATGCCGTG 
ACAAAAA 



CTGGTTTGCC 
AGTTATTTTA 
ACAGAGGTTT 
TTCAGTCGTT 
GCAACTTTGA 
CGTTTTTGGA 
TTCGAAAATC 
CCTCATGTGT 
TGCGAAGAAG 
CCTTTCTTGC 
GGGGTAGGAG 



TATTCTCCTC 
CAGGACGAAT 
GGGGAATGAC 
TCAACAGCCA 
ATGAATGGTG 
AAATGACTTA 
AAGAAATAGA 
CGGAATCGGT 
ACGACACCTT 
TCAATACCA.A 
AAATTCGATT 



GTTCAGCAGT 
CGAGAAGGTG 
CCATGATGCG 
GGGAGATATA 
CGACTATGCA 
CAACATACCA 
AGGAACGATC 
GCCTTGCATC 
TGTGTCTCAC 
CGTAAAGATT 
CCTGCAGATT 



GO 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
687 



25 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 255 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 684 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I -SENSE: NO 



40 



45 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROl-'IONAS GINGIVALIS 



(ix) 



FEATURE: 

(A) NAME/ KEY: 

(B) LOCATION 



misG_f eature 
1 . . . 684 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255 



50 



55 



60 



65 



70 



75 



ATGAAAAAAG 
GCCCAAACAA 
TTGj^ACTTAG 
GGTCTCCCTT 
ACCACTTATT 
CCGGGCGGCA 
TACAATGCAG 
TTGATACCCA 
ACCGAAGCCG 
GGTAACGAAG 
GGGGACGTAT 
GCCCAAACAG 



CATTTGTTTT 
CGACGAACAG 
GGGTCCCCCC 
TCGAAATACC 
ACATAGCGAA 
TCGTGAGGGT 
TCTGCACCCG 
AGCCCAAAGT 
GGAGGGAATT 
TAACGATAGG 
CTCAAAAGTA 
TATCACAACA 



CGTACTACTG 
TAGCCGGAGT 
CGTAAGCACA 
TATCTCTTTC 
TAGCGAGGCA 
AGAAGGTCGT 
GATTACATTC 
CTCGCTGCCT 
TATCCTTTGC 
CGGTAAACCT 
TGCCGTGGGG 
AAAA 



GTTTGCCTAT 
TATTTTACAG 
GAGGTTTGGG 
AGTCGTTTCA 
ACTTTGAATG 
TTTTGGAAAA 
GAAAATCAAG 
CATGTGTCGG 
GAAGAAGACG 
TTCTTGCTCA 
GTAGGAGAAA 



TCTCCTCGTT 
GACGAATCGA 
GAATGACCCA 
ACAGCCAGGG 
AATGGTGCGA 
TGACTTACAA 
AAATAGAAGG 
AATCGGTGCC 
ACACCTTTGT 
ATACCAACGT 
TTCGATTCCT 



CAGCAGTTCC 
GAAGGTGAGT 
TGATGCGAAC 
AGATATAGCT 
CTATGCACAC 
CATACCAACC 
AACGATCGTC 
TTGCATCCGA 
GTCTCACGAT 
AAAGATTGTG 
GCAGATTTGT 



(2) INFORMATIOl^l FOR SEQ ID NO: 256 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
68 4 
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(A) ORGAItlSI-i: PORV PHYROMOMAS GIIIGIVALIS 



(ix) FEATURE: 

(A) IIAIIE/KEY: misc_f ea tu re 

(B) LOCATIOn 1 . . .1620 

(xi) SEQUENCE DESCRI PTIOII : SEQ ID HO: 2 56 



ATGAAGACAA AAGTTTTACG CAAATTCGTG 

10 CCTCTCGCCC AAGCGCAGAC GATGGGAGGA 
GAAAAACTGG TACAAACGAG GATGAGTGTG 
CACAGTGGAT ACGACACCGG CAATAGCAAT 
GCCACATACC AAAAGTTGAG GGATTGGGAT 
GATATCGTGG TAACGGGTAA GAATGAATCC 

15 AATAAGCCCG GAGGATATAA GAGTAGAGTT 
AATGCGAAAC TCGTGTATAA GGAAGACTTC 
GCCTCCAACT ATCGTTCGCC TTCTTCTCTT 
TTCGCTTACA CCGGCTTCAA CAATACGCAC 
CTGAATGGAG GGCAAAATTT CAATAAAAAC 

20 ATTGACAAGG TGGATCTCTC ATTGGGTAGC 
CCGCTAATGG GTGTGGTATT CGAAATG/^T 
TTGTCGAACT TTGTCGACAA TGATCCCGAA 
GAAAGCGACA TGTCGTTCAG CCCCAAAATC 
ATCAATGGGG AGAGTTGCCA CAACTTCATG 

25 TCGGATTGGG ACATTCGGTA TGTATATCCC 
ACTCCGACTA TGGATGATCT GGTGGAAGCT 
A.ACTCGGGGC TGGGGTATGA CAAGAACGCC 
GAAGAGAACG GTACGAACAC GCTGAAATAC 
AAAGATTTGT GGAGCGACAC ATTTACGTAT 

30 CAAGTAGACA TCAATCCGAC CAAGGGTCTC 

GGCAAACGGA TCGTTTGGTC TGATACGCAG 
GTAATGCAAG AAGGCAGCAT GAAGCTCTAC 
AGCCTGCCGA CGGCAGCAAA CTGCAAGGCT 
GCTGAGGCTT CTTTCTCCGG CAACGAATAC 

35 ACGTACATAC TCAAGGTCGT ATCCGATACG 



GTGGCGGCTT TCGCCGTCGC AACCCTCTGT 60 

GATGATGTCA AGGTGGTCCA GTACAATCAG 120 

GCGGACAACG GATGGATCTA TGTAATGACC IBO 

GTGAAGATCT TCCGCTCCAA AGACCAAGGT 2-10 

CCATCGGATG ATTATCAGTT TCAAGACTTC 300 

GACATCAAGA TTTGGTCGGT AGAGCTCATG 360 

GCGGTCTTCA GTCGCGATGC CAACGCGCAG 4 20 

TCCAATGTGC AGTTGTACGA TGTGGATATA 4 80 

AACAATGGTG GCAACCCTTT TGCTTTGGCT 540 

AAAATAAGTT TTGTGGACTA TGTGTTCTCT 600 

TTACTCTTCA GTCAAGATGG AGAGAAGAAA 660 

ACCTCTGAAT CCATGGGTCA CAATGCCTGG 7 20 

AAACAAGGGG GAAAAAGCGA TATCGGTTTC 7 80 

TTTCAGTGGT CAGGCCCTAT AAAAGTGAGT 84 0 

CAAATGTTGC TGGACGAGGA TAACAATACG 900 

ATTACGTACA GCGATTATGA TTCTGAATAT 960 

A/'vGAAATCGT TCAAGTATGA AAAAGGA/W^ 1020 

TTCCTTACAG CTTCGTACCA GAGTGAGACC 108 0 

AATCACTACC TGATTACATA TGCCAAAAAA 11 AO 

CGCTGGGCCA ATTATGACAA GATTCATAAC 1200 

ACATCATCTG CCAATGCTCT CTACACACCT 1260 

GTGTGCTGGT CATGGGTGGA ATATCTGCCG 1320 

TGGACCCATG CCAACGGTGT AGAAGACATC 1380 

CCGAATCCGG CTCAAGAATA TGCTGTGATT 14 4 0 

GTTGTTTACG ATATGCAGGG CAGAGTAGTC 1500 

AGGCTGAACG TGCAGCACTT GGCTAAGGGT 1560 

GAGCGTTTCG TAGAGAAGCT CATCGTGGAA 1620 



(2) IMt'ORllATIOW FOR SEQ ID 110:2 57 

40 (1} SEQUEI4CE CHARACTERISTICS: 

(A) LENGTH: 2313 base pairs 
(E) TYPE: nucleic acid 
{C) STRAMDEDIIESS: double 
(D) TOPOLOGY: circular 

45 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: MO 
50 (iv) At ITI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAN I SI 1: PORYPHYROI^iONAS GTNGIVALIS 

55 (ix) FEATURE: 

{A} NAME/ KEY: misc_f ea ture 

(B) LOCATION 1 ... 2313 



60 



(:-:i) SEQUENCE DESCRIPTION: SEQ ID NO: 2 57 



ATGCTGACGA TCCGAAACTT CCTCCTCTTT TGTTGTCTGT CGCTGATAGC GTTTGCTGCC 60 

GATGCACAAA GCTCTGTCTC TTCGGGTAGA CGACTGACAG AATATGTCAA TCCCTTTATC 120 

GGAACGGCCA ATTACGGTAC CACGAATCCG GGAGCAGTAT TGCCCAATGG GTTGATGAGC 180 

GTTACCCCTT TCAATGTCAG CGGATCGACA GAGAATCGCT TCGACAAAGA TTCGCGTTGG 2^0 

65 TGGAGTGCGC CTTATTCGGC CGACAATAGT TACTGCATCG GTTTCAGCCA TGTGAATCTG 300 

AGTGGAGTAG GCTGTCCCGA ACTGAGTGGA ATACTGCTGA TGGCCACTTC CGGCACATTC 360 

GATCCTGATT ACTGCTGCTA TGGCTCTTCG CTCAGTCGAG AATATGCGCG CCCGGGAGAA 4 20 

TACAAGGCTG TATTGGACAA ATACGGTATA GATGCAGCCG TGACCGTAAC CGAGCGGACT 4 80 

GCTTTGACCG AATTTGCTTT TCCCGAAGGA GAAGGCCATA TCCTGCTGAA CCTGGGACAG 54 0 

70 GCCCTAAGCA ATGAATCGGG AGCCTCTGTT CGATTCTTAA ACGACTCCAC AGTCGTCGGC 600 

AGCAGGCTGA TGGGGACGTT CTGCTACAAT CCGCAAGCAG TTTTTCGTCA GTATTTCGTA 660 

CTTCAGGTGA GTCGGCGACC GATCTCTGCC GGCTATTGGA AGAAGCAGCC TCCTATGACA 720 

GTGGAAGCCC AATGGGATTC GACTGCAGGG AAATATAAGC AGTACGACGG CTACAAGCGT 7 80 

GAGATGAGCG GTGATGACAT CGGTGTCCGA TTCTCGTTCA ACTGCGATCA GGGGGAAAAG 84 0 

75 ATCTATGTAC GATCGGCCGT TTCATTCGTC AGCGAAGCCA ATGCGCTCTA TAATCTGGAA 900 
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10 



15 



20 



25 



GCGGAGCAAG 
ATACGCTCTC 
GGCACACCGG 
AATATCCTAC 
ACGGCTCACG 
CTGCTCTGCC 
TACCGAGAGA 
GAGGGCGACC 
GATACGGCAA 
CCGATCCGTC 
TACGACAATT 
TTTGCCCACG 
TACAGACACT 
TTCCTCACTC 
GAGGGCAGTG 
CTGATGGGAG 
TATGATCCGA 
GAAGCATGGC 
CCTAACGGCT 
ATGCTGGGAT 
TTCCCCCGAG 
ACGACCAATA 
AATAAAACAC 
CACCTCCGTT 



AAGAGGTGTT 
GCGCTATAGA 
ATGAAAAGAC 
AAGATGCCAA 
ACCGCTACAC 
TCCTCTATCC 
GCGGGTGGCT 
CCTCGCTTAT 
CGGCCTATGA 
CTGACAACGA 
CCGTATCGCA 
CACTTGGGCA 
ATTATAATAA 
CTTTCGATCC 
CTTATAACTA 
GAGCAAAGGT 
CCAACGAGCC 
GAACGCAGAA 
TGCCCGGTAA 
TCTACCCTGA 
TTAGGATTCG 
CAGAGAATCA 
TTCCGCATGG 
ACGAACTAAG 



CAAAAGTGTC 
GCGTTGGGAG 
GATATTCTAT 
TGGAGAATAT 
CGTGTTCTCT 
GGAGAAGCAG 
GCCGAGATGG 
CGTCATCAAT 
AGCCATGAAA 
CGACTATCTC 
TGCGCTGGAA 
TAAAGAAGAC 
GGAGTATGGT 
CAAACAGGGT 
TGCCTTTTTC 
TTTTTCGGAA 
GGACATCGCC 
ATTGACCCGG 
TGACGATGCC 
CTGTCCGGGC 
GCTCAATCCG 
ACCGACAGAT 
AACAAGGCAT 
CAATCGTCCT 



GGAGGGAP.TC 
GAAGCCCTCG 
ACCGCACTCT 
CCTATGATGG 
CTTTGGGACA 
TTGGATATGG 
GAGCTGTACG 
GACACTTGGC 
AAAAATGCTT 
ACCCTCGGCT 
TACTATCTGG 
GCAGCTCTAT 
ATGCTGTGTC 
GAAAACTTCG 
GTTCCCCACG 
AGGTTGCAGA 
TATCCTTACC 
GAGTTGATAG 
GGTACGATGA 
AGCCCCACCT 
CAGTATTATC 
TCCATTTACA 
ATCAGCCATG 
CGA 



CGGCCAAGGC 
GTACGGTGGA 
ATCACCTGCT 
GCAGTGGCAA 
CGTACCGCAA 
TACGGACACT 
GACAGGAGAC 
AAAGGGGCCT 
CTTCGGCAGG 
TCGTACCGCT 
CCGACTGG7\A 
TCGGAAAACG 
CATTGCTGCC 
AGCCTAA-TCC 
ATATACAAGG 
AAGTCTTCGA 
TCTTCTCCTA 
ACAAACATTT 
GTGCTTGGCT 
ATACACTGAC 
CTCAGGGGGA 
TCCATACGGT 
CCGATTTGGT 



TTTCTCCGCT 
AGTGGAAGGA 
GATACATCCG 
AACGGGTAAT 
TGTACACCCG 
GATCGACATG 
CCTGACGATG 
TCGTGCTTTC 
AGCGACCCAT 
TCGCGAACAG 
TCTGTCCCGG 
CTCGTTGGGC 
GGATGGATCA 
CGGTTTCCAC 
GCTTGCCCGG 
TGAAGGATAT 
TTTCCCCAAG 
TTGCAATGCT 
TGTCTATTCC 
CTCGCCGGTA 
GTTGATCATT 
TTCTCTTGGC 
GCGCTGCGGT 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2313 



30 



35 



M lilFORI IATlOIl FOR SEQ ID MO: 258 

(i) SEQUEJ ICE CHARACTERISTICS: 

(A) LEHGTH; 2328 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDMESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA (genomic) 
( i i i } HYPOTHETICAL : MO 



40 



(iv) AHTI-SEMSE: MO 

(vi) ORIGIllAL SOURCE: 

(A) ORGANISM: PORYPHYROMOMAS GIMGIVALIS 



45 



(ix) FEATURE: 

(A) MAllE/KEY: mi3c_featur€ 

(B) LOCATIOM l.-.232a 



(Mi) SEQUEMCE DESCRIPTION: SEQ ID MO: 256 



50 



55 



60 



65 



70 



75 



ATGTGCAAAA 
TTTTCTCTCC 
GAAAAAG CCG 
GTGGGCAAAG 
ACATCGGTAA 
TCAGGCAATA 
GACAATATCG 
TATTTGGCAG 
ACCTACGCCA 
ATGCTGAAAG 
GCATGGACAG 
GGGGAAGACA 
TTTGAGCCGG 
GGTCTGTTCA 
CAATTTCCTA 
ATAGCATTGG 
GCCATGCCTG 
CACGGACGTC 
ATGGTGAAGC 
CAGTTGGGCG 
CCCGATCACT 
GCGATAGCTC 
CGTGTGAAAG 
GGCAAGAACA 
GATGTAGCCA 
GTAAATATCC 



TAAGATTCAG 
AAGCTCAAGA 
TAGAAACGCC 
AAGCTCCCCA 
GCAGCATAGC 
TCGACATCTT 
ATCTGATAGA 
GAGGATTCGG 
AGGGAACTAA 
AAGGGCAGCT 
CCTTGTCTTT 
TCTGTTTCCT 
AGCTATTGCT 
TCTGTGCCGA 
TAGCCGACGT 
GAGAAGAAGG 
TAGCATTCGA 
TGTATGCAGC 
TATACGACGG 
GCGGATTCAG 
TTTTTGTCGG 
GCTATTCGGG 
CGATTGCCTT 
TCTTCATGTA 
ATCTGGCCTC 
TTCACCGTAG 



CCTCTTGCAG 
GGAAGGTATT 
CAAGAAAGTC 
TGAGGCAAAG 
CTACTCCGAG 
GGACGAAGCA 
CAAAACGCTC 
CCTCTCCGTT 
GGTGACCGAT 
CTTCATCGGA 
GAATTTGCCG 
GCTCGCCGAT 
CTCTTCCTCC 
GAATCGAATT 
CCTTGGTGTC 
TTTGGCTTCA 
CGGACCGGGG 
CAGCGGACTC 
CAACCGATGG 
TTTCAATGAT 
TACATGGGGA 
AAACGAAACT 
TGACAATAAG 
CGATCCGCAG 
CTTCGGCAAT 
TGGCGGATCC 



GCTCTTGTCG 
TGGAATACCC 
TTTGCCGTAG 
ATCTTCGACC 
CAACTAAAAT 
GGCCGTGTGA 
AATCGCCTTT 
CTGGATGTCG 
GTGGCTAAGT 
AAAGAGACCG 
ATGGGCTCGG 
GGCCGTGTAT 
GCCGATTCAC 
TATTTCATAG 
GGTGCCATGA 
CTTCTTCTCG 
GAC7\ATGATT 
TGGGGAACAA 
ACTAACTTCG 
GCTATCGATA 
AACGGTCTGT 
GCTATCGCAG 
GGCAACCTCT 
AGTAGCACAT 
ATGATTATCC 
ACGCGCAA^G 



TCTGCTTATT 
TCTTGGCTAT 
CCAACGGAGT 
GTATCAGCGG 
CCTTGGTCAT 
CCAACGTACC 
TGATCGTAGG 
CCGAAGCTCG 
TGGACAATGA 
ATAACCTGCA 
TCACCGGTCT 
ATGTCGCTGC 
GACTGTATGT 
AAAAAGGTCG 
ACGAAAGCA/i. 
CAGAGGG/iAG 
TCTACGAGAT 
ACCTGATGGG 
ACAAGAAGAC 
TAGCTGTTTC 
TCGAATTCAA 
AATGTAATCC 
GGGGGACGCT 
GGCATTCTTT 
TACCCAACGG 
GTGTCTTGAT 



GTTCACCTCT 
CCAC/U\GACG 
ACTTTACTCG 
ACTCAGCGAT 
ATACTATGCA 
TGCATTGAAA 
C7\ACAGGGCT 
CATACCGGCT 
TCGCTTGCTG 
AGATCCGGCC 
GGGCATTGTC 
AAACCAATCG 
GACGGATCGT 
CAAAACGACA 
TACGGCATAC 
TACGGCCGAA 
GCGGTTTAGT 
ACATGCCGGT 
CGTACAGGAA 
CAACGGAGAC 
GGATGGCAAA 
CGGAGATGCC 
CGGTGCCGTA 
CAGCTATCCG 
AGACAAATGG 
CTTCAACGAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
7 80 
840 
900 
960 
102 0 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



8 
P 

p 
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10 



15 



20 



CGGGGTACAC CGGAAACGAC TTCGGACGAC AGCCATCTTT ACGTCGAGCA GTTTGTCAAT 
CGCCTCGGGG CAGCCATAGG ACATAAGACT ATCTATGCAA TGGCCGTCGA TCATAACGGC 
TCTGTCTGGA TGGGATCGGA TATAGGCATT TTCGGCGTCT ACAATGCAGC CGGAGTATTG 
TCCTCGACTT CTACCCCTAT CGCTGTTCGG CCGGTCGGAG GAGAAGAACC CAATTTGTAC 
TATGTGCTGG ACAAGGTGAC GGTGACAGAC ATCGTCGTGG ACAAACTCAA TCACAAATGG 
GTTGCCACCC AAGGGACAGG ACTCTATCTC CTTTCGGAAG ATTGCAGTAA GATCCTCGCG 
CAATTTACCG TAGAAAACAG CCCTTTGCTT TCTAACAACA TACTATCCCT GGCCTTAAAT 
GACGATAACG GACTGCTGTA CATCGGTACG GCGGACGGAC TGATGACGTT CCAAACGGGT 
ACGGGGAGTG GATCAGCTTC CGAACTGGAC GGCGTCTATG TATACCCCAA TCCGCTAAGG 
CCGGAATATC CCGATGGCGT CACCATTGCC GGACTGCAAG CCGGCTGTAG TGTCAAAATC 
ACCGATACCA CCGGCAGACT GCTATACCAG ACTGAGAGCG TAACCACCGA AGTCAAATGG 
AATGCTCGAG GTGCCGATGG CAATAGGGTA GCTTCGGGCG TATATGCCGT TGCAGTGTAC 
GATCCGGTAT CGAAAAAGTC CAAACTAATT CGCTTCGCAG TGATTCGC 



(2) inrORilATIOI-J FOR SEQ ID UO:2 59 



(i) SEQUEtlCE CHARACTERISTICS: 

(A) LENGTH: 3414 base pairs 
<B) TYPE: nucleic acid 

(C) STRAIIDEDMESS: double 

(D) TOPOLOGY: circular 



1620 
1680 
1740 
18 00 
1860 
192 0 
1980 
2040 
2100 
2160 
2220 
228 0 
2328 



25 



30 



<ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: HO 

{iv} AlITI-SEIISE: NO 

(vi) ORIGIIJAL SOURCE: 

(A) ORG AM ISM: PORYPHYROMOMAS GIMGIVALIS 



35 



(i:-:} FEATURE: 

(A) MAME/KEY: mi sc_f eature 

( B) LOCATIOU 1 . . .3474 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 2 59 

ATGAAACGAA TACTTCCAAT AGTCGCATTC CTTTCTCTCT TCCTTGCCCT TGCTTTGCCT 60 

40 GCGAAAGCGC AACGAGCTAT GGGGAAGACG GCCGACCGTT CGCTAATGGC TTCGGGACAT 12 0 

TGGGTCAAGA TACGTGTCGA TGCAAGTGGA GTGTATCGCC TTACGGACGA ACAGCTCCGT 180 

GCCAATGGCT TCTCCGATCC GTCCAAGGTA GGTGTGTTCG GTTATGGTGG AGGGGTGCTT 24 0 

CCCGAAGATC TGAGCCGGAT CACGACAGAC GATTTGCCTC CGGTACCGGT ACTCCGTCAG 300 

GGCAATGCGC TGTATTTCTA TGCCGTGGGC CCGGTGACAT GGTTCTACAA TCCGGCCAAA 360 

45 ACCACCATGG AGCATACGGT GAATACATAC AGTACGCATG GCTACTACTT CCTGTCGGAT 4 20 

GCTGCCGGAG CACCTTTGCA GATGTCCCAA TATACGGGTG GAGGTGCGTC GGCCGAGGCT 4 80 

TTGATCGACT ACTACGATGA GCTGATGCTC CATGAACAGG AATTGTATTC GCCCPJKAGAA 54 0 

TCGGGACGAG ATCTGTATGG CGAGTCTTTC AGTGCAGTCA ATACGCGTAC GGTCAAGTTC 600 

CCTTTGAGGG GCAACACCCG CTCGTCTGGC GAACTCGGTA CCGTATTCTC ATACATAGCC 6 60 

50 AAGGCCAGAT CGGCCGGTGG CGGCCGTGAG ATGTCGCTCT CGGCGAATGG CATTCTGATC 7 20 

TTCAGCGATC CTTTTTCCAT GACATCGAAT GAAGTCTCCA ATTCCTATTT GGCCGGCAAG 7 80 

AAGCGTCGTC TCTATCACAG TACGCCGATG AACAGCTTGG TCAATGAGTT GCGCTTGGAC 84 0 

GCG/\ACTATA GCATGACAGG AGATGCGGTC AATCTGGATT TCATAGAGGT GGCTACACAG 900 

AACGACCTCC GGTACGATGG CGCACCCATG CATATCAGGC GGTTTTCCAA TTTGCCCGTT 960 

55 TTGGGGGGCG AGTCCTGCCG GTTCGTTATC AGTGAGGTGC CGGAGTCTCT GGTGGTTTTG 1020 

CAGGCCAATT CTTCCCTGAC AGCATCGCTT GTTCCCGTTA AGACTGTCGG GGATAAGACC 1080 

ATTGAGTTCG TGGCTCCGCC GAAGGGTCAG GATCGTAGGA CTATCAATAC GTTTTATGCC 114 0 

GTGGACTTGT CACAGGCTTC TGCTCCGGAG ATCCTCGGAG CGGTACCCAA TCAAAACCTG 12 00 

CATGGAGAGG AAATCCCTGA TCTGATCATT GTCTCTACTC AGGCGCTCCT CCTTGAGGCT 12 60 

60 GATCGACTGG CCACCTATCG TAGAGAGAAA AACGGGCTGA AGGTTTTGGT CGTGTTGCAG 132 0 

GAACAGGTGT TCAACGAGTT TTCGGGTGGA ACTCCCGATG CTACAGCATA CCGCCTCTTT 1380 

GCCAAAATGT TCTACGACAG ATGGAAGGCA AATGCACCTG TGGGAGAGAC CTTCCCGATG 14 4 0 

CAAATGCTTC TCTTCGGTGA TGGGGCTCAT GACAACAGGA AGGTCTCCGT AGCTTGGCAG 15 00 

AAACCGTATC TCCAACAAAC GGAGTTCTTG CTGACATTCC AAGCCGTCAA TTCGACGAAC 1560 

65 GTAAACAGTT ATGTGACGGA TGATTACTTC GGCTTGCTGG ATGATCAGCC GGCCTCGGTC 162 0 

AATATCGGTT GGCGCAATTA TAATATGGCT GTAGGGCGAT TCCCCGTACG TACTCCGGCC 1680 

GAAGCTCGCA TCGCAGTGGA CAAGACCATC CGATATGAGG AGGATCGAGA GAGTGGTGCC 17 4 0 

TGGCGTATTC GTGCCTGTTT TGCGGCAGAC AACGGGGACA AGCACGC;iALC CGAGACTTCC 18 00 

CGTTTGATCG ATACCGTCAA GCGTTATGCT CCTGCCATCA TGCCGGTACG CGCCTTTCAG 18 50 

70 GACGTATATC CGCATGTCAT CGAGAACGGG TTGCACAGCA TTCCGGGTGC AAAGAAAAAG 1920 

ATGCTGGAAA CCCTTCAGTC GGGTATTATC CTGCTTAATT ATGCTGGTCA TGGCGGTCCT 198 0 

GCCGGATGGT CGGACGAGCA TTTGCTGACG CTCAACGATA TACACAAATT CAATTATAAG 204 0 

CATATGCCCA TTTGGATTAC TGCCACGTGC GACTTTGCCA ACTATGACAG TCAGACGACC 2100 

TCGGCAGGGG AGGAGGTTTT CCTCCATGAG AAGAGTGGCA CTCCGATCAT GTTCTCGACT 2160 

75 ACGCGTGTCG TTTACAATAC GCAGAATGAG AAGATCAATG GTTTTATGCT TCGGCGTATG 2220 
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TTCGAGAAAG CTAAGGATGG GCGTTATCGT ACGATGGGCG AGATTATCCG ATCGGCCAAA 2280 

CAGGGGATGC TCAGTACTGT TTTCCCCGAT TCGATCAACC AGTTGAGTTT CTTTCTGATG 2 3*10 

GGTGATCCGT CCGTGCGTAT GAATCTTCCT ACCCACAAAG TGCAATTGAC CGCAATCAAC 2 4 00 

GGGCAGGATC CCGAAGGGCA GTATGGAACT ATTATGCTCA AGTCTTTGGA ACGGGTAGCT 2 4 60 

5 CTGAAGGGTA AGGTAACCGA TGAAAAGGGG ACATTCGACG AGACATTCAG TGGCAAGGTT 2 520 

TTCCTGACCG TCTTCGATGG CAGAAAGAAA ATGACAGCTT TGGAAGAGGA GGGAAACGAT 2 58 0 

CTCTCTCTTG TATATTATGA CTATCCTAAC GTGATGTATG CCGGTATTGC CGAGGTGAAA 2 64 0 

GACGGACTCT TCGAAACTTC GTTTATCGTA CCCAAGGATG TGAACTATTC CGAGCACGAA 27 00 

GGCCGGATCA ATCTTTATGC TTATAACGAG AGCACAAAGG CGGAAGCCAT GGGGGTAGAC 2160 

10 TTCTCCATCA GAGTCCAJVCC GGGTATTCCT GATGAGGTAA CGGAAGATAA TACACCGCCT 282 0 

GAAATCATAA GCTGCTTCCT CAATGACAGT ACATTCCGAT CGGGAGATGA GGTTAATCCT 2 8 80 

ACTCCTCTGT TTATGGCCGA AGTATTCGAC TTGAATGGAA TCAATATCAC GGGTAGCGGA 2 94 0 

GTAGGGCATG ATATTACGCT TTGTATCGAT GGCCGTGCCG ACCTGACCTA CAACCTC7\AT 3 000 

GCATATTTCA CAA/3TTCGGC TACGGATGCA GGTGTGGGCA CTATTCTCTT CATGATACCG 3060 

15 GCTTTGGCCG AAGGAGATCA TACTGCCCGA CTGACGGTTT GGGACATTTT CAATAATGCC 312 0 

GTCCATCATG ACTTTTCATT CAGAGTGGTA GATGGCATTG CTCCGGATGT GGCTGATGTG 3180 

ATTCTATTCC CGAATCCGGT ACGCGAGAGT GCTACGTTCC GAATCTTCCA CAATCGCCCC 32 4 0 

GGAAGCGATT TGAACGTGGC CGTGGAGATC TATGACTTCA CCGGTCGTCT TGTGAACAGT 3 300 

TTGCCAGTCA AGACCTATTC GTCTTCCTAC GGAGAACCTA TAGAGATCAA GTGGGATCTG 3360 

20 ACCTCCAAAT ACGGAGTGAA GATCGGAAAC GGATTCTACC TCTATCGTTG TGTGGTGAAC 342 0 

TCTCCCGGAG GACAGACGGC CTCCATGGCC AAGAAAATGA TCGTGGTAGG ACAA 34 7 4 



25 



35 



40 



(2) in FORMAT ion FOR SEQ ID HO: 2 60 



(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 2883 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIJDEDNESS: double 
30 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: HO 

(iv) AMTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROHOHAS GINGIVALIS 



(ix) FEATURE: 

(A) NAIIE/KEY: misc_feature 

(B) LOCATION 1. . .2883 

45 {>:i) SEQUENCE DESCRIPTION: SEQ ID ! JO: 2 60 

ATGAAGAAAC TTTTCCCATT ATTATTACTC ATATTGTCGA TTTTGGTCGG ATGTGGAAAA 60 

AAAGAAAAAC ACTCTGTAAC TGAAATCGCC CGAGAGAA7\A AGCGTATTAC TGCATTGCTG 120 

TACGAAAAAG AACTCCCCAC TGATTCTGTT AAACAGCTTT ACGAGAACAG CGTACAGAAC 100 

50 AAGAACCTTG TGGGACAAAT GTTATTCGCG ATCGAGGTCG GCAAACGGAT GCGTAATATG 2 40 

TCCCAATACA CAGATGCGAT GCTATATCAC CAAAAAGGGT TGAACGCTGC ATTGAATCTA 300 

AGGGACACCA TCGTAGCCGC ACAAGCATGG AATCATCTCG GAACGGATAG CCGTCGTATC 360 

GGTGCTTTGG CAGAAGCTTC GGATTATCAT TACAAGGCTC TTTCCTTGAT AGAATCTTTT 4 20 

AGCGGAAACC AGAATAGGCC TGCGATCAAA GCCAGATCGG CGGCCCTG7\A CGGCATCGGC 4 80 

55 AACATCAATC TTGAGTTAGG ATACCATGAT GAGGCCGAAA AGAATTTCCT GAAAGCACTG 54 0 

CAAGGTGAGA AAGAACTCGA CAGTCCTCTC GGGCAAGCTA TCAACTATGC GAACCTCGGA 600 

CGTATCTATC GACAACGCAA AGAATACGAC AAGGCTCGTA CCTACTTCCT CTTGTCTCTG 660 

GAGCAGAACA ATATGGCAGA GAATCTGATG GGTATCGGAC TCTGTAGCAT CAATCTCGGA 720 

GAAGTAGACG AAGAAAAAGG GGATTATCAA AAGGCTTTGC AAGAGTATGC CACGGCATAC 7 80 

60 AAACTGATGG AACAGTTGTC CGATCGATGG CACTGGCTGA ATTCCTGTAT CCCGATGGCA 8 40 

CGTATCAATC TCAAACAAGG TAACGAAAGG CTCTACCAGC ATTTCATTTC TTTGGCCGAA 900 

GGGACTGCGA AAGAAATTAA TTCGACTTCA CATCTGATAG AAATATACAA TCTTCAATAC 960 

GAGAATCTCG AGCGTAAAAA AGAATACAAA CAAGCCCTCG AAGCATTCTG TCTGAGCAAG 1020 

ACGTTGAGCG ACAGCATGTC CATTGCGCAC AAGGTCAGCA GCATACAAGA AACGCGATTC 1080 

65 AACTACGAAC GAAACAAGTC CCAAAAAGAG CTTGAAGAAA TACAGCAAGT AAGCAAGGCA 114 0 

AAACAAGAGA AATCGAAGTT TATCCTCTTG AGCACTCTTT TTGCCCTTTT CATCTCGATT 1200 

CTTTTGATTT CTGTTCTGAC ATATGCATAC CGTCAGGGCA AGAAGCATAA CAAGCTGATC 12 60 

AAAGAGACGG ATAAACTTCG CTCCGGCTTT TTCACCGGTA TTACACACGA ATTTCGTACG 1320 

CCTATCACCG TCATACAAGG TTTGAATGAG AAAATGAGTT CAAGTCCTGA TCTCCAAGCA 1380 

70 TCGGACAGAA CCGAGCTGCA CAAGATAATA GACAGACAGA GTAGCCATAT GCTGAATTTG 14 40 

GTGAACCAGC TGTTGGATAT TTGCAAGATC AGAAGCGGAG TATCCACGCC CGAATGGCGC 1500 

AATGGCGACA TCGTCTCCTT CGTACAGATT CTCATCGATT CGTTTGCACC ATACGCACAG 1560 

GCTCAAGACA TAACCTTGGA GCTACAACCC GAGAGCAAAC CTATTGTCGT GGACTTCGTC 1620 

CCCTCCTACT TGCAAAAAAT CATATCCAAT CTTTTGTCCA ATGCCATCAA GTATTCTTTA 1680 

75 GCCGGAGGGA GAGTGGTCAT ATCTCTGGCA AAAACCAAGA ATGAAAAAAA TCTGATCATA 17 40 
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CGCGTTGCAG ACAATGGCAT AGGAATAGAT 
TTCTATCGAG GACAGTCCGC TACCGAAAAG 
AATATACTGG TCGAAAACCT TCGAGGTACG 
AGTGCCTTCA CCATCAGTAT TCCTACACAA 
5 TGGCTACCCT CCTCCGATGA CATTGTCATG 
ACATCTCCGA TGGTAGCAGC TCTGAATCAT 
CTCGTCGAGG ACAATAAGGA TATCAACCTG 
AATGTGCTAT CCGCCGCAAA CGGAAAAGAG 
GACATTATCA TTACGGATAT TATGATGCCG 

10 ATGAAGCAAT CGCCTCTGCT CTGTCACATT 
GAACAGGACA GATTGGAAGG AATCAAAAGC 
TCTCCGGAGG AGCTTTTGAT GCGGATCGAG 
AAGTTCTATA TGCAAAAACT CATGCTGGAT 
GATGACAGCA GTATGCAGTT TCTCCTTGCT 

15 CAAAATCCGG ATTTTTCCGC TCAAGACTTG 
CTCAACAGAA AGCTCACGAG TGTCGTAGGT 
AAGATAAAAT TGGCCTGCAA GCTCCTTGCC 
ATTGAGGCAG GCTTTTCGGA TCCGGCTTAC 
TGCTCTCCCT CCCAATATCG GCAAAAACTC 

20 GTT 



AAAACTGATC AGGCTCATAT CTTCGACATC 1800 

CATGGATCAG GCGTCGGACT CTCGTTTACC 1860 

ATCAAAGTGG AAAGCCAGCC GGGGAAAGGA 1920 

AACCAGTCCT CTTCGGCAGA GATTCTTCCT 1980 

CCTGTCCACA TCGCGCCCGA TGACTCACCG 20^0 

CGCTTCGAGG ACGAACGTCC GACCATACTG 2100 

CTCGTCAAAC TACTCCTTTG CGATCGCTAC 2160 

GGTATAGCCC TCGCTACCGA GCATATTCCC 2220 

ATAATGGATG GGATAGAAAT GACAATCCGG 22 80 

CCCATTGTCG CTTTGACGGC CAAGAGTACC 23A0 

GGTGTAGTCT CTTATCTATG CAAGCCATTC 24 00 

CAGCTTCTGA AAGACCGTGA GTTGCTCAAG 24 60 

CGGAAGCCGG AGGAGGAGCC TCAACCGATA 2520 

GCCAAAGATG CAGTGTCCGG TGGAATCAA/^. 2580 

GCCG.i\7VAAAA TGTGCATGAG TCCATCCCAA 264 0 

TGCTCCACCA TCGGCTACAT ACAGCAGATC 27 00 

GATGAGAGCA AAAACATCTC CGACATTAGC 2760 

TTCTCTCGCA CCTTCAAACG CTACATGAAC 2820 

CTTGCCATGC CGGGGAGCGA CAAGGAGACA 2 880 

2883 



(2) 1 11 FORMAT I ON FOR SEQ ID 110:261 



25 
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(i) SEQl'EIJCE CHARACTERISTICS: 

(A) LENGTH: 1668 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDHESS: double 

(D) TOPOLOGY: circular 

(ii) nOLECULE TYPE: DHA (genomic) 
(iii) IIYPOTHETICAL: NO 
{ivj AMTI- SENSE: NO 



40 



45 



50 



55 



60 



65 



70 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

( i X } FEATURE : 

(A) NA14E/KEY: mi£C_f eature 

(B) LOCATION 1 . . ,1668 

(xi) SEQUENCE DESCRIPTION: SEQ ID 110:261 



ATGAAAAAGC 
GCAATCGCCC 
ATGCAACGAG 
TACGAACGCT 
AAAGAGCTGG 
ACCGTTATAA 
GCGAAAACTG 
AACAAAGTAG 
CCCAAATACG 
GGTGGCAACT 
GAGAACTCCT 
ACACATCATG 
GGCAAGTATT 
CACCAAGCCC 
AAGTACGAGG 
CTGAACAACA 
AACGTCTATA 
ACACCTTG^GT 
TATCTCTATA 
GAAGCAGATG 
ATCAATGGTT 
ACTTATAGCT 
GACAATAGTG 
ACGTGTATGA 
TGGTTCAACG 
TATCGGATAA 
GTAGCAGGGA 
CTGGTTGTTG 



TTTTACAGGC 
AAACGCAAAT 
CATTCCAGGA 
CTGCAGCCGT 
CCAAGAACGA 
CCCAGTACAC 
ACTCTTACTG 
GTCTCGTGGA 
AAGCACAATA 
ACATGACGGA 
CTCTGTCTCA 
ATGTGGTACA 
TGGCACCGAA 
TGGAAGATAT 
TATATCGCGC 
GGGTATTTGT 
AGACGGCAAT 
TAGGAACAGA 
TC7»AGCACTA 
TCGTCTCATG 
CCGGTAGCTT 
TTACAGGTCT 
GTCGCAAAGA 
ACGAAACCAA 
CCGGTCGTTC 
AGCTTTATAA 
CG AGTGTCTT 
AAGGAAATGG 



TAAAGCCTTG 
GCAAGCAGAC 
7VACGAATCCC 
TTTGGTACGC 
CAAGGTGATT 
CCAAAGCGGT 
GACACGCGAC 
CTTTATTTAT 
TCTGGGCATC 
CGGATATGGA 
AGCTCAAGTA 
AGATCCGAAC 
CAAAATCCTC 
GGCAGCCTAC 
TTTGGCCACC 
TCCTGTCAAT 
GCCCGGTTAC 
TGCCCTGCAT 
CCCGATACTG 
CGCCAATGCT 
TAAGGCTGCT 
TAACAAGAAT 
GACTTATCCC 
TACATGTACT 
AGAACTGGCT 
CACCGCAGGA 
CAGTATGGAT 
AATCCGTGAG 



ATTCTGGCAT 
CGAACAAACG 
CCTGCAGGTC 
TACCCGTTCG 
ACCATTGTGG 
GTGAATCTCT 
TATACCGGTT 
AACCGCCCTC 
GAGATGTTCG 
TCCGCTGTGC 
AATCAAAAGA 
GGCGAATATA 
ATCAGGAAAG 
TTCGCAGCAC 
AATGAACAAC 
GGCCCCGCCT 
GAAATTATAG 
TGTCGTACTC 
GGCGAACAGG 
ACTATCTCGC 
GATATGACGA 
GATAAGGTAG 
TTTATCGGCG 
GTGACCGGAG 
GTTTCGGTAA 
GAAGAAGTCG 
GTGTATTCTC 
ACAATGAAAA 



TGGGACTCTT 
GTC7\ATTTGC 
CTGTGCGTGC 
GTATCCCGAT 
CGAGTGAAAG 
CTAATTGCGA 
GGTTCGCAAT 
GTCCTAACGA 
GGATGAAGCT 
AGTCACATAT 
TGAAAGACTA 
TCAACCATGT 
TGCCTGACAA 
AGACCTGCGC 
CGTACACGAA 
CCGTGGACAA 
GTGTCAAAGG 
ACGAGGTAGC 
CAGGCCCTGA 
CGGTACAATG 
TGGAATCAAC 
AATACTATAT 
AACCTGATCC 
CTGCCAAAGC 
GTTTGAATAT 
CTGCAATGAC 
AGGCTCCGGG 
TTCTCAAA 



CCAACTGCCC 
AACAGAAGAG 
TATCGCTGAG 
GGAATTGATC 
CCAAAAAAAC 
TTTCATCATT 
GTACGATACG 
TGATGAATTC 
CAAGCAGACC 
CGCATATACG 
TCTCGGCATC 
GGACTGTTGG 
TCACCCTCAG 
ATGGGGAACG 
CTCTCTGATT 
CGATGCTCTG 
GGCTTCAGGA 
GGATAAGGGC 
TTATAAGATC 
TTACTATCGT 
AGGTCACTAT 
CTCTGCCGCT 
TTTCAAGTTT 
TCTTCGTGCA 
TGCCGGCACA 
CAAGGAATTA 
CACATATGTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
12 60 
1320 
1380 
1440 
1500 
1560 
1620 
1668 



75 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1 J_> 



wo 99/29870 
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180/490 



(2) III FORI I AT lOI J FOR SEQ ID MO: 2 62 



10 



(5) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 128J base pairs 
{B) TYPE: nucleic acid 

(C) STRAl IDE DUES S : double 

(D) TOPOLOGY: circular 

(ii) llOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: HO 



(iv) AlfTI-SEHSE: HO 

15 (vi) OR I GI DAL SOURCE: 

(A) ORG AH ISM: PORYPHVRO! lOlIAS GIHGIVALIS 



(ix) FEATURE: 

(A) I IE/ KEY: misc_feature 
20 (B) LOCATIOH l..,12B4 



{>:!) SEQUEtlCE DESCRIPTIOU: SEQ ID MO: 262 



ATGAAACTTT CATCTAAGAA AATCTTAGCA 

25 GTGCAGGCAC AGTTTGTTCC GGCTCCCACC 

AAGGCCGTAG GCGAAAAAAT CGAATTGTTG 

ATCGATCTCA ATGGGGATGC CACTTACCAA 

GCATACCACG AATACACGAT CGGGACGCAA 

TTGGGCTGTC GATCTACCGG TGCAACGGCT 

30 TATCTCGCAT GCCCGAAAAA TAATCTGAAA 

CTGCGAGTTT GGTGCGACTC TAACGAAATA 

TTGATCATCC TCGGCTGTGA CAGGAATAAG 

AAGTTGGCCT CTCTTTGGTG TTCTGATAAT 

CCTCGTCTCA ATGATCTTTG GTGCTTCGGT 

35 AATCCTCTAT TGGTAACACT TTGGTGCAGT 

AAGAATTCGG ACGTTGCTTA CCTTTGGTGT 

TCGGGGGTGA AGGGACTGAG TGTTTTGGTT 

ATGACGAAAG TGGTGAATGC TTTGCCCACA 

TTCGTCGTTG TAGACCTCAA GGACACTGAT 

40 GAAAAAGCTA AAAGTAAGAA CTGGCGAGTA 

CTTCCATACG AAGGAAGTCC GACATCGAAC 

TATCCCAATC CGGTAGGAAG ATATGCGCTC 

GAAGCTGCTT TATACGATAT GAATGGGGTA 

CGTCAGAACA TTGACCTGAC ACATCTTCCC 

45 TATACCACTA AGCTCATCAA ACAG 



ATCATTGCAT TGCTGACGAT GGGACATGCT 60 

ACAGGGATTC GCATGTCTGT CACTACAACC 120 

GTTCATTCCA TAGAGAAGAA AGGCATCTGG 180 

CAAGGAGAGG AAATAACCGT ATTCGATGAG 2-^0 

ACCCTCACTA TCTATGGTAA TACGACCCGA 300 

GTCGATGTAA CGAAAAACCC TAATCTGACC 360 

TCATTGGACT TGACGCAAAA CCCAAAGCTG 4 20 

GAAAGTTTGG ACCTGAGTGG CAATCCGGCT 4 80 

CTGACTGAGC TGAAGACCGA TAACAACCCC 54 0 

AACCTGACGG AGTTGGAACT CAGTGCCAAT 600 

AATCGGATCA CGAAACTCGA TCTGAGTGCC 660 

GACAATGAGC TTTCGACCTT GGATCTTTCC 720 

TCATCGAACA AACTTACATC CTTGAATCTG 7 80 

TGTCATTCCA ATCAGATCGC AGGTGAAGAA 84 0 

CTATCTCCCG GCGCAGGCGC TCAGAGC7\AG 900 

GAGAAGAATA TCTGTACCGT AAAGGATGTG 960 

TTTGACTTCA ACGGTGATTC TGACAATATG 1020 

TTGGCAGTAG ATGCTCCCAC TGTCAGGATA 1080 

GTCGAGATCC CCGAGTCTCT TTTAGGGCAG 114 0 

AAAGTCTATA GTTTCGCGGT AGAGTCTCTT 1200 

GACGGCACTT ATTTCTTCCG TCTCGATAAC 1260 

1284 



(2) inFORMATIOH FOR SEQ ID 110:263 

50 (i) SEQUEHCE CHARACTERISTICS: 

{A) LEHGTH: 930 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDMESS : double 

(D) TOPOLOGY: circular 

55 

(ii) MOLECULE TYPE: DHA (genomic) 
(iii) HYPOTHETICAL: HO 
60 (iv) AHTI-SEMSE: NO 



(vi) ORIGIHAL SOURCE: 

(A) ORGAHISH: PORYPHYROIIOHAS GIMGIVALIS 

65 ( ix ) FEATURE : 

(A) HfiJiE/KEY: misc_featur^ 

(B) LOCATIOH 1 ... 930 



70 



(>:i) SEQUEIICE DESCRIPTION: SEQ ID 1^!0: 263 



ATGAGAAAAA CAATAATTTT CTGCTTGTTG CTCGCCCTAT TTGGCTGTTC TTGGGCACAA 60 

GAAAGAGTCG ATGAAAAAGT ATTCTCCGCA GGAACAAGTA TTTTTAGGGG CATCCTTGAA 120 

AAGGTGAAAG CACCGCTTAT GTATGGAGAT CGTGAGGTAT GGGGTATGGC TCGTGCGAGC 180 

GAGGATTTCT TTTTTATACT TCCCGTTACG GATGACCTCA CTCCCGTGCT TTTCTATAAC 24 0 

75 CGTCTTACAA ACGAACCCTG CTTTGTGTCA GACCAAGGAA TAACTGAGTA TTTCAAATTC 300 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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wo 99/29870 
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GCTCAAGAAG GTGATTACAT TGAAGTCGAA GGAAGCTCTG TATTCATGGC GAATCTTTTG 360 

TACTATCGTT TTTTCCCGAC AAGAATTACC TCCTATAATG CTCCCATTGA AGGTGTTGTG 4 20 

AGCAAGACGG GAAATCCTGC TTTTACAATC CCGATGCTCC CGGGGGTTTC TGATTGCATA 4 80 

GAAATCTCAA ACAACCGCAA AGTCTTTCTG ACCAATCAAT TAGGGGTTGT AT^ACATCACT 54 0 

5 GACGGGATGG AACCTCCGAT TATTGCCGGA GTCTCTGCTT CCTATGGATC TTCCGTCCGG 600 

GTGTATGGTC ATGTCTCACA GCGGTGGGAC ATCATAGGCC ATTGCTATTT GGATATCTAC 660 

CCAACCAATT GCTATCCGCT CAGCACGAAA CCCGTTGCAG GAGACGATGA GGTTTTTGTC 120 

AAACAACAAG GCAGGCAAAT AGAGATCGAT AGCAACAGCC CCATAGTCCA AGTGGTCGTA 780 

TACGATCTTG AGGGGAAAAG TGTTTTTCGC AAAAGAATGA CCGAAAACGC TTATACCCTA 84 0 

10 TCCTTTAGAG CACCCATGCT CGGCTTTATG ACCATCATGA TCGAAACACA AAATTCGATT 900 

ATCAATAAAA AACTTAATGT TACACAGCTA 930 



15 



25 



(2) in FORMAT I OH FOR SEQ ID HO: 264 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LEllGTH: 1215 base pair:: 

(B) TYPE: nucleic acid 

(C) STRAHDEDHESS: double 
20 (D) TOPOLOGY: circular 



fii) I'lOLECL'LE TYPE: DfJA (genomic) 
(iii) HYPOTHETICAL: HO 
(iv) AMTI-SEII3E: HO 



(vi) ORIGIHAL SOURCE: 

(A) ORG AH ISM: PORY PHYROMOHAS GIHGIVALIS 



30 

ii>:) FEATURE: 



(A) HAME/KEY: mi sc_f ea ture 

(B) LOCATION 1 . . .1215 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID HO: 264 



ATGAAAAAAA C7\ACCATTAT TTCTTTGATT 
CAAACCAAGG ACAATTCTTC TTACAAACCT 
TACTCTCTCC CGACTCAAAA TCGTGCGCAG 

40 ACCGTCTCCA CAAACCAGTC TGCAGAT ACT 
ATCGCTCGTG ACATAAAAGC C7\ATGGGGTA 
GCCAACGGGC GAATATCGCA TGTGGATCTT 
CTCGACACCC GATTCAAGTA CACCTATGAT 
TTCATGCTGG TAATGGATCC GAATACACCT 

45 CAGGGCAGAC TGACCCACTG GATTTCTTTT 

TATCACTATA ATGAAAAAGG TCTGTTGGTC 
ACCTATTCAG ACACCGGCAA AACGGAATAC 
GCCGAGTACT TCGTCGTCCA GCAAGGAAAG 
ACCTATGAGG ACAATATCTG CATACAATAT 

50 TACAAGCGAG ACATCGAGAG CGATAAGTCC 
ATGCCGGAAC AOACCTGGCC TAATATGTAC 
TATTCCTCCT ACGAAGGAGA TGTGGCTACT 
GCTCTTACCT CAA.TGGC7\AC ACCTTCGACA 
TCAACGGACC GGTTAGTGAT TCTGGCCAAC 

55 CAGGGTAAGC TTATCCGTGA TTGTGCCTTG 
TCTTTGACCA AAGGGACATA CCTGCTTAAA 
AAAGTCGTGA TTCGA 



GTCTTCGGTG CTTTCTTTGC AGCCGTGGGC 60 

TTTTCGAAAG AAGATATTGC CGGAGGAGTT 120 

AAGGACAATG CCGAGTGGCT TCTTACAGCG 180 

CACTTTATCT TCGATGAGAA CAACCGCTAT 24 0 

AGAAAATCCA CGGACTCCAT TTACTACGAT 300 

TATATCTCGT TCAGTGGCGG AGAGCCTGCA 360 

GACGAGGGAA AGATGACCGT GAGGGAAGTA 4 20 

ATCTCACGCT TGGAATATCA TTATGATGCA 4 80 

GCTTTCGGGG CAGAATCCCA AAAGAATACG 54 0 

AGCGAAGTGC TGAGCAATGC AATGGGGACA 600 

AGCTATGACG ATGCAGATAA TATGGTGAAG 660 

GCATGGCAAG TACTCAAAAG AGAGGAATAC 7 20 

TTGGCTATTA ACGGTACCGA CACAAAGGTG 7 80 

ATCTCCGCAA ATGTCATTGA CATTCCGTCA 84 0 

GGATTCAACG CAAAGCGACT GAAAGAGACT 900 

CCTATATTCG ACTATATCTA TACGTACAAG 960 

GAAGCTCAGG TAGCAGTCTA TCTCAATCCG 1020 

GGCATCACAC ATCTGAGCAT GTACGACTTG 1080 

AGCGGCGATA AGGTGGAAAT GGGTGTCGGA 1140 

GTGAATACGG ATCAGGGAGC CTTTGTGAGA 1200 

1215 



60 (2) I H FORM ATI OH FOR SEQ ID HO: 265 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LEr(GTH: 454 amino acids 

(B) TYPE: amino acid 
65 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



70 



(vi) ORIGIHAL SOURCE: 

(A) ORGAHISM: Porphyromonas gingival is 



(i;-:) FEATURE: 
75 (A) NAIiE/KEY; misc feature 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1^I_> 



wo 99/29870 
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(B) LOCATION 1 ... 454 

{>:!) SEQUENCE DESCRIPTION: SEQ ID HO:265 

5 Phe Cys Val Met Ala Lys Val lie Lys Thr Lys hys Gly Leu Ala Leu 
15 10 15 

Asn Leu Lys Gly Lys Pro Leu Pro Glu Met Leu Ala Glu Pro Ala Gin 

20 " 25 30 

Ser Pro Thr Tyr Ala Val Val Pro Asp Asp Phe Glu Gly Val lie Pro 
10 35 40 45 

Lys Val Thr Ala Arg Pro Gly Asp Lys Val Arg Ala Gly Ser Ala Leu 

50 55 60 

Met His His Lys Ala Tyr Pro Glu Met Lys Phe Thr Ser Pro Val Ser 
65 70 75 80 

15 Gly Glu Val lie Ala Val Asn Arg Gly Ala Lys Arg Lys Val Leu Ser 
85 90 95 

lie Glu Val Lys Pro Asp Gly Leu Asn Glu Tyr Glu Ser Phe Pro Val 

100 105 110 

Gly Asp Pro Ser Ala Leu Ser Ala Glu Gin lie Lys Glu Leu Leu Leu 
20 115 120 125 

Ser Ser Gly Met Trp Gly Phe He Lys Gin Arg Pro Tyr Asp He Val 

130 135 140 

Ala Thr Pro Asp He Ala Pro Arg Asp He Tyr He Thr Ala Asn Phe 
145 150 155 160 

25 Thr Ala Pro Leu Ala Pro Asp Phe Asp Phe He Val Arg Gly Glu Glu 

165 170 175 

Arg Ala Leu Gin Thr Ala He Asp Ala Leu Ala Lys Leu Thr Thr Gly 

180 185 190 

Lys Val Tyr Val Gly Leu Lys Pro Gly Ser Ser Leu Gly Leu His Asn 
30 195 200 205 

Ala Glu He Val Glu Val His Gly Pro His Pro Ala Gly Asn Val Gly 

210 215 220 

Val Leu He Asn His Thr Lys Pro He Asn Arg Gly Glu Thr Val Trp 
225 230 235 240 

35 Thr Leu Lys Ala Thr Asp Leu He Val He Gly Arg Phe Leu Leu Thr 

245 250 255 

Gly Lys Ala Asp Phe Thr Arg Met He Ala Met Thr Gly Ser Asp Ala 

260 265 270 

Ala Ala His Gly Tyr Val Arg He Met Pro Gly Cys- Asn Val Phe Ala 
40 275 280 285 

Ser Phe Pro Gly Arg Leu Thr He Lys Glu Ser His Glu Arg Val He 

290 295 300 

Asp Gly Asn Val Leu Thr Gly Lys Lys Leu Cys Glu Lys Glu Pro Phe 
305 310 315 320 

45 Leu Ser Ala Arg Cys Asp Gin He Thr Val He Pro Glu Gly Asp Asp 

325 330 335 

Val Asp Glu Leu Phe Gly Trp Ala Ala Pro Arg Leu Asp Gin Tyr Ser 

340 345 350 

Met Ser Arg Ala Tyr Phe Ser Trp Leu Gin Gly Lys Asn Lys Glu Tyr 
50 355 360 365 

Val Leu Asp Ala Arg He Lys Gly Gly Glu Arg Ala Met He Met Ser 

370 375 380 

Asn Glu Tyr Asp Arg Val Phe Pro Met Asp He Tyr Pro Glu Tyr Leu 
385 390 395 400 

55 Leu Lys Ala He He Ala Phe Asp lie Asp Lys Met Glu Asp Leu Gly 

405 410 415 

He Tyr Glu Val Ala Pro Glu Asp Phe Ala Thr Cys Glu Phe Val Asp 

420 425 430 

Thr Ser Lys He Glu Leu Gin Arg He Val Arg Glu Gly Leu Asp Met 
60 435 440 445 

Leu Tyr Lys Glu Met Asn 
450 



65 



70 



(2) IHFORMATIOM FOR SEQ ID WO: 266 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 
75 (vi) ORIGIIiAL SOURCE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_L> 
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35 



55 



60 



65 



70 



75 



(A) ORGAllISM: Porphyromonas gingival is 

(i;0 FEATURE: 

(A) NAllE/KEY: misc_feature 

(B) LOCATION 1. . .201 

(xi) SEQUEIICE DESCRIPTION: SEQ ID MO: 266 





Glu 


Leu 


Ser 


Lys 


Cys 


Tyr 


Met 


Asp 


Lys 


Val 


Ser 


Tyr 


Ala 


Leu 


Gly 


Leu 


10 


1 








5 










10 










15 






Ser 


He 


Gly 


Asn 


Asn 


Phe 


Lys 


Ser 


Ser 


Gly 


He 


Asp 


Ser 


Val 


Val 


Met 








20 










25 










30 








Asp 


Asp 


Phe 


Met 


Gin 


Gly 


Leu 


Ser 


Asp 


Val 


Leu 


Gl u 


Glu 


Lys 


Ala 


Pro 






35 










40 










45 








15 


Gin 


Leu 
50 


Ser 


Tyr 


Asp 


Glu 


Ala 
55 


Lys 


Arg 


Glu 


He 


Glu 
60 


Ala 


Tyr 


Phe 


Met 




Asp 


Leu 


Gin 


Gin 


Lys 


Ala 


Val 


Lys 


Leu 


Asn 


Lys 


Glu 


Ala 


Gly 


Glu 


Glu 




65 










70 










75 










80 




Phe 


Leu 


Lys 


He 


Asn 


Ala 


His 


Lys 


Glu 


Gly 


Val 


Thr 


Thr 


Leu 


Pro 


Ser 


20 








85 










90 










95 






Gly 


Leu 


Gin 


Tyr 


Glu 


Val 


He 


Lys 


Met 


Gly 


Glu 


Gly 


Pro 


Lys 


Pro 


Thr 








100 










105 










110 








Leu 


Ser 


Asp 
115 


Thr 


Val 


Thr 


Cys 


His 
120 


Tyr 


His 


Gly 


Thr 


Leu 
125 


He 


Asn 


Gly 


25 


lie 


Val 
130 


Phe 


Asp 


Ser 


Ser 


Met 
135 


Asp 


Arg 


Gly 


Glu 


Pro 
140 


Ala 


Ser 


Phe 


Pro 




Leu 


Arg 


Gly 


Val 


He 


Ala 


Gly 


Trp 


Thr 


Glu 


He 


Leu 


Gin 


Leu 


Met 


Pro 




145 






150 










155 










160 




Val 


Gly 


Ser 


Lys 


Trp 


Lys 


Val 


Thr 


He 


Pro 


Ser 


Asp 


Leu 


Ala 


Tyr 


Gly 


30 








165 










170 










175 






Asp 


Arg 


Gly 


Ala 
180 


Gly 


Glu 


His 


He 


Lys 
185 


Pro 


Gly 


Ser 


Thr 


Leu 
190 


He 


Phe 




He 


He 


Glu 
195 


Leu 


Leu 


Ser 


He 


Asn 
200 


Lys 

















(2) I14FORMATIOH FOR SEQ ID 110:2 67 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 amino acids 
40 (B) TYPE: amino acid 

{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

45 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAllISM: Porphyromonas gingivalis 

50 (ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1 . . .27 9. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267 



Glh 


Lys 


Asn 


Lys 


Arg 


Lys 


Met 


Lys 


Lys 


Ala 


Leu 


Leu 


He 


Gly 


Ala 


Ala 


1 








5 










10 










15 




Leu 


Leu 


Gly 


Ala 


Val 


Ser 


Phe 


Ala 


Ser 


Ala 


Gin 


Ser 


Leu 


Ser 


Thr 


li- 






20 










25 










30 






Lys 


Val 


Gin 
35 


Asn 


Asn 


Ser 


Val 


Gin 
40 


Gin 


Pro 


Arg 


Glu 


Glu 
45 


Ala 


Thr 


ne 


Gin 


Val 


Cys 


Gly 


Glu 


Leu 


Ala 


Giu 


Gin 


Val 


Asp 


Cys 


He 


Gly 


Thr 


Gly 




50 






55 










60 










Asn 


Ser 


Ala 


He 


He 


Ala 


Ala 


Ala 


Ala 


Lys 


Phe 


Glu 


Ser 


Asp 


Asp 


Leu 


65 










70 










75 










8 0 


Glu 


Ser 


Tyr 


Val 


Gly 


Trp 


Glu 


He 


Met 


Ser 


Val 


Asp 


Phe 


Phe 


Pro 


Gly 








85 










90 










95 




Tyr 


Lys 


Ala 


Cys 
100 


Lys 


Tyr 


Thr 


Ser 


Ala 
105 


Val 


Trp 


Ala 


Asp 


Asp 
110 


Met 


Thr 


He 


Leu 


Gly 
115 


Gin 


Ser 


Glu 


Asp 


Ser 
120 


Asp 


Pro 


Glu 


Met 


Gin 

125 


Thr 


He 


Asn 


Asn 


Leu 
130 


Ala 


Leu 


Lys 


Thr 


Ser 
135 


Val 


Lys 


He 


Glu 


Ala 
14 0 


Gly 


Lys 


Asn 


Tyr 


He 


Val 


Gly 


Tyr 


He 


Ala 


Asn 


Thr 


Ala 


Gly 


Gly 


His 


Pro 


He 


Gly 


Cys 


145 








150 










155 










160 
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184/ 490 



35 



60 



65 





Gin 


Gly 


Pro 


Ala 
165 


Val 


A^p 


Gly 




Gly 
170 


Asp 


Leu 


Val 


Ser 


He 
176 


Ser 


Glu 


Asp 


Gly 


Gly 
180 


Ala 


Thr 


Phe 


Pro 


Pro 
185 


Phe 


Glu 


Ser 


Leu 


His 
190 


Gin 


Ala 


Val 


Pro 


Thr 
195 


Leu 


Asn 


Tyr 


Asn 


He 
200 


Tyr 


Val 


Val 


Val 


His 
205 


Leu 


Lys 


Lys 


Gly 


Glu 
210 


Gly 


Val 


Glu 


Ala 


Vai 
215 


Leu 


Thr 


A5n 


Asp 


Lys 
220 


Ala 


Asn 


Ala 


Tyr 


Val 


Gin 


A5n 


Gly 


Val 


lie 


Tyr 


Val 


Ala 


Gly 


Ala 


Asn 


Gly 


Arg 


Gin 


Val 


225 










230 










235 










240 


Ser 


Leu 


Phe 


Asp 


Met 

245 


Asn 


Gly 


Lys 


Val 


Val 
2 50 


Tyr 


Thr 


Gly 


Val 


Ser 
255 


Glu 


T!ir 


lie 


Ala 


Ala 
260 


Pro 


Gin 


Lys 


Gly 


Met 
265 


Tyr 


He 


Leu 


Arg 


Val 
270 


Gly 


Ala 


Lys 


Ser 


lie 

'^7 5 


Lys 


Leu 


Ala 


He 





















10 



15 



(2) INFORMATION FOR SEQ ID NO: 268 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
30 <A) ORGANISM: Porphyromcnas gingival is 

(ix) FEATURE: 

(A) NAME /KEY: mi sc_f ea ture 
{ B) LOCATION 1 ... 157 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 68 





Arg 


Unk 


Phe 


Leu 


Pro 


Glu 


Lys 


Ala 


Leu 


Tyr 


He 


Gly 


Cys 


Arg 


Val 


Glu 


40 


1 








5 










10 










15 




Thr 


Gin 


Glu 


Gly 
20 


His 


Ala 


Val 


Gly 


Phe 
25 


Gly 


Leu 


Asp 


Asp 


Gly 
3 0 


Pro 


Ala 




Met 


Lys 


Gly 

35 


Lys 


Gly 


Asp 


Leu 


Val 


Gly 


Ser 


Tyr 


Leu 


Pro 
45 


Gly 


Ala 


Ala 


45 


Pro 


Met 


Pro 


Phe 


Val 


Pro 


Leu 


Ser 


Asp 


He 


Pro 


Ala 


Arg 


Ser 


Met 


Asp 




50 










55 








60 












Ala 


Asn 


Phe 


Tyr 


He 


Tyr 


Ser 


Arg 


lie 


Ser 


Leu 


Gly 


Ser 




Thr 


Gin 




65 










70 










75 










80 




Asp 


Val 


Leu 


Gin 


His 
85 


Arg 


Met 


Lys 


Val 


Tyr 

90 


Pro 


Asn 


Pro 


Ai a 


Thr 
95 


Thr 


50 


Glu 


Leu 


His 


Val 
100 


Glu 


Ala 


Leu 


Ser 


Ala 
105 


Trp 


Val 


Gly 


Glu 


Gin 
HO 


Ala 


Ala 




Val 


Tyr 


Asp 
115 


Met 


Arg 


Gly 


Arg 


Arg 
120 


Val 


Ser 


Ala 


Arg 


Thr 

125 


Val 


Asp 


Ser 


55 


Glu 


Lys 


Leu 


Cys 


He 


Asp 


He 


Ala 


Ser 


Leu 


Pre 


Val 


Gly 


Val 


Tyr 


Met 




130 










135 










140 












Leu 


Arg 


He 


Gly 


Ser 


Tyr 


Ser 


Ala 


Lys 


Phe 


Glu 


Lys 


Arg 










145 










150 










155 













(2) INFORMATION FOR SEQ ID NO: 2 69 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 562 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



70 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
75 (B) LOCATION 1 . . . 562 



BNSDOCID: <WO 9929870AUL> 
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(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 269 





Thr 


Arg 


Asn 


Val 


Ser 


Leu 


He 


Lys 


Met 


Pro 


Arg 


He 


Met 


Lys 


Leu 


Lys 


5 


I 






5 










10 










15 






He 


Ala 


Leu 


Arg 


Leu 


Leu 


Leu 


Ala 


Thr 


Phe 


Ala 


He 


Val 


Leu 


Phe 


Ser 










20 










25 










30 








Pro 


Leu 


Ala 


Lys 


Ala 


Gin 


Met 


Asp 


He 


Gly 


Gly 


Asp 


Asp 


Val 


Leu 


He 








35 










40 










45 








10 


Glu 


Thr 


Met 


Ser 


Thr 


Leu 


Ser 


Gly 


Tyr 


Ser 


Glu 


Asp 


Phe 


Tyr 


Tyr 


Lys 






50 










55 










60 












Met 


Ala 


Val 


Ala 


Asp 


Asn 


Gly 


Trp 


He 


Tyr 


Val 


Met 


Leu 


Asp 


Phe 


Ser 




65 










70 










75 










80 




Arq 


He 


Tyr 


Phe 


Asp 


Asp 


Val 


Arg 


Leu 


Tyr 


Arg 


Ser 


Lys 


Asp 


Gly 


Gly 


15 










05 










90 










95 






Ala 


Thr 


Tyr 


Gin 


Lys 


Leu 


Gly 


Ser 


Leu 


Gly 


Ser 


Leu 


Val 


Pro 


Tyr 


Asp 










100 










105 










110 








Phe 


Asp 


Val 


Ser 


His 


Cys 


Asp 


Phe 


He 


Val 


Thr 


Gly 


Lys 


Asp 


Glu 


Asp 








115 










120 










125 








20 


Asp 


He 


Asn 


Val 


Trp 


Thr 


Val 


Met 


Thr 


Ala 


Phe 


Glu 


Tyr 


Val 


Gly 


Gly 




130 










135 










140 












Thr 


He 


Gly 


Asn 


Gly 


Val 


Leu 


Leu 


Met 


His 


Arg 


His 


Asp 


Ala 


Asp 


He 




145 










150 










155 










160 




Asn 


Asn 


Thr 


Glu 


Cys 


Val 


Tyr 


Lys 


Lys 


Asp 


Phe 


Pro 


Asn 


Asn 


Arg 


Leu 


25 










165 










170 










175 






Met 


Gly 


Val 


Ala 


He 


Ala 


Ser 


Asn 


Tyr 


Arg 


Ala 


Pro 


Ser 


Pro 


Tyr 


Gly 










130 










165 










190 








Leu 


Gly 


Gly 


Asp 


Pro 


Phe 


Ala 


Leu 


Ala 


Val 


Ala 


Val 


Ser 


Gly 


Ser 


Gly 








195 








200 










205 








30 


Ser 


Asp 


His 


Ser 


Phe 


Leu 


Asp 


Tyr 


He 


Phe 


Ser 


Leu 


Asp 


Gly 


Gly 


Val 






210 










215 










220 












His 


Phe 


Glu 


Gin 


Lys 


Arg 


He 


Tyr 


Thr 


Arg 


Pro 


Gin 


Lys 


Leu 


Thr 


He 




225 










230 










235 










240 




Asn 


Arg 


Val 


Asp 


Leu 


Ser 


Leu 


Gly 


Ser 


Thr 


Ser 


Pro 


Ser 


Leu 


Gly 


Phe 


35 








245 










250 










255 






Asn 


Thr 


Trp 


Pro 


Leu 


Met 


Gly 


Val 


Val 


Phe 


Glu 


I'let 


Asn 


Lys 


Asn 


Leu 










260 










265 










270 








Asp 


Gly 


Phe 


Asp 


He 


Gly 


Phe 


He 


Ser 


Asn 


Phe 


Val 


Asp 


Tyr 


Asp 


Pro 








275 










280 










285 








40 


Arg 


Tyr 


Ala 


Trp 


Ser 


Glu 


Pro 


He 


He 


He 


Glu 


Glu 


Asp 


Cys 


Gly 


Trp 






2 90 










295 










300 












Thr 


Asp 


Phe 


Asn 


Pro 


Leu 


Gly 


Ala 


Leu 


Ser 


He 


Glu 


He 


Gin 


I-let 


Met 




305 










310 










315 










320 




Leu 


Asp 


Asp 


Asn 


Ser 


Asp 


Asn 


Thr 


Val 


Gly 


Gly 


Glu 


Arg 


Ser 


His 


Asn 


45 








325 










330 










335 






Phe 


Leu 


He 


Thr 


Tyr 


Pro 


Gly 


His 


Tyr 


Val 


Tyr 


Pro 


Lys 


Gin 


Ser 


Phe 










340 










345 










350 








Asn 


Tyr 


Ser 


Pro 


Gl- 


His 


Thr 


Pro 


Thr 


Lys 


Lys 


Asp 


Leu 


Val 


Phe 


Lys 


50 






355 










360 










365 








His 


~ J — 


He 


Gly 


ue 


Pro 


Ala 


Leu 


Ala 


Tyr 


Asp 


Lys 


Glu 


Gly 


Asp 


Arg 






370 










375 










380 












Tyr 


Leu 


Thr 


Thr 


Phe 


Gin 


Asp 


His 


Asn 


Leu 


Met 


Arq 


Tyr 


Arg 


Trp 


He 




385 










390 










395 










400 




Lys 


Tyr 


Asp 


Asp 


He 


Asn 


Ser 


Phe 


Tyr 


Gly 


Trp 


Ser 


Trp 


Pro 


Tyr 


Val 


55 








405 










410 










415 






Tyr 


Ala 


Lys 


Glu 


Ala 


Lys 


Asp 


Lys 


Lys 


Arg 


Arg 


Arg 


Pro 


Gin 


Val 


Ala 










420 










425 










4 30 








Leu 


Asn 


Pro 


Thr 


Asn 


Gly 


Lys 


Ala 


Cys 


Trp 


Val 


Trp 


His 


Thr 


Arg 


Lys 








435 










440 










445 








60 


Ser 


Pro 


Tyr 




Glu 


Thr 


Lys 


Pro 


His 


Pro 


Thr 


Pro 


Val 


He 


He 


Lys 






450 








455 










460 












His 


Phe 


Leu 


Trp 


Ser 


Asp 


Thr 


Glu 


Trp 


Val 


His 


Ala 


Leu 


Asp 


Val 


Gly 




465 










470 










475 










480 


65 


Asp 


Val 


Leu 


Gin 


Lys 


Glu 


Gly 


Ser 


Met 


Lys 


Leu 


Tyr 


Pro 


Asn 


Pro 


Ala 










4 85 










490 










495 






Lys 


Glu 


Tyr 


Val 


Leu 


He 


Asn 


Leu 


Pro 


Lys 


Glu 


Gly 


Gly 


His 


Glu 


Ala 










500 










505 










510 








Val 


Val 


Tyr 


Asp 


Met 


Gin 


Gly 


Arg 


He 


Val 


Glu 


Lys 


Val 


Ser 


Phe 


Ser 


70 






515 










520 










525 








Gly 


Lys 


Glu 


Tyr 


Lys 


Leu 


Asn 


Val 


Gin 


Tyr 


Leu 


Ser 


Lys 


Gly 


Thr 


Tyr 






530 










535 










540 












Met 


Leu 


Lys 


Val 


Val 


Ala 


Asp 


Thr 


Glu 


Tyr 


Phe 


Val 


Glu 


Lys 


He 


He 



545 550 555 560 

Val Glu 

75 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929e70Al_l_> 



wo 99/29870 



PCT/AU98/01023 



186/490 



10 



(2) lUFORriATION FOR SEQ ID NO: 27 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEtfGTH: 391 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(vi) ORIGIMAL SOURCE: 

(A) ORGAJIISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) WAI-IE/KEY: mis-c_f eature 
^ B) LOCATIOH 1 ... 391 

<xi) SEQUENCE DESCRIPTIOti: SEQ ID 110:270 

Gin Met Lys Arg Leu Leu Pro Phe Leu Leu Leu Ala Gly Leu Val Ala 

I 5 10 15 

Vdl Gly Asn Val Ser Ala Gin Ser Pro Arg lie Pro Gin Val Asp Val 

20 25 30 

His Thr Arg lie Ala Arg Asn Ala Arg Tyr Arg Leu Asp Lys He Ser 

35 ^0 45 

Val Pro Asp Ser Arg Gin lie Phe Asp Tyr Phe Tyr Lys Glu Glu Thr 

55 60 
He Pro Thr Lys He Gin Thr Thr Thr Gly Gly Ala He Thr Ser He 

"70 75 80 

Asp Ser Leu Phe Tyr Glu Asp Asp Arg Leu Val Gin Val Arg Tyr Phe 

85 90 95 

Asp Asn Asn Leu Glu Leu Lys Gin Ala Glu Lys Tyr Val Tyr Asp Gly 

1C*0 105 110 

Ser Lys Leu Val Leu Arg Glu He Arg Lys Ser Pro Thr Asp Glu Thr 

lis 120 125 

Pro He Lys Lys Val Ser Tyr His Tyr Leu Cys Gly Ser Asp Met Pro 

130 135 140 

Phe Glu He Thr Thr Glu Met Ser Asp Gly Tyr Phe Glu Ser His Thr 
l^S 150 155 160 

Leu Asn Tyr Leu Asn Gly Lys He Ala Arg He Asp He Met Thr Gin 

165 170 175 

Gin Asn Pro Ser Ala Glu Leu He Glu Thr Gly Arg Met Val Tyr Glu 

180 185 190 

Phe Asp Ala Asn Asn Asp Ala Val Leu Leu Arg Asp Ser Val Phe Leu 

l^S 200 205 

Pro Leu Gin Asn Lys Trp Val Glu Met Phe Thr His Arg Tyr Thr Tyr 

210 215 220 

Asp Asn Lys His Asn Cys He Arg Trp Glu Gin Asp Glu Phe Gly Thr 
225 230 235 240 

Leu Thr Leu Ala Asn Asn Phe Glu Tyr Asp Thr Thr He Pro Leu Ser 

2^5 250 255 

Ser Val Leu Phe Pro Thr His Glu Glu Phe Phe Arg Pro Leu Leu Pro 

260 265 270 

Asn Phe Met Lys His Met Arg Thr Lys Gin Thr Tyr Phe Asn Asn Ser 

275 280 285 

Gly Glu Gly Leu Ser Glu Val Cys Asp Tyr Asn Tyr Phe Tyr Thr Asp 

290 295 300 

Met Gin Gly Asn Ala Leu Thr Asp Val Ala Val Asn Glu Ser He Lys 
310 315 320 



305 



He Tyr Pro Arg Pro Ala Thr Asp Phe Leu Arg He Glu Gly Ser Gin 

325 330 335 

Leu Leu Arg Leu Ser Leu Phe Asp Met Asn Gly Lys Leu He Arg Ala 

3*10 345 350 

Thr Glu Leu Thr Gly Asp Leu Ala He He Gly Val Ala Ser Leu Pro 

355 360 365 

Arg Gly Thr Tyr He Ala Glu He Thr Ala Ala Asn Ser Lys Thr II- 

3*70 375 380 

Arg Ala Lys Val Ser Leu Arg 
385 390 

12) INFORMATIOli FOR SEQ ID HO: 271 

U) SEQUENCE CHARACTERISTICS: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I 



wo 99/29870 PCT/AU98/01023 

187/ 490 



(A) LENGTH: 428 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 tii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

{A) HAllE/KEY: misc__f eature 
( B) LOCATIOII 1 ... ^28 



(xi) SEQUENCE DESCRIPTION: SEQ ID 110:271 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



(2) INFORIIATIOIJ FOR SEQ ID NO: 272 
75 (i) SEQUENCE CHARACTERISTICS: 



Lys 


Arg 


Asn 


Pro 


Leu 


Pro 


Leu 


Thr 


Ala 


Ser 


Asn 


Arg 


Lys 


He 


Ph- 


He 


1 






5 










10 










is 




Lys 


Met 


Arg 


Gin 


His 


Leu 


Ser 


Leu 


Phe 


Pro 


Phe 


He 


Leu 


Phe 


Leu 


Leu 








2 0 










25 










30 






Leu 


Ala 


Phe 


Ser 


Tyr 


Val 


Gly 


Cys 


Arg 


Thr 


Val 


Arg 


Gin 


Thr 


Pro 


Lys 






35 










40 










45 








Gin 


Ser 


Glu 


Arg 


Tyr 


Val 


Val 


Val 


Leu 


Ser 


Leu 


Asp 


Gly 


Phe 


Arg 


Pro 




50 










55 










60 










Asp 


Tyr 


Thr 


Asp 


Arg 


Ala 


Arg 


Thr 


Pro 


Ala 


Leu 


Asp 


Arg 


Met 


Ala 


Gin 


65 










7 0 










75 










eo 


Glu 


Gly 


Leu 


Ser 


Gly 


Ser 


Leu 


Gin 


Pro 


Cys 


Phe 


Pro 


Ser 


Leu 


Thr 


Phe 










85 










90 










95 




Pro 


Asn 


His 


Tyr 


Ser 


Met 


Ala 


Thr 


Gly 


Leu 


Tyr 


Pro 


Asp 


His 


His 


Gly 








100 










105 










110 






lie 


Val 


Ala 


Asn 


Glu 


Phe 


Val 


Asp 


Ser 


Leu 


Leu 


Gly 


He 


Phe 


Arg 


He 






115 










120 










125 








Ser 


Asp 


Arg 


Lys 


Ala 


Val 


Glu 


Thr 


Pro 


Gly 


Phe 


Trp 


Gly 


Gly 


Glu 


Pro 




130 










135 










140 










Val 


Trp 


Asn 


Thr 


Ala 


Ala 


Arg 


Gin 


Gly 


He 


Arg 


Thr 


Gly 


Val 


Tyr 


Phe 


145 








150 










155 










160 


Trp 


Val 


Gly 


Ser 


Glu 


Thr 


Ala 


Val 


Asn 


Gly 


Asn 


Arg 


Pro 


Trp 


Arg 


Trp 






165 










170 










175 




Lys 


Lys 


Phe 


Ser 


Ser 


Thr 


Val 


Pro 


Phe 


Arg 


Asp 


Arg 


Ala 


Asp 


Ser 


Val 






180 










185 










190 






lie 


Ala 


Trp 


Leu 


Gly 


Leu 


Pro 


Glu 


Lys 


Glu 


Arg 


Pro 


Arg 


Leu 


Leu 


Met 






195 










200 










205 








Trp 


Tyr 


He 


Glu 


Glu 


Pro 


Asp 


Met 


He 


Gly 


His 


Ser 


Gin 


Thr 


Pro 


Glu 




210 










215 










220 










Ser 


Pro 


Leu 


Thr 


Leu 


Ala 


Met 


Val 


Glu 


Arg 


Leu 


Asp 


Ser 


Val 


Val 


Giy 


225 










230 










235 










24 0 


Tyr 


Phe 


Arg 


Lys 


Arg 


Leu 


Asp 


Ser 


Leu 


Pro 


He 


Ala 


Ala 


Gin 


Thr 


Asp 










245 










250 










255 




Phe 


He 


He 


Val 


Ser 


Asp 


His 


Gly 


Met 


Ala 


Thr 


Tyr 


Glu 


Asn 


Glu 


Lys 








260 










265 










270 






Cys 


Val 


Asn 


Leu 


Ser 


His 


Tyr 


Leu 


Pro 


Ala 


Asp 


Ser 


Phe 


Leu 


Tyr 


Met 






275 










280 










285 








Ala 


Thr 


Giy 


Ala 


Phe 


Thr 


His 


Leu 


Tyr 


Pro 


Lys 


Pro 


Ser 


Tyr 


Thr 


Glu 




290 








295 










300 










Arg 


Ala 


Tyr 


Glu 


He 


Leu 


Arg 


Ala 


He 


Pro 


His 


He 


Ser 


Val 


Tyr 


Arg 


305 










310 










315 










320 


Lys 


Gly 


Glu 


Val 


Pro 


Lys 


Arg 


Leu 


Arg 


Cys 


Gly 


Thr 


Asn 


Pro 


Arg 


Leu 










325 










330 










335 




Gly 


Glu 


Leu 


Val 


Val 


lie 


Pro 


Asp 


He 


Gly 


Ser 


Thr 


Val 


Phe 


Phe 


Ala 






340 










345 










350 






lie 


Asn 


Glu 


Asp 


Val 


Arg 


Pro 


Gly 


Ala 


Ala 


His 


Gly 


Tyr 


Asp 


Asn 


Gin 






355 










360 










365 








Ala 


Pro 


Glu 


Met 


Arg 


Ala 


Leu 


Leu 


Arg 


Ala 


Val 


Gly 


Pro 


Asp 


Phe 


Arg 




370 










375 










380 










Pro 


Gly 


Ser 


Arg 


Val 


Glu 


Asn 


Leu 


Pro 


Asn 


He 


Thr 


He 


Tyr 


Pro 


Leu 


385 






390 










395 










4 00 


He 


Cys 


Arg 


Leu 


Leu 


Gly 


He 


Glu 


Pro 


Ala 


Pro 


Asn 


Asp 


Ala 


Asp 


Giu 










405 










410 










415 




Thr 


Leu 


Leu 


Asn 


Gly 


Leu 


He 


Arg 


Asp 


Lys 


Arg 


Pro 
















420 










425 

















SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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(A) LENGTH: 282 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAHISH: Porphyromonas gingivali^ 

(ix) FEATURE; 

(A) tJAIlE/KEY: ini sc_f eat ure 

(B) LOCATION 1 . . .282 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272 



Leu Ser Arg Gly Val Phe Pro Leu Met Tyr Gly Arg Arg Gly Ser II « 
IS 10 15 

Arg Ala Ser Ser Gly His Arg Asp Lys He Phe hys Asn Thr lie He 

^0 25 30 

Arg Phe He Thr Met Lys Val Gly Leu Phe He Pro Cys Tyr Val Asn 

35 40 45 

Ala Val Tyr Pro Glu Val Gly He Ala Thr Tyr hys Leu Leu Lys Ser 

50 55 60 

Leu Asp He Asp Val Asp Tyr Pro Met Asp Gin Thr Cys Cys Gly Gin 

~fO 75 80 

Fro net Ala Asn Ala Gly Phe Glu Gin Lys Ala Gin Lys Leu Ala Leu 

85 90 95 

Arg Phe Glu Glu Leu Phe Glu Ser Tyr Asp Val Val Val Gly Pro Ser 

iOO 105 110 

Ala Ser Cys Val Ala Phe Val Lys Glu Asn Tyr Asp His He Leu Arg 

115 120 125 

Pro Thr Gly His Val Cys Lys Ser Ala Ala Lys Val Arg Asp He Cys 
35 130 135 140 

Glu Phe Leu His Asp Asp Leu Lys He Thr Ser Leu Pro Ser Arg Ph^ 
150 155 160 

Ala His Lys Val Ser Leu His Asn Ser Cys His Gly Val Arg Glu Leu 
.r. 1-65 170 175 

His Leu Ser Thr Pro Ser Glu Val His Arg Pro Tyr His Asn Lys Val 

180 185 190 

Arg Arg Leu Leu Glu Met Val Gin Gly He Glu Val Phe Glu Pro Lys 

195 200 205 

Arg He Asp Glu Cys Cys Gly Phe Gly Gly ll-et Tyr Ser Val Glu Glu 
45 210 215 220 

Pro Glu Val Ser Thr Cys Met Gly His Asp Lys Val Leu Asp His He 
225 230 235 240 

Ser Thr Gly Ala Glu Tyr He Thr Gly Pro Asp Ser Ser Cys Leu Met 
cn 245 250 255 

OU His Met Gin Gly Val He Asp Arg Glu Lys Leu Pro He Lys Thr He 
260 265 270 

His Ala Val Glu He Leu Ala Ala Asn Leu 
275 280 

55 (2) INFORMATION FOR SEQ ID NO: 27 3 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1 . . .251 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273 
Pro Leu Lys Lys Arg Met Asp He Val Ser Met Ala Asp Lys Ala Leu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID; <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



189/ 490 



50 



55 



60 



65 



70 



75 



1 








5 










10 
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Val 


Val 


Gl U 


net 


Arg 


Asp 


Val 


Thr 


Leu 


Cys 


Gin 


Glu 


Glu 


Asn 


Val 


He 








20 










25 










30 






Phe 


Gin 


Asn 


Leu 


Asn 


Leu 


Thr 


Leu 


Ser 


Ala 


Gly 


Asp 


Phe 


Val 


Tyr 


Leu 






35 










40 










45 








He 


Gly 


Ser 


Val 


Gly 


Ser 


Gly 


Lys 


Ser 


Thr 


Leu 


Leu 


Lys 


Ala 


Leu 


Tyr 




50 










55 










60 










Ala 


Glu 


Val 


Pro 


He 


Ser 


Ala 


Gly 


Tyr 


Ala 


Arg 


Val 


He 


Asp 


Tyr 


Asp 


65 










70 










7 5 










80 


Leu 


Ala 


Lys 


Leu 


Lys 


Arg 


Lys 


Gin 


Leu 


Pro 


Tyr 


Leu 


Arg 


Arg 


Asn 


Leu 










86 










90 










95 




Gly 


He 


Val 


Phe 


Gin 


Asp 


Phe 


Gin 


Leu 


Leu 


Asn 


Gly 


Arg 


Thr 


Val 


Ala 








100 










105 










110 






Glu 


Asn 


Leu 


A^p 


Phe 


Val 


Leu 


Arg 


Ala 


Thr 


Asp 


Trp 


Lys 


Asn 


Arg 


Ala 






115 










120 










125 








Asp 


Arg 


Glu 


Gin 


Arg 


He 


Giu 


Glu 


Val 


Leu 


Thr 


AL-q 


Val 


Gly 


Met 


Ser 




130 










135 










140 










Arg 


Lys 


Ala 


Tyr 


Lys 


Arg 


Pro 


His 


Glu 


Leu 


Ser 


Gly 


Gly 


Glu 


Gin 


Gin 


146 










150 










166 










160 


Arg 


Val 


Gly 


He 


Ala 


Arg 


Ala 


Leu 


Leu 


Ala 


Lys 


Pro 


Ala 


Leu 


He 


Leu 










165 










170 










175 




Ala 


Asp 


Glu 


Pro 


Thr 


Gly 


Asn 


Leu 


Asp 


Ser 


Val 


Thr 


Gly 


Leu 


Gin 


He 








180 










185 










190 






Ala 


Ser 


Leu 


Leu 


Tyr 


Glu 


He 


Ser 


Lys 


Gin 


Gly 


Thr 


Ala 


Val 


Leu 


Met 






195 










200 










205 








Ser 


Thr 


His 


Asn 


Ser 


Ser 


Leu 


Leu 


Ser 


His 


Leu 


Pro 


Ala 


Arg 


Thr 


Leu 




210 










215 










220 










Ala 


Val 


Arg 


Lys 


Asn 


Gly 


Asp 


Ala 


Ser 


Ser 


Leu 


Val 


Glu 


Leu 


Ser 


Ala 


1 o r 










230 










235 










24 0 


Asp 


Ala 


Val 


Ser 


Arg 


Lys 


Asn 


Thr 


Glu 


He 


Asp 
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15 



20 



25 



30 

245 

(2} IMFORtlATION FOR SEQ ID nO:27 4 

35 (1} SEQUEHCE CHARACTERISTICS: 

(A) LEifGTH: 238 amino acids 

( B ) TYPE : amino acid 
(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

( A ) NAI IE/ K'E 1 : mi s c_f ea t u r e 

(B) LOCATION 1. , .238 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 4 
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Lys 


Thr 


1 








5 










10 










15 




Ser 


Lys 


Thr 


Met 


He 


Glu 


He 


Ser 


Asn 


Leu 


Thr 


Lys 


Val 


Phe 


Arg 


Thr 








20 










2 5 










30 






Glu 


Glu 


He 


Glu 


Thr 


Val 


Ala 


Leu 


Asp 


Gly 


Val 


Ser 


Leu 


Lys 


Val 


Asp 






35 










40 










45 








Lys 


Gly 


Glu 


Phe 


He 


Ala 


He 


Met 


Gly 


Pro 


Ser 


Gly 


Cys 


Gly 


Lys 


Ser 




50 










55 










60 










Thr 


Leu 


Leu 


Asn 


He 


Leu 


Gly 


Leu 


Leu 


Asp 


Asn 


Pro 


Thr 


Ser 


Gly 


He 


65 










70 










75 










80 


Tyr 


Lys 


Leu 


Asp 


Gly 


Ala 


Glu 


Val 


Gly 


Asn 


Leu 


Arg 


Glu 


Lys 


Asp 


Arg 










85 










90 










95 




Thr 


Ala 


Val 


Arg 


Lys 


Gly 


Asn 


He 


Gly 


Phe 


Val 


Phe 


Gin 


Ser 


Phe 


Asn 








100 










105 










110 






Leu 


He 


Glu 


Glu 


Met 


Thr 


Val 


Ser 


Glu 


Asn 


Val 


Glu 


Leu 


Pro 


Leu 


Val 






115 










120 










125 








Tyr 


Leu 


Gly 


Val 


Lys 


Ala 


Ser 


Glu 


Arg 


Lys 


Glu 


Arg 


Val 


Glu 


Glu 


Ala 




130 










135 










140 










Leu 


Arg 


Lys 


Met 


Ser 


He 


Ser 


His 


Arg 


Ala 


Gly 


His 


Phe 


Pro 


Asn 


Gin 


145 










150 








155 










160 


Leu 


Ser 


Gly 


Gly 


Gin 


Gin 


Gin 


Arg 


Val 


Ala 


He 


Ala 


Arg 


Ala 


Val 


Val 










165 










170 










175 




Ala 


Asn 


Pro 


Lys 


Leu 


He 


Leu 


Ala 


Asp 


Glu 


Pro 


Thr 


Gly 


Asn 


Leu 


Asp 
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10 



15 



WO 99/29870 PCT/AU98/01023 

190/490 

180 185 190 

Ser Lys Asn Gly Ala Asp Val llet Glu Leu Leu Arg Gly Leu Asn Arg 

195 200 205 

Glu Gly Ala Thr lie Val Met Val Thr His Ser Glu His Asp Ala Arg 

210 215 220 

Ser Ala Gly Arg lie lie Asn Leu Phe Asp Gly Lys He Arg 
225 230 235 

{2) III FORI lAT I ON FOR SEQ ID 110:275 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 604 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) N/yiE/KEY: mi sc_ feature 
25 {B) LOCATION 1...60^ 

{:-:i} SEQUENCE DESCRIPTION: SEQ ID NO:275 

Ser Thr Glu Thr Asn Ser Lys Ser Glu Met Lys Glu Phe Phe Lys llet 
30 1 5 10 15 

Phe Phe Ala Ser He Leu Gly Val He Thr Ala Gly He He Leu Ph- 

20 25 30 

Cys He Phe Leu Phe He Phe Phe Gly He Val Ala Gly He Ala S-r 

35 40 45 

Lys Ala Thr Gly Gly Thr He Pro Lys He Glu Ala Asn Ser He L-u 

SO 55 60 

His He Unk Asn Ser Ser Phe Pro Glu He Val Ser Ala Asn Pro Trp 
65 70 75 80 

Ser Met Leu Thr Gly Lys Asp Glu Ser Val Ser Leu Ser Gin Ala Val 

85 90 95 

Glu Ala He Gly Gin Ala Lys Asn Asn Pro Asn He Thr Gly He Phe 

100 105 110 

Leu Asp Leu Asp Asn Leu Ser Val Gly Met Ala Ser Ala Glu Glu Leu 

115 120 125 

Arg Arg Ala Leu Gin Asp Phe Lys Met Ser Gly Lys Phe Val Val S^r 

130 135 140 

Tyr Ala Asp Arg Tyr Thr Gin Lys Gly Tyr Tyr Leu Ser Ser He Ala 
I'^S 150 155 160 

Asp Lys Leu Tyr Leu Asn Pro Lys Gly Met Leu Gly Leu He Gly He 

165 170 175 

Ala Thr Gin Thr Met Phe Tyr Lys Asp Ala Leu Asp Lys Phe Gly Val 

180 185 190 

Lys Met Glu He Phe Lys Val Gly Thr Tyr Lys Ala Ala Val Glu Pro 

195 200 205 

Phe Met Leu Asn Arg Met Ser Asp Ala Asn Arg Glu Gin He Thr Thr 

210 215 220 

Tyr He Asn Gly Leu Trp Asp Lys He Thr Ser Asp He Ala Glu Ser 
225 230 235 240 

Arg Lys Thr Ala Met Asp Ser Val Lys Met Phe Ala Asp Lys Gly Glu 

245 250 255 

Met Phe Gly Leu Ala Glu Lys Ala Val Glu Met Lys Leu Val Asp Giu 

260 265 270 

Leu Ala Tyr Arg Thr Asp Val Glu Lys Glu Leu Lys Lys Met Ser Gin 

275 280 285 

Arg Gly Glu Lys Asp Glu Leu Arg Phe Val Ser Leu Ser Gin Val Leu 

290 295 300 

Ala Asn Gly Pro Met Asn Lys Thr Lys Gly Ser Ara He Ala Val L-u 
305 310 315 ' 320 

Phe Ala Glu Gly Glu He Thr Glu Glu He He Lys Lys Pro Phe Asp 

325 330 335 

Thr Asp Gly Ser Ser He Thr Gin Glu Leu Ala Lys Glu He Lys Ala 

3^0 345 350 

Ala Ala Asp Asp Asp Asp He Lys Ala Val Val Leu Arg Val Asn Ser 

355 360 365 

Pro Gly Gly Ser Ala Phe Thr Ser Glu Gin He Trp Lys Gin Val Ala 
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35 



40 



45 



50 



55 



60 



65 



70 



75 
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10 



15 



20 



25 



30 



50 



55 



60 



65 



70 



75 





370 










375 










380 










Asp 


Leu 


Lys 


Ala 


Lys 


Lys 


Pro 


He 


Val 


Val 


Ser 


Met 


Gly 


Asp 


Val 


Ala 


385 










390 










395 










400 


Ala 


Ser 


Gly 


Gly 


Tyr 


Tyr 


He 


Ala 


Cys 


Ala 


Ala 


Asn 


Ser 


He 


Val 


Ala 








4 05 










410 










415 




Glu 


His 


Thr 


Thr 

A20 


Leu 


Thr 


Gly 


Ser 


He 
425 


Gly 


He 


Phe 


Gly 


Met 
4 30 


Phe 


Pro 


Asn 


Phe 


Ala 
435 


Gly 


Val 


Ala 


Lys 


Lys 
440 


He 


Gly 


Val 


Asn 


Met 
445 


Asp 


Val 


Val 


Gin 


Thr 
450 


Ser 


Lys 


Tyr 


Ala 


Asp 
455 


Leu 


Gly 


Asn 


Thr 


Phe 
460 


Ala 


Pro 


Met 


Thr 


Val 


Glu 


Asp 


Arg 


Ala 


Leu 


He 


Gin 


Arg 


Tyr 


He 


Glu 


Gin 


Gly 


Tyr 


Asp 


465 










470 










475 










480 


Leu 


Fhe 


Leu 


Thr 


Arg 
485 


Val 


Ser 


Glu 


Gly 


Arg 
4 90 


Asn 


Arg 


Thr 


Lys 


Ala 
495 


Gin 


He 


Asp 


Ser 


He 
500 


Ala 


Gin 


Gly 


Arg 


Val 
505 


Trp 


Leu 


Gly 


Asp 


Lys 
510 


Ala 


Leu 


Ala 


Leu 


Gly 
515 


Leu 


Val 


Asp 


Glu 


Leu 
520 


Gly 


Gly 


Leu 


Asp 


Thr 


Ala 


He 


Lys 


Arg 


Ala 
530 


Ala 


Lys 


Leu 


Ala 


Gin 
535 


Leu 


Gly 


Gly 


Asn 


Tyr 
54 0 


Ser 


He 


Glu 


Tyr 


Gly 


L^'£ 


Thr 


Lys 


Arg 


Asn 


Phe 


Phe 


Glu 


Glu 


Leu 


Leu 


Ser 


Ser 


Ser 


Ala 












550 










555 










560 


Ala 


Asp 


Met 


Lys 


Ser 
565 


Ala 


He 


Leu 


Ser 


Thr 

570 


He 


Leu 


Ser 


Asp 


Pro 
57 5 


Glu 


He 


Glu 


Val 


Leu 
580 


Arg 


Glu 


Leu 


Arg 


Ser 
585 


Ilet 


Pro 


Pro 


Arg 


Pro 
590 


Ser 


Gly 


He 


Gin 


Ala 
595 


Arg 


Leu 


Pro 


Tyr 


Tyr 
600 


Phe 


Met 


Pro 


Tyr 











(2) IIIFORIIATIOII FOR SEQ ID lJO:27 6 



(i) SEQUEMCE CHARACTERISTICS: 

(A) LEIJGTH: 324 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

40 (iii) HYPOTHETICAL: YES 

(vi) ORIGIMAL SOURCE: 

(A) ORGAMISM: Porphyromonas gingival is 

45 (ix) FEATURE: 

(A) IIAIJE/KEY: misc_feature 

(B) LOCATIOM 1 ... 324 



{;-:i) SEQUEMCE DESCRIPTION: SEQ ID IJO:276 



Leu 


Thr 


Leu 


Phe 


Cys 


Cys 


Asn 


Tyr 


Phe 


Lys 


Gin 


Met 


Arg 


Ala 


Asn 


He 


1 








5 










10 










15 




Trp 


Gin 


He 


Leu 


Ser 


Val 


Ser 


Val 


Leu 


Phe 


Phe 


Phe 


Gly 


Thr 


Ala 


He 








20 










25 










30 






Gly 


Gin 


Ala 


Gin 


Ser 


Arg 


Asn 


Arg 


Thr 


Tyr 


Glu 


Ala 


Tyr 


Val 


Lys 


Gin 






35 










40 










45 








Tyr 


Ala 


Asp 


Glu 


Ala 


He 


Arg 


Gin 


Met 


Ser 


Arg 


Tyr 


Asn 


He 


Pro 


Ala 




50 










55 










60 










Ser 


He 


Thr 


He 


Ala 


Gin 


Ala 


Leu 


Val 


Glu 


Thr 


Gly 


Ala 


Gly 


Ala 


Ser 


65 










70 










75 










80 


Thr 


Leu 


Ala 


Ser 


Val 


His 


Asn 


Asn 


His 


Phe 


Gly 


He 


Lys 


Cys 


His 


Lys 










85 










90 










95 




Ser 


Trp 


Thr 


Gly 


Lys 


Arg 


Thr 


Tyr 


Arg 


Thr 


Asp 


Asp 


Ala 


Pro 


Asn 


Glu 








100 










105 










110 






Cys 


Phe 


Arg 


Ser 


Tyr 


Ser 


Ala 


Ala 


Arg 


Glu 


Ser 


Tyr 


Glu 


Asp 


His 


Ser 






115 










120 










125 








Arg 


Phe 


Leu 


Leu 


Gin 


Pro 


Arg 


Tyr 


Arg 


Pro 


Leu 


Phe 


Lys 


Leu 


Asp 


Arg 




130 










135 










140 










Glu 


Asp 


Tyr 


Arg 


Gly 


Trp 


Ala 


Thr 


Gly 


Leu 


Gin 


Arg 


Cys 


Gly Tyr 


Ala 


145 










150 










155 










160 


Thr 


Asn 


Arg 


Gly 


Tyr 


Ala 


Asn 


Leu 


Leu 


He 


Lys 


Met 


Val 


Glu 


Leu 


Tyr 










165 










170 










175 




Glu 


Leu 


Tyr 


Ala 


Leu 


Asp 


Arg 


Glu 


Lys 


Tyr 


Pro 


Ser 


Trp 


Phe 


His 


Lys 








180 










185 










190 






Ser 


Tyr 


Pro 


Gly 


Ser 


Asn 


Lys 


Lys 


Ser 


His 


Gin 


Thr 


Thr 


Lys 


Gin 


Lys 
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10 



15 



20 



25 



45 



50 



55 



60 



65 







195 










200 










205 








Gin 


Ser 


Gly 


Leu 


Lys 


His 


Glu 


Ala 


Tyr 


Phe 


Ser 


Tyr 


Gly 


Leu 


Leu 


Tyr 




210 










215 










220 






He 


He 


Ala 


Lys 


Gin 


Gly 


Asp 


Thr 


Phe 


Asp 


Ser 


Leu 


Ala 


Glu 


Glu 


Phe 


225 










230 










235 










240 


Asp 


Met 


Arg 


Ala 


Ser 


Lys 


Leu 


Ala 


Lys 


Tyr 


Asn 


Asp 


Ala 


Pro 


Val 


Asp 


Phe 








245 










250 










255 


Pro 


He 


Glu 


Lys 


Gly 


Asp 


Val 


He 


Tyr 


Leu 


Glu 


Lys 


Lys 


His 


Ala 








260 










265 








270 






Cys 


Ser 


He 


Ser 


Lys 


His 


Thr 


Gin 


His 


Val 


Val 


Arg 


Val 


Gly 


Asp 


Ser 






275 










280 










285 




Met 


His 


Ser 


He 


Ser 


Gin 


Arg 


Tyr 


Gly 


He 


Arg 


Met 


Lys 


Asn 


Leu 


Tyr 




290 










295 










300 






Lys 


Leu 


Asn 


Asp 


Lys 


Asp 


Gly 


Gl u 


Tyr 


He 


Pro 


Gin 


Glu 


Gly 


Asp 


He 


305 










310 










315 








320 


Leu 


Arg 


Leu 


Arg 



























(2) INFOKMATIOIJ FOR SEQ ID nO:277 

(i) SEQUEUCF: CHARACTERISTICS: 

(A) LENGTH: 533 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

30 (vi) ORIGIHAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

(ix) FEATURE: 
^ (A) NAME/KEY: misc_feature 

»55 (B) LOCATION 1...533 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:277 

Arg He Pro Asp Glu Gin Thr Gly Arg He Met Asp Gly Arg Arg Tyr 

1 5 10 15 

Ser Asp Gly Leu His Gin Ala He Glu Ala Lys Glu His Val Lys Val 

20 25 30 

Glu Ala Ala Thr Gin Thr Phe Ala Thr He Thr Leu Gin Asn Tyr Phe 

35 40 45 

Arg Met Tyr His Lys Leu Ala Gly Met Thr Gly Thr Ala Glu Thr Glu 

SO 55 
Ala Gly Glu Leu Trp Asp He Tyr Lys Leu Asp Val Val Val He Pro 

"^0 75 80 

Thr Asn Lys Pro He Ala Arg Lys Asp Met Asn Asp Arg He Tyr Lys 

85 90 95 

Thr Ala Arg Glu Lys Tyr Ala Ala Val He Glu Glu He Val Arg Leu 

iOO 105 110 

Val Glu Glu Gly Arg Pro Val Leu Val Gly Thr Thr Ser Val Glu H- 

115 120 125 

Ser Glu Leu Leu Ser Arg Met Leu Arg Leu Arg Gly He Gin His Asn 

130 135 
Val Leu Asn Ala Lys Leu His Gin Lys Glu Ala Glu He Val Ala Gin 
1^0 155 l^;o 

Ala Gly Gin Lys Gly Thr Val Thr He Ala Thr Asn Met Ala Gly Arg 

165 170 175 

Gly Thr Asp He Lys Leu Ser Ala Glu Val Lys Lys Ala Gly Gly Leu 

180 185 190 

Ala He He Gly Thr Glu Arg His Glu Ser Arg Arg Val Asp Arg Gin 

195 200 205 

Leu Arg Gly Arg Ser Gly Arg Gin Gly Asp Pro Gly Ser Ser He Ph- 

215 220 
Tyr Val Ser Leu Glu Asp His Leu Met Arg Leu Phe Ala Thr Glu Lys 
— ^ 230 235 2^0 

He Ala Ser Leu Met Asp Arg Leu Gly Phe Lys Glu Gly Glu Val Leu 



70 



24e 



255 



Glu Asn Asn Met Leu Ser Lys Ser Val Glu Arg Ala Gin Lys Lys Val 
260 265 270 

Glu Glu Asn Asn Phe Gly He Arg Lys His Leu Leu Glu Tyr Asp Asp 

7C 275 280 285 

/O \al Met Asn Ser Gin Arg Glu Val He Tyr Thr Arg Arg Arg His Ala 
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290 295 300 

Leu Met Gly Glu Arg lie Gly Met Asp Val Leu Asn Thr He Tyr Asp 
305 310 315 320 

Val Cys Lys Ala Leu He Asp Asn Tyr Ala Glu Ala Asn Asp Phe Glu 
5 325 330 335 

Gly Pile Lys Glu Asp Leu Met Arg Ala Leu Ala He Glu Ser Pro He 

3'10 345 350 

Thr Gin Glu He Phe Arg Gly Lys Lys Ala Glu Glu Leu Thr Asp Met 
355 360 365 

10 Leu Phe Asp Glu Ala Tyr Lys Ser Phe Gin Arg Lys Met Asp Leu He 
370 375 380 

Ala Glu Val Ala His Pro Val Val His Gin Val Phe Glu Thr Gin Ala 
385 390 395 400 

Ala Val Tyr Glu Arg He Leu He Pro He Thr Asp Gly Lys Arg Val 
15 405 410 415 

Tyr Asn He Gly Cys Asn Leu Arg Glu Ala Asp Glu Thr Gin Gly Lys 

420 425 430 

Ser He He Lys Glu Phe Glu Lys Ala He Val Leu His Thr He Asp 
435 440 445 

20 Glu Ser Trp Lys Glu His Leu Arg Glu Met Asp Glu Leu Arg Asn Ser 
450 455 460 

Val Gin Asn Ala Ser Tyr Glu Asn Lys Asp Pro Leu Leu He Tyr Lys 
465 470 475 480 

Leu Glu Ser T:,-r Glu Leu Phe Arg Lys Met Val Glu Ala Met Asn Arg 
25 4 85 4 90 4 95 

Lys Thr Val Ala He Leu Met Arg Ala Arg He Pro Val Pro Glu Ala 

500 505 510 

Pro Ser Gin Glu Glu Leu Glu His Arg Arg Gin He Glu He Arg His 
515 520 525 

30 Ala Thr Gin Gin Arg 
530 

(2) I H FORMAT I on FOR SEQ ID NO: 27 8 

35 (i) SEQUEMCF. CHARACTERISTICS: 

(A) LEMGTH: 7 20 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 

40 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIUAL SOURCE: 
45 (A) ORGAttISM: Po rphyromona s gingivalis 

(ix) FEATURE; 

(A) HAIIE/KEY: misc_feature 

(B) LOCATION 1 . . .720 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID lJO:278 





Lys 


Ser 


Cys 


Arg 


Val 


He 


Gly 


Gin 


Thr 


Arg 


Arg 


Tyr 


Gly 


cys 


Cys 


Pro 




1 








5 










10 










15 




55 


Val 


Gly 


Leu 


Arg 


Pro 


cys 


His 


Gin 


Ser 


Leu 


Ser 


Gly 


Val 


cys 


Arg 


Cys 








20 










25 










30 








Leu 


Phe 


Gin 
35 


Ser 


Arg 


Pro 


Ala 


Val 
40 


Ala 


Phe 


Ala 


Arg 


Lys 
45 


Gly 


Gin 


Arg 


60 


Arg 


His 


Arg 


Arg 


Ser 


Glu 


Ser 


Gly 


Arg 


Arg 


He 


Arg 


Ala 


Leu 


Gin 


Gly 




50 










55 










60 












Val 


Gin 


His 


His 


Gin 


Thr 


Asn 


Glu 


His 


Glu 


Val 


Met 


He 


Ser 


Va 1 


Asn 




65 










70 










75 










80 




Asn 


Leu 


Thr 


Va 1 


Asp 


Phe 


Gly 


Thr 


Arg 


Leu 


Leu 


Phe 


Asp 


Gin 


Val 


Ser 


65 










85 










90 










95 




Phe 


Val 


He 


Asn 
100 


Arg 


Arg 


Asp 


Arg 


He 
105 


Ala 


Leu 


Val 


Gly 


Lys 
110 


Asn 


Gly 




Ala 


Gly 


Lys 
115 


Ser 


Thr 


Leu 


Leu 


Lys 
120 


Leu 


He 


Ala 


Gly 


Met 
125 


Glu 


Glu 


Pro 


70 


Thr 


Ser 


Gly 


His 


He 


Ala 


Arg 


Pro 


Lys 


Gly 


He 


Arg 


He 


Gly 


Tyr 


Leu 




130 










135 










140 












Pro 


Gin 


Val 


Met 


Arg 


Leu 


Gin 


Asp 


Gly 


His 


Thr 


Val 


Tyr 


Glu 


Glu 


Val 




145 










150 










155 










160 




Glu 


Gin 


Ala 


Phe 


Asn 


Asp 


He 


Arg 


Gin 


He 


Glu 


Glu 


Glu 


He 


Arg 


Arg 


75 










165 










170 










175 




Leu 


Ser 


Asp 


Glu 


Met 


Ala 


Gly 


Arg 


Thr 


Asp 


Tyr 


Glu 


Ser 


Asp 


Asp 


Tyr 
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lie Arg Leu He Glu His Tyr Thr Asn Met Ser Glu Thr Leu Ser Leu 

195 200 205 

Met Gin Gin Gly Asn Tyr His Ala Ala He Glu Gin Thr Leu He Gly 

210 215 220 

Leu Gly Phe Gly Arg Glu Asp Fhe His Arg Pro Thr Ala Asp Phe Ser 
225 230 235 240 

Gly Gly Trp Arg Met Arg lie Glu Leu Ala Lys Leu Leu Leu Gin Arg 

245 250 255 

Pro Glu Val Leu Leu Leu Asp Glu Pro Thr Asn His Leu Asp He Glu 

260 265 270 

Ser He Gly Trp Leu Glu Gin Phe He Ala Thr Asn Ala Gly Ala Val 

275 280 285 

He Leu Val Ser His Asp Arg Ala Phe He Asp Asn Thr Thr Thr Arg 

290 295 300 

Thr He Glu He Glu Leu Gly His He Tyr Asp Tyr Lys Thr Asn Tyr 
305 310 315 320 

Ser His Tyr Val Glu Leu Arg Glu Glu Arg Leu Arg Gin Gin Met Arg 

325 330 335 

Ala Tyr Glu Asn Gin Gin Lys I let He Arg Asp Thr Glu Asp Fhe He 

3^0 34S 350 

Glu Arg Phe Arg Tyr Lys Ala Thr Lys Ser Val Gin Val Gin Ser Arg 

355 360 365 

He Lys Gin Leu Glu Lys Val Glu Arg Val Glu He Asp Glu Ara Asp 

370 375 380 

Arg Ser Ala Phe His Phe Arg Phe He Pro Ala Gin Pro Ser Gly Ser 
385 390 395 ^00 

Tyr Pro Leu He Val Asp Asp Leu Ala Lys Ala Tyr Gly Asp His Gin 

-305 ^10 415 

Val Phe Ser Gly Ala Thr Tyr Thr He Glu Arg Gly Glu Lys Val Ala 

420 425 ^30 

Phe Val Gly Lys Asn Gly Ala Gly Lys Ser Thr Met Val Lys Cys He 

435 440 44b 

1-iet Gly Glu Leu Thr Asp Tyr Thr Gly Lys Leu Glu Leu Gly His Asn 

450 455 460 

Val Gin Leu Gly Tyr Phe Ala Gin Asn Glu Ala Gin Glu Leu Arg Gly 
465 470 475 480 

Asp Leu Thr Val Phe Asp Thr He Asp Arg Glu Ala Val Gly Asp He 

4 85 4 90 4 95 

Arg Leu Arg Leu Asn Asp Leu Leu Gly Ala Phe Leu Phe Gly Gly Glu 

500 505 510 

Ala Ser Glu Lys Lys Val Ser Val Leu Ser Gly Gly Glu Arg Ala Ara 

515 520 525 

Leu Ala He He Arg Leu Leu Leu Gin Pro Ala Asn Phe Leu He Leu 

530 535 540 

Asp Glu Pro Thr Asn His Leu Asp Met Arg Ser Lys Asp Val Leu Lys 
545 550 555 560 

Glu Ala He Lys Asn Phe Asp Gly Thr Val He Val Val Ser His Asp 

565 570 575 

Arg Glu Phe Leu Asp Gly Leu Val Ser Lys Val Tyr Glu Phe Ala Asp 

580 585 590 

Gly Gin Val Asn Glu His Leu Gly Gly He Tyr Asp Tyr Leu Arg Thr 

595 600 605 

Arg Arg Met Gin Thr Leu Thr Glu Leu Glu Arg Thr Thr Thr He Glu 

610 615 620 

Tlir Lys Thr Thr Arg Glu Ala He Pro Glu Thr Glu Ala Lys Ala Asp 
^25 630 635 640 

Tyr Arg Arg Gin Lys Glu Val Ala Lys Gin Leu Arg Thr Leu Glu Arg 

645 650 655 

Thr Val Ala Thr Cys Glu Glu Arg He Gly Lys Leu Glu Ser Glu Leu 

660 665 670 

Gin Ala He Glu Met Leu Leu Gin Asp Pro Lys His Ala Thr Asp Ala 

675 680 685 

Asn Leu Phe Glu Arg Tyr Ala Gly Met Lys Gin Glu Leu Glu Lys Ala 

690 695 700 

Met Glu Asp Trp Glu Gin Ala Ser Glu Ala Leu Ser Glu Ala Gin Gly 
705 710 715 720 

(2) lUFORMATIOIJ FOR SEQ ID i:0:27 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 386 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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10 



(in MOLCCULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAl^lISM: Porphyromonas gingival i£ 

(i:-:) FEATURE: 

(A) IWIE/KEV: misc_feature 
{ B) LOCATIOIJ 1 ... 386 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Tyr 


Leu 


Lys 


Leu 


Leu 


He 


Leu 


Gin 


He 


Ala 


Leu 


Met 


Asn 


Phe 


Leu 


Lys 


1 








5 










10 










15 




Lys 


Glu 


Pro 


Phe 


Lys 


He 


Phe 


Ser 


Met 


He 


Tyr 


Leu 


Leu 


Leu 


Asp 


Thr 








20 










25 










30 






He 


Thr 


Asn 


Arg 


Ala 


Gly 


Thr 


Glu 


Arg 


Ala 


Val 


He 


Asn 


Leu 


Ala 


Asn 






35 










40 










45 








Asn 


Leu 


His 


Ala 


Asn 


Gly 


His 


Arg 


Val 


Ser 


Leu 


Val 


Ser 


Val 


Cys 


Thr 




50 










55 










60 










Lys 


Glu 


Gly 


Glu 


Pro 


Ser 


Phe 


Gin 


Val 


Glu 


Lys 


Gly 


He 


Glu 


Val 


His 


65 










70 










75 










80 


His 


Leu 


Gly 


He 


Arg 


Leu 


Tyr 


Gly 


Asn 


Ala 


Leu 


Ala 


Arg 


Lys 


Thr 


Val 










85 










90 










95 




Tyr 


Phe 


Lys 


Ala 


Tyr 


Arg 


Arg 


He 


Lys 


Ala 


Leu 


Tyr 


Lys 


Lys 


Arg 


Glu 








100 










105 










110 






Pro 


Val 


Leu 


Leu 


He 


Gly 


Thr 


Asn 


He 


Phe 


li- 


Asn 


Thr 


He 


Leu 


Ser 






115 










120 










125 








Gin 


He 


Ser 


Asn 


Arg 


Gly 


Arg 


He 


Phe 


Thr 


ne 


Gly 


Cys 


Glu 


His 


He 




130 










135 










I'lO 










Ser 


Tyr 


Asp 


He 


Ala 


Arg 


Pro 


He 


Thr 


Lys 


Arg 


He 


Arg 


Gly 


Phe 


Leu 


145 










150 










155 










160 


Tyr 


Ser 


Gly 


Leu 


Asp 


Ala 


Val 


Val 


Ala 


Leu 


Thr 


Lys 


Arg 


Asp 


Gin 


Gin 










165 










170 










175 




Ser 


Phe 


Glu 


Ala 


He 


Leu 


Arg 


Gly 


Arg 


Ser 


Lys 


Ala 


Tyr 


Val 


He 


Pro 








180 










185 










190 






Asn 


Gin 


Val 


Ser 


Phe 


Thr 


Thr 


Val 


Gin 


Arg 


Asp 


Ala 


Thr 


Thr 


His 


Lys 






195 










200 






205 






Gin 


Met 


Leu 


Ala 


He 


Gly 


Arg 


Leu 


Thr 


Tyr 


Gin 


Lys 


Gly 


Phe 


Glu 


Phe 




210 










215 










220 










Met 


He 


Glu 


Asp 


Ala 


Ser 


Arg 


Val 


Leu 


Arg 


Glu 


Arg 


Pro 


Asp 


Trp 


Lys 


225 










230 










235 










240 


Leu 


He 


He 


Val 


Gly 


Asp 


Gly 


Glu 


Asn 


Glu 


Ser 


Met 


Leu 


Arg 


Lys 


Glu 










245 










250 










255 




He 


Ala 


Ser 


Arg 


Asn 


Met 


Glu 


Ser 


Gin 


He 


Glu 


He 


His 


Pro 


Ser 


Thr 








260 










265 










270 






Pro 


Glu 


He 


Arg 


Lys 


Tyr 


Tyr 


Glu 


Ser 


Ser 


Ala 


He 


Tyr 


Leu 


Met 


Thr 






275 










280 










285 








Ser 


Arg 


Phe 


Glu 


Gly 


Leu 


Pro 


Met 


Val 


Leu 


Leu 


Glu 


Ala 


Glu 


Ala 


Tyr 




290 










295 










300 










Ala 


Leu 


Pro 


He 


He 


Ser 


Tyr 


Asp 


Cys 


Pro 


Thr 


Gly 


Pro 


Arg 


Glu 


Leu 


305 










310 










315 










320 


He 


Glu 


Asn 


Gly 


Arg 


Asn 


Gly 


Phe 


Leu 


Val 


Pro 


Met 


Glu 


Ala 


His 


Glu 










325 










330 










335 




Asp 


Phe 


Ala 


Asp 


Lys 


Leu 


Arg 


Leu 


Leu 


Met 


Asp 


Asp 


Glu 


Thr 


Leu 


Arg 








3-10 










3^15 










350 






Lys 


Lys 


Met 


Gly 


Gin 


Glu 


Ser 


Glu 


Leu 


Met 


Val 


Lys 


Ser 


Tyr 


Ser 


Pro 






355 










360 








365 






Ala 


Asn 


He 


Tyr 


Glu 


Cys 


Trp 


Lys 


Lys 


Leu 


Phe 


Val 


Glu 


He 


Gly 


Tyr 



370 
Met Asn 
385 



375 



380 



(2) INFORMATION FOR SEQ ID NO: 2 80 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 655 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 PCT/AU98/01023 

196/490 

(vi) ORIGIUAL SOURCE: 

{A) ORGAMISM: Porphyromonas gingival is 

(ix) FEATURE: 
5 (A) NAIIE/KEY: misc_feature 

( B) LOCATIOU 1 ... 655 

(xi) SEQUEMCE DESCRI PTIOE^ : SEQ ID 110:280 

10 Thr Thr Lys Glu Asn Glu Thr Thr Thr Lys Asn Glu Tyr Arg He flet 
1 5 10 15 

Gly Lys He He Gly He Asp Leu Gly Thr Thr Asn Ser Cys Val Ser 

20 25 30 

Val Leu Glu Gly Asn Glu Pro He Val He Thr Asn Ser Glu Gly Lys 
15 35 40 ^5 

Arg Thr Thr Pro Ser Val Val Ala Phe Val Asp Gly Gly Glu Arg Lys 

50 55 60 

Val Gly Asp Pro Ala Lys Arg Gin Ala He Thr Asn Pro Thr Lys Thr 
65 70 75 80 

ZU He Tyr Ser He Lys Arg Phe Met Gly Glu Thr Tyr Asp Gin Val Ser 

85 90 95 

Arg Glu Val Glu Arg Val Pro Phe Lys Val Val Arg Gly Asp Asn Asn 

100 105 HO 

Thr Fro Arg Val Asp He Asp Gly Arg Leu Tyr Thr Pro Gin Glu He 

115 120 125 

Ser Ala Met He Leu Gin Lys Met Lys Lys Thr Ala Glu Asp Tyr Leu 

130 135 140 

Gly Gin Glu Val Thr Glu Ala Val He Thr Val Pro Ala Tyr Phe Asn 
„^ 1-55 ISO 155 160 

JU Asp Ala Gin Arg Gin Ala Thr Lys Glu Ala Gly Glu He Ala Gly Leu 

165 17 0 17 5 

Lys Val Arg Arg He Val Asn Glu Pro Thr Ala Ala Ser Leu Ala Tyr 

180 185 190 

Gly Leu Asp Lys Ser Asn Lys Asp Met Lys He Ala Val Phe Asp Leu 
o5 195 200 205 

Gly Gly Gly Thr Phe Asp He Ser He Leu Glu Leu Gly Asp Gly Val 

210 215 220 

Phe Glu Val Lys Ser Thr Asn Gly Asp Thr His Leu Gly Gly Asp Asp 
225 230 235 240 

^tU Phe Asp His Val He He Asp Trp Leu Ala Glu Glu Phe Lys Ser Gin 

24b 250 255 

Glu Gly Val Asp Leu Arg Gin Asp Pro Met Ala Met Gin Arg Leu Lys 

260 265 270 

Glu Ala Ala Glu Lys Ala Lys He Glu Leu Ser Ser Thr Ser Ser Thr 
45 275 280 285 

Glu He Asn Leu Pro Tyr He Met Pro Val Asn Gly He Pro Lys His 

290 295 300 

Leu Val Met Thr Leu Thr Arg Ala Lys Phe Glu Gin Leu Ala Asp Arg 
305 310 315 320 

OU Leu He Gin Ala Cys Val Ala Pro Cys Glu Thr Ala Leu Lys Asp Ala 

325 330 335 

Gly Met Ser Arg Gly Asp He Asp Glu Val He Leu Val Gly Gly Ser 

340 345 350 

Thr Arg He Pro Ala He Gin Glu He Val Glu Lys He Phe Gly Lys 
55 355 360 365 

Ala Pro Ser Lys Gly Val Asn Pro Asp Glu Val Val Ala Val Gly Ala 

370 375 380 

Ala He Gin Gly Gly Val Leu Thr Gly Glu Val Lys Asp Val Leu Leu 
385 390 395 400 

Leu Asp Val Thr Pro Leu Ser Leu Gly He Glu Thr Met Gly Gly Val 

405 410 415 

Met Thr Arg Leu He Asp Ala Asn Thr Thr He Pro Thr Lys Lys Ser 

420 425 430 

Glu He Phe Thr Thr Ala Val Asp Asn Gin Pro Ser Val Glu He His 

435 A40 
Val Leu Gin Gly Glu Arg Ser Leu Ala Lys Asp Asn Lys Ser He Gly 

450 455 460 

Arg Phe Asn Leu Asp Gly He Ala Pro Ala Pro Arg Gin Thr Pro Gin 
470 475 480 

/U He Glu Val Thr Phe Asp He Asp Ala Asn Gly He Leu Asn Val Thr 

4 85 4 90 4 95 

Ala His Asp Lys Ala Thr Gly Lys Lys Gin Asn He Arg He Glu Ala 

500 505 510 

Ser Ser Gly Leu Ser Asp Asp Glu He Lys Arg Met Lys Glu Glu Ala 
515 520 525 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



60 



65 



BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 



PCT/AU98/01023 



197/490 





Gin 


Ala 

530 


Asn 


Ala 


Glu 


Ala 


Asp 
535 


Lys 


Lys 


Glu 


Lys 


Glu 
540 


Arg 


He 


Asp 


Lys 




He 


Asn 


Gin 


Ala 


Asp 


Ser 


Met 


He 


Phe 


Gin 


Thr 


Glu 


Lys 


Gin 


Leu 


Lys 




5^15 










550 










555 










560 


5 


Glu 


Leu 


Gly 


Asp 


Lys 
565 


Phe 


Pro 


Ala 


Asp 


Lys 
570 


Lys 


Ala 


Pro 


He 


Asp 
575 


Thr 




Ala 


Leu 


Asp 


hys 
580 


Leu 


Lys 


Glu 


Ala 


His 
585 


Lys 


Ala 


Gin 


Asp 


Val 

590 


Ala 


Ala 




He 


Asp 


Thr 


Ala 


Met 


Ala 


Glu 


Leu 


Gin 


Thr 


Ala 


Leu 


Ser 


Ala 


Ala 


Gly 


10 






595 










600 










605 










Glu 


Glu 
610 


Leu 


Tyr 


Lys 


Asn 


Ala 
615 


Gly 


Ala 


Ala 


Gin 


Gly 
620 


Gly 


Ala 


Gin 


Pro 




Gly 


Pro 


Asp 


Phe 


Gly 


Gly 


Ala 


Gin 


Gly 


Pro 


Ser 


Ala 


Gly 


Asp 


Gin 


Pro 




625 










630 










635 










640 


15 


Ser 


Asp 


Asp 


Lys 


Asn 


Val 


Thr 


Asp 


Val 


Asp 


Phe 


Glu 


Glu 


Val 


Lys 












645 










650 










655 





(2) HI FORMAT I OH FOR SEQ ID 110:281 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: A61 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (ii) IIOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISJI: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAIIE/KEY: misc_f eat u re 

(B) LOCATION 1. . . ^67 

35 



40 



45 



50 



55 



60 



65 



70 



75 





(xi) 


SEQUENCE DESCRIPTIOII : SEQ ID HO: 281 












Lys 


Trp 


Ala 


Arg 


Thr 


Thr 


Thr 


Leu 


Arg 


He 


Ser 


Asn 


Ala 


Thr 


He 


Tyr 


1 




5 










10 










15 




Lys 


Thr 


1-let 


Arg 


Tyr 


Asp 


Leu 


Ala 


He 


He 


Gly 


Gly 


Gly 


Pro 


Ala 


Gly 








20 










2 5 










30 






Tyr 


Thr 


Ala 


Ala 


Glu 


Arg 


Ala 


Ala 


Lys 


Gly 


Gly 


Leu 


Lys 


Thr 


Leu 


Leu 






35 










40 










4b 








He 


Glu 


Lys 


Asn 


Ala 


Leu 


Gly 


Gly 


Val 


Cys 


Leu 


Asn 


Glu 


Gly 


cys 


He 




50 










55 










60 










Pro 


Thr 


Lys 


Thr 


Leu 


Leu 


Tyr 


Ser 


Ala 


Lys 


Val 


Leu 


His 


Gin 


He 


Ala 


65 








70 










75 










80 


Thr 


Ala 


Ser 


Lys 


Tyr 


Ala 


Val 


Ser 


Gly 


Thr 


Ala 


Asp 


Gly 


Leu 


Asp 


Leu 








85 










90 










95 




Gly 


Lys 


Val 


He 


Ala 


Arg 


Lys 


Gly 


Lys 


He 


He 


Arg 


Lys 


Leu 


Thr 


Ala 








100 










105 










110 






Gly 


He 


Arg 


Ser 


Arg 


Leu 


Thr 


Glu 


Ala 


Gly 


Val 


Glu 


Met 


Val 


Thr 


Ala 






115 










120 










125 








Gl u 


Ala 


Thr 


Val 


Thr 


Gly 


Cys 


Asp 


Ala 


Asp 


Gly 


He 


He 


Gly 


He 


Thr 




130 










135 










140 










Ala 


Gly 


Glu 


Ala 


Gin 


Tyr 


Lys 


Ala 


Ala 


Asn 


Leu 


Leu 


Leu 


Cys 


Thr 


Gly 


1^5 










150 










155 










160 


Ser 


Glu 


Thr 


Phe 


He 


Pro 


Pro 


He 


Pro 


Gly 


Val 


Glu 


Gin 


Thr 


Glu 


Tyr 










165 










170 










175 




Trp 


Thr 


Asn 


Arg 


Glu 


Ala 


Leu 


Gin 


Asn 


Lys 


Glu 


He 


Pro 


Thr 


Ser 


Leu 








180 










185 










190 






Val 


He 


He 


Gly 


Gly 


Gly 


Val 


He 


Gly 


Met 


Glu 


Phe 


Ala 


Ser 


Phe 


Phe 






195 










200 










205 








Asn 


Gly 


He 


Gly 


Thr 


Gin 


Val 


His 


Val 


Val 


Glu 


Met 


Leu 


Pro 


Glu 


He 




210 








215 










220 










Leu 


Asn 


Gly 


He 


Asp 


Pro 


Glu 


His 


Ala 


Ala 


Met 


Leu 


Arg 


Ala 


His 


Tyr 


225 








2 30 










235 










240 


Glu 


Lys 


Glu 


Gly 


He 


Lys 


Phe 


Tyr 


Leu 


Gly 


His 


Lys 


Val 


Thr 


Ser 


Val 








245 










250 










255 




Arg 


Asn 


Gly 


Ala 


Val 


Thr 


Val 


Glu 


Tyr 


Glu 


Gly 


Glu 


Ser 


Lys 


Glu 


He 




260 










2 65 










270 






Glu 


Gly 


Glu 


Arg 


He 


Leu 


Met 


Ser 


Val 


Gly 


Arg 


Arg 


Pro 


Val 


Leu 


Gin 






275 










280 










285 








Gly 


Phe 


Glu 


Ser 


Leu 


Gly 


Leu 


Val 


Leu 


Ala 


Gly 


Lys 


Gly 


Val 


Lys 


Thr 




290 










2 95 










300 











BNSDOCID: <WO 9929S70A1J_> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



198/490 



Asn 


Glu 


Arg 


Met 


Gin 


Thr 


Ser 


Leu 


Pro 


Asn 


Val 


Tyr 


Ala 


Ala 


Gly 


Asp 


305 










310 










315 








320 


He 


Thr 


Gly 


Phe 


Ser 


Leu 


Leu 


Ala 


His 


Thr 


Ala 


Val 


Arg 


Glu 


Ala 


Glu 










32 5 










330 








335 




Val 


Ala 


Val 


Asp 


Gin 


He 


Leu 


Gly 


Lys 


Thr 


Asp 


Glu 


Thr 


Met 


Ser 


Tyr 








34 0 










345 










350 




Arg 


Ala 


Val 


Pro 


Gly 


Val 


Val 


Tyr 


Thr 


Asn 


Pro 


Glu 


Val 


Ala 


Gly 


Val 






355 










360 










365 






Gly 


Glu 


Thr 


Glu 


Glu 


Ser 


Leu 


Arg 


Lys 


Ala 


Gly 


Arg 


Ala 


Tyr 


Thr 


Val 




370 










37 5 










380 








Arg 


Arg 


Leu 


Pro 


Met 


Ala 


Phe 


Ser 


Gly Arg 


Phe 


Val 


Ala 


Glu 


Asn 


Glu 


385 










390 










3 95 










4 00 


Gin 


Gly 


Asn 


Gly 


Glu 
405 


Cys 


Lys 


Leu 


Leu 


Leu 
410 


Asp 


Glu 


G3 u 


Asn 


Arg 
415 


Leu 


He 


Gly 


Ala 


His 


Leu 


He 


Gly 


Asn 


Pro 


Ala 


Gly 


Glu 


Leu 


He 


Val 


Thr 








420 










425 








430 






Ala 


Ala 


Met 


Ala 


He 


Glu 


Thr 


Gly 


Met 


Thr 


Asp 


Arg 


Gin 


He 


Glu 


Arg 






435 










440 










445 






He 


He 


Phe 


Pro 


His 


Pro 


Thr 


Val 


Gly 


Glu 


He 


Leu 


Lys 


Glu 


Thr 


Leu 




4 50 










455 










460 








Ala 


Gly 


Gly 





























465 



(2) IMFORIIATIOM FOR SEQ ID NO: 282 

(i) SEQUEtlCE CHARACTERISTICS: 

(A) LENGTH: 94 5 amino acids 
(E) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAIIISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) HAJ4E/KEY: misc_feature 

(B) LOCATIOM 1 ... 945 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282 



Pro 


Lys 


He 


Leu 


Met 


Glu 


Leu 


Lys 


Arg 


Phe 


Leu 


Ser 


Leu 


Gly 


Leu 


Leu 


1 








5 










10 








15 




Leu 


Val 


Gly 


Phe 

20 


He 


Pro 


Met 


Lys 


Leu 

25 


Ser 


Ala 


Gin 


Gin 


Ala 
30 


Gin 


Pro 


Leu 


Pro 


Thr 


Asp 


Pro 


Ala 


Val 


Arg 


Val 


Gly 


Lys 


Leu 


Asp 


As n 


Gly 


Leu 






3 5 










40 










45 






Thr 


Tyr 


Phe 


He 


Arg 


His 


Asn 


Glu 


Asn 


Pro 


Lys 


Asp 


Arg 


Ala 


Asp 


Phe 




50 










55 










60 








Phe 


He 


Ala 


Gin 


Lys 


Val 


Gly 


Ser 


He 


Leu 


Glu 


Glu 


Asp 


Ser 


Gin 


Ser 


65 










7 0 










75 










80 


Gly 


Leu 


Ala 


His 


Phe 


Leu 


Glu 


His 


Met 


Ala 


Phe 


Asn 


Gly 


Thr 


Lys 


Asn 










85 










90 








95 




Phe 


Pro 


Gly 


Lys 


Asn 


Leu 


He 


Asn 


Tyr 


Leu 


Glu 


Thr 


He 


Gly 


Val 


Arg 








100 










105 










110 




Phe 


Gly Gin 


Asn 


Leu 


Asn 


Ala 


Ser 


Thr 


Gly 


Phe 


Asp 


Lys 


Thr 


Glu 


Tyr 






115 










120 








125 








Thr 


He 


Met 


Asp 


Val 


Pro 


Thr 


Thr 


Arg 


Gin 


Gly 


He 


He 


Asp 


Ser 


Cys 




130 










135 






1 4 0 








Leu 


Leu 


He 


Leu 


His 


Asp 


Trp 


Ser 


Asn 


Asn 


He 


Thr 


Leu 


Asp 


Gly 


His 


145 










150 










155 






160 


Glu 


He 


Asp 


Glu 


Glu 


Arg 


Gly 


Val 


He 


Gin 


Glu 


Glu 


Trp 


Arg 


Ala 


Arg 


Arg 








165 










no 








175 


Asp 


Ala 


Asn 
180 


Leu 


Arg 


Met 


Phe 


Glu 
185 


Ala 


He 


Leu 


Ala 


Lys 
190 


Ala 


Met 


Pro 


Gly 


Asn 


Lys 


Tyr 


Ala 


Glu 


Arg 


Met 


Pro 


He 


Gly 


Leu 


Met 


Asp 


Val 






195 










200 










205 






Val 


Leu 


Asn 


Phe 


Lys 


His 


Asp 


Glu 


Leu 


Arg 


Asn 


Tyr 


Tyr 


Lys 


Lys 


Trp 




210 










215 










220 






Tyr 


Arg 


Pro 


Asp 


Leu 


Gin 


Gly 


Leu 


Val 


He 


Val 


Gly Asp 


He 


Asp 


Val 


225 


Tyr 








230 










235 








240 


Asp 


Val 


Glu 


Asn 


Lys 


He 


Lys 


Glu 


Leu 


Phe 


Lys 


Asp 


Val 


Pro 


Ala 










245 










250 








255 





SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



PCT/AU98/01023 



Pro Val Asn Pro 
260 

Glu Pro lie Val 

275 

5 Leu Ser lie Ser 

2 90 

Ser He Phe Gly 
305 

Ala Val Asn Glu 

10 

Phe Le\j Ser Ala 
340 

Thr Lys Asp Ala 
355 

15 Glu Lys Ala Met 
370 

Phe Gly He Thr 
385 

Lys Arg Tyr Glu 

20 

Ala Tyr Ala Asn 
420 

Pro Gly lie Glu 
435 

25 Val Pro Leu Glu 
^50 

Val Lys Asn Ala 
4 65 

Lys He Pro Ser 

30 

G.l n Gin Lys Val 
500 

Met Glu Lys Ala 
515 

35 Gin Lys Phe Gly 

530 

Tyr Leu Lys Lys 
54 5 

Leu Ser Pro Gly 

40 

Val Met Asn Ser 
580 

He Gin Leu Asp 
595 

45 Ser Leu Ser Leu 

610 

Asp Met Glu Thr 

625 

Arg Lys Asp Pro 

50 

Asn Leu Lys Asn 
660 

He Arg His Thr 
675 

55 Ala Ala Asp Val 
690 

Asn Glu Arg Phe 
705 

Asn Leu Asp Glu 

60 

Ser Leu Pro Asn 
740 

Pro Ala Ala Arg 
755 

65 Asp Thr Pro Ser 
770 

Tyr Thr Leu Lys 
785 

Asp Gin Val Tyr 

70 

Ser Val Ala Ala 
820 

Leu Met Gin He 
835 

75 Asn Ala He Val 



Ala 


Glu 


Arg 


He 


Tyr 


Thr 










265 




Ala 


He 


Ala 


Thr 


Asp 


Ala 








280 






Phe 


Lys 


Ser 


Asp 


Pro 


Thr 






295 








Leu 


Val 


Glu 


Asp 


Tyr 


Met 




310 










Arg 


Leu 


Ser 


Glu 


He 


Thr 


325 










330 


Gly 


Ala 


Phe 


Phe 


Ser 


Asn 










345 




Phe 


Asn 


Phe 


Val 


Ala 


Thr 








360 






Asn 


Ala 


Leu 


Val 


Ala 


Glu 






375 








Lys 


Gly 


Glu 


Tyr 


Asp 


Arg 




390 










Asn 


Gin 


Tyr 


Asn 


Giu 


Arg 


4 05 










410 


Glu 


Tyr 


Ser 


Thr 


Tyr 


Phe 










425 




Val 


Glu 


Tyr 


Gin 


Thr 


Val 








440 






Ala 


Phe 


Asn 


Gin 


Ala 


He 






4 55 








Val 


Val 


Thr 


Leu 


Thr 


Gly 




470 










Glu 


Ala 


Asp 


Phe 


Leu 


Ala 


485 










4 90 


Glu 


Ala 


Lys 


Lys 


Asp 


Glu 










505 




Pro 


Lys 


Ala 


Gly 


Lys 


He 








520 






Thr 


Thr 


Glu 


Leu 


Thr 


Leu 






535 








Thr 


Asp 


Phe 


Lys 


Ser 


Asn 




550 










Gly 


He 


Leu 


Ser 


Gly 


Lys 


565 










570 


Phe 


Met 


Asn 


Val 


Gly 


Gly 










585 




Lys 


Val 


Leu 


Thr 


Gly 


Arg 








600 






Leu 


Ser 


Glu 


Gly 


Leu 


Ser 






615 








Phe 


Phe 


Gin 


Leu 


He 


Tyr 




630 










Glu 


Ala 


Phe 


Lys 


Ala 


Thr 


645 










650 


Gin 


Glu 


Ala 


Asn 


Pro 


Met 










665 




Met 


Tyr 


Gly 


Asp 


Asn 


Pro 








680 






Glu 


Lys 


Val 


Asn 


Tyr 


Asp 






695 








Ala 


Asp 


Ala 


Gly 


Asp 


Phe 




710 










Ala 


Lys 


Met 


Lys 


Pro 


Leu 


725 










730 


Leu 


Lys 


Arg 


Gly 


Asp 


Lys 










745 




Ser 


Gly 


Lys 


He 


Asp 


Cys 








760 






Thr 


Thr 


He 


Phe 


Asp 


Val 






775 








Asn 


Ser 


Leu 


Leu 


Leu 


Glu 




790 










Thr 


Ala 


Thr 


Val 


Arg 


Glu 


805 










810 


Phe 


Gly 


Gly 


Leu 


Glu 


Gin 










825 




Tyr 


Phe 


Pro 


Thr 


Asp 


Pro 








840 






Phe 


Ala 


Glu 


Leu 


Glu 


Lys 



199/490 



Pro 


Val 


Glu 


Asp 


Asn 


Asp 








270 






Glu 


Ala 


Thr 


Thr 


Thr 


Gin 






285 








Pro 


Gin 


Glu 


Val 


Arg 


Gl y 




300 










Lys 


Gin 


Val 


He 


Thr 


Thr 


315 










320 


His 


Lys 


Pro 


Asn 


Ala 


Pro 










335 




Phe 


Met 


Tyr 


He 


Thr 


Gin 








350 






Val 


Arg 


Glu 


Gly 


Giu 


Ala 






365 








He 


Glu 


Ser 


Leu 


Arg 


Gin 




380 










Ala 


Arg 


Thr 


Asn 


Val 


Leu 


395 










400 


Asp 


Lys 


Arg 


Lys 


Asn 


Asn 










415 




Thr 


Asp 


Gly 


Gly 


Tyr 


He 








430 






Asn 


Ala 


Phe 


Ala 


Pro 


Gin 






445 








Ala 


Gin 


F'let 


He 


Asp 


Pro 




460 










Pro 


Ser 


Lys 


Ala 


Glu 


Ala 


475 










480 


Ala 


Phe 


Lys 


Ala 


Ala 


Arg 










4 95 




Val 


Ser 


Asp 


Gin 


Lys 


Leu 








510 






Val 


Ser 


Glu 


Lys 


Lys 


Asp 






525 








Ser 


Asn 


Gly 


He 


Lys 


Val 




540 










Glu 


He 


Leu 


Met 


Ser 


Ala 


555 










560 


His 


Ala 


Pro 


Asn 


Gin 


Ser 










575 




Leu 


Gly 


Asn 


Phe 


Asp 


Ala 








590 






Ser 


Ala 


Ser 


Val 


Ser 


Pro 






605 








Gly 


Lys 


Thr 


Thr 


Val 


Glu 




620 










Leu 


Gin 


Met 


Thr 


Ala 


Asn 


635 










640 


Gin 


Glu 


Lys 


Leu 


Tyr 


Asn 










655 




Ala 


Ala 


Leu 


Met 


Asp 


Ser 








67 0 






Met 


Met 


Lys 


Pro 


Met 


Lys 






685 








Gin 


Val 


Met 


Ala 


Phe 


Tyr 




700 










Met 


Phe 


Phe 


Phe 


He 


Gly 


715 










720 


He 


Glu 


Thr 


Tyr 


Leu 


Ala 










735 




Met 


Asn 


Lys 


Ala 


Gin 


Val 








750 






Lys 


Phe 


Glu 


Lys 


Glu 


Met 






7 65 








Val 


Ser 


Gly 


Asn 


Val 


Glu 




780 










Val 


Phe 


Ser 


Ala 


Val 


Met 


795 










800 


Lys 


Glu 


Gly 


Gly 


Ala 


Tyr 










815 




Tyr 


Pro 


Gin 


Pro 


Lys 


Ala 








830 






Ala 


Arg 


Ala 


Glu 


Glu 


Met 






845 








Leu 


Ala 


Lys 


Glu 


Gly 


Pro 



le 26) (RO/AU) 



BNSDOCID: <WO 9929870A1 J_> 



wo 99/29870 



PCT/AU98/01023 



200/ 490 







850 










855 










8 60 












Asn 


Val 


Glu 


Tyr 


Phe 


Lys 


Lys 


Thr 


He 


Glu 


Asn 


Leu 


As 11 


Lys 


Gin 


His 




865 










870 










875 










880 




Lys 


Glu 


Ser 


Leu 


Arg 


Glu 


Asn 


Arg 


Phe 


Trp 


Leu 


Glu 


Ala 


Met 


Lys 


Ala 


5 










885 










890 










895 






Ser 


Phe 


Phe 


Glu 
900 


Gly 


Asn 


Asp 


Phe 


He 
905 


Thr 


Asp 


Tyr 


Glu 


Ser 
910 


Val 


Leu 




Aso 


Gly 


Leu 


Thr 


Pro 


Ala 


Glu 


Leu 


Gin 


Lys 


Phe 


Ala 


Ala 


Asp 


Leu 


Leu 


10 






91^- 










920 










925 








Lys 


Gin 
930 


Gin 


Asn 


Arg 


Val 


Val 
935 


Val 


r4et 


Met 


Ala 


Pro 
940 


Val 


Ala 


Lys 


Ala 



Gin 
945 

15 (2) lUFORIIATIOlJ FOR SEQ ID MO: 2 83 

(i) SEQUENCE CHARACTERISTICS: 

{A\ LEHGTH: 686 amino acids 
(B) TYPE: amino acid 
20 ( D ) TO POLOG Y : linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



25 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(vi) ORIGINAL SOURCE: 

{A\ ORGAN I SI 1: Po rphyromonas gingival is 



(ix) FEATURE: 
30 (A) NAME/KEY: misc_feature 

(B) LOCATION 1 . . . 686 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 83 



Tyr 


Thr 


Met 


Ser 


Lys 


Lys 


Gly 


Thr 


He 


Gly 


Val 


Thr 


Ser 


Asp 


Asn 


He 


1 








5 










10 










15 




Phe 


Pro 


Val 


He 


Lys 


Lys 


Phe 


Leu 


Tyr 


Ser 


Asp 


His 


Glu 


He 


Phe 


Leu 








20 










25 










30 






Arg 


Glu 


He 


Val 


Ser 


Asn 


Ala 


Val 


Asp 


Ala 


Thr 


Gin 


Lys 


Leu 


Lys 


Thr 






35 










40 










45 








Leu 


Thr 


Ser 


Val 


Gly 


Glu 


Phe 


Lys 


Gly 


Glu 


Thr 


Gly 


Asp 


Leu 


Arg 


Val 




50 










55 










60 










Thr 


Val 


Ser 


Val 


Asp 


Glu 


Val 


Ala 


Arg 


Thr 


He 


Thr 


Val 


Ser 


Asp 


Arg 


65 










70 










75 










80 


Gly 


Val 


Gly 


Met 


Thr 


Glu 


Glu 


Glu 


Val 


Glu 


Lys 


Tyr 


He 


Asn 


Gin 


He 










85 










90 










95 




Ala 


Phe 


Ser 


Ser 


Ala 


Glu 


Glu 


Phe 


Leu 


Gl u 


Lys 


Tyr 


Lys 


Asp 


Asp 


Lys 








100 










105 










HO 






Ala 


Ala 


He 


He 


Gly 


His 


Phe 


Gly 


Leu 


Gly 


Phe 


Tyr 


Ser 


Ala 


Phe 


Met 






115 










120 










125 








Val 


Ser 


Glu 


Arg 


Val 


Asp 


Val 


He 


Thr 


Arg 


Ser 


Phe 


Arg 


Glu 


Asp 


Ala 




130 










135 










140 










Thr 


Ala 


Val 


Lys 


Trp 


Ser 


Cys 


Asp 


Gly 


Ser 


Pro 


Glu 


Tyr 


Thr 


Leu 


Glu 


145 










150 










155 










160 


Pro 


Ala 


Azp 


Lys 


Ala 


Asp 


Arg 


Gly 


Thr 


Asp 


He 


Val 


Met 


His 


He 


Asp 










165 










170 










175 




Glu 


Glu 


Asn 


Ser 


Glu 


Phe 


Leu 


Lys 


Lys 


Glu 


Lys 


He 


Glu 


Gly 


Leu 


Leu 








180 










185 










190 






Gly 


Lys 


Tyr 


Cys 


Lys 


Phe 


Leu 


Thr 


Val 


Pro 


He 


He 


Phe 


Gly 


Lys 


hys 






195 










200 










205 








Gin 


Glu 


Trp 


Lys 


Asp 


Gly 


Lys 


Met 


Gin 


Asp 


Thr 


Asp 


Glu 


Asp 


Asn 


Gin 




210 










215 










220 










He 


A^n 


Asp 


Thr 


His 


Pro 


Ala 


Trp 


Thr 


Lys 


Lys 


Pro 


Ala 


Asp 


Leu 


Lys 


225 










230 










235 










240 


Asp 


Glu 


Asp 


Tyr 


Lys 


Glu 


Phe 


Tyr 


Arg 


Ser 


Leu 


Tyr 


Pro 


Met 


Ser 


Glu 










245 










250 










255 




Glu 


Pro 


Leu 


Phe 


Trp 


He 


His 


Leu 


Asn 


Val 


Asp 


Tyr 


Pro 


Phe 


Asn 


Leu 








260 










265 










2~ 0 






Thr 


Gly 


He 


Leu 


Tyr 


Phe 


Pro 


Lys 


He 


Lys 


Asn 


A:rn 


Leu 


Asp 


Leu 


Gin 






275 










260 










285 








Arg 


Asrn 




He 


Gin 


Leu 


Tyr 


Cys 


Asn 


Gl n 


Val 


Tyr 


Val 


Thr 


Asp 


Glu 




2 90 










295 










300 










Val 


Gin 


Gly 


He 


Val 


Pro 


Asp 


Phe 


Leu 


Thr 


Leu 


Leu 


His 


Gly 


Val 


He 


305 










310 










315 










320 


A^p 


Ser 


Pro 


Asp 


He 


Pro 


Leu 


Asn 


Val 


Ser 


Arg 


Ser 


Tyr 


Leu 


Gin 


Ser 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID; <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



201/490 



10 



15 



20 



25 



30 



35 



40 



45 



55 



60 











325 










330 










335 




Asp 


Ala 


Asn 


Val 


Lys 


Lys 


He 


Ser 


Ser 


His 


He 


Thr 


Lys 


Lys 


Val 


Ala 








340 










345 










350 






Asp 


Arg 


Leu 


Glu 


Glu 


He 


Phe 


hys 


Asn 


Asp 


Arg 


Pro 


Thr 


Phe 


Glu 


Glu 






355 










360 










365 








Lys 


Trp 


Asp 


Ser 


Leu 


Lys 


Leu 


Phe 


Val 


Glu 


Tyr 


Gly 


Met 


Leu 


Thr 


Asp 


370 










37 5 










380 










Glu 


Lys 


Phe 


Tyr 


Gl u 


Arg 


Ala 


Ala 


Lys 


Phe 


Phe 


Leu 


Phe 


Thr 


Asp 


Met 


385 








390 










395 










4 00 


Asp 


Gly 


His 


Lys 


Tyr 


Thr 


Phe 


Asp 


Glu 


Tyr 


Arg 


Thr 


Leu 


Val 


Glu 


Gly 










405 










410 










415 




Val 


Gin 


Thr 


Asp 


Lys 


Asp 


Gly 


Gin 


Val 


Val 


Tyr 


Leu 


Tyr 


Ala 


Thr 


Asp 








420 










425 










430 






hys 


His 


Gly 


Gin 


Tyr 


Ser 


His 


Val 


Lys 


Arg 


Ala 


Ser 


Asp 


Lys 


Gly 


Tyr 




435 










4 40 










445 








Ser 


Val 


Met 


Leu 


Leu 


Asp 


Gly 


Gin 


Leu 


Asp 


Pro 


His 


He 


Val 


Ser 


Leu 




450 










455 










460 










Leu 


Glu 


Gl n 


Lys 


Leu 


Glu 


Lys 


Thr 


His 


Phe 


Val 


Arg 


Val 


Asp 


Ser 


Asp 


465 










470 










475 










480 


Thr 


lie 


Asn 


Asn 


Leu 


He 


Arg 


Lys 


Glu 


Glu 


Arg 


Ala 


Glu 


Val 


Lys 


Leu 










485 










490 










495 




Ser 


Asp 


Thr 


Glu 


Arg 


Ala 


Thr 


Leu 


Val 


Lys 


Leu 


Phe 


Glu 


Ala 


Arg 


Leu 






500 










505 










510 






Pro 


Arg 


Asp 


Glu 


Lys 


Lys 


His 


Phe 


Asn 


Val 


Ala 


Phe 


Glu 


Ser 


Leu 


Gly 






515 










520 










525 








Ala 


Glu 


Gly 


Glu 


Ala 


He 


Leu 


He 


Thr 


Gin 


Ala 


Glu 


Phe 


Met 


Arg 


Arg 




530 










535 










540 










Met 


Arg 


Asp 


Met 


Ala 


Gin 


Leu 


Gin 


Pro 


Gly 


Met 


Ser 


Phe 


Tyr 


Gly 


Glu 


545 








550 










555 










560 


Leu 


Pro 


Asp 


Ser 


Tyr 


Asn 


Leu 


Val 


Leu 


Asn 


Thr 


Asp 


His 


Pro 


Leu 


He 










565 










570 










57 5 




Asp 


Arg 


Val 


Leu 


Ser 


Gly 


Glu 


Lys 


Glu 


Ser 


Val 


Glu 


Pro 


Ser 


Leu 


Thr 








580 










585 










590 






Glu 


Leu 


Arg 


Ala 


Lys 


He 


Ala 


Glu 


Leu 


Lys 


Ala 


Glu 


Glu 


Ala 


Lys 


Leu 






595 










600 










605 








Leu 


Asp 


Glu 


Glu 


Lys 


Gly 


Lys 


Lys 


Pro 


Glu 


Glu 


He 


Pro 


Val 


Ala 


Thr 




610 










615 










620 










Lys 


Glu 


Ala 


Lys 


Glu 


Asn 


Asn 


Ala 


Val 


Glu 


Gin 


Ala 


Lys 


Thr 


Glu 


Gly 


625 








630 










635 










640 


Ser 


lie 


Asn 


Asp 


Gin 


Leu 


Thr 


Lys 


Tyr 


Ala 


Gin 


Asp 


Asn 


Glu 


Leu 


He 










645 










650 










655 




Gly 


Gin 


Leu 


He 


Asp 


Leu 


Ala 


Leu 


Leu 


Gly 


Ser 


Gly 


Leu 


Leu 


Thr 


Gly 








660 










665 










670 






Glu 


Ala 


Leu 


Ala 


Glu 


Phe 


He 


Arg 


Arg 


Ser 


Gin 


Arg 


Leu 


Leu 










675 










680 










685 









(2) m FORMATION FOR SEQ ID t JO: 284 



(i) SEQUEIICE CHARACTERISTICS: 
50 (A} LENGTH: 4 82 amino acids 

{B} TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIMAL SOURCE: 

(A) ORGAMISM: Porphyromonas gingival is 



(ix) FEATURE: 

(A) MAJ1E/KEY: inisc_f eature 
{ B) LOCATIOll 1 . - .4 82 

65 (xi) SEQUENCE DESCRIPTION: SEQ IDHO:284 

Asp He Arg Gin Lys Arg Pro Cys Phe Asn Ala Asn Leu Tyr Phe Tyr 

1 5 10 15 

Arg Ala Met Glu Lys Leu He Asp He Leu Val Val Asp Asp Asp Val 

70 20 25 30 

Ala Val Cys Ala Ala Leu Arg Leu Val Leu Lys Arg Ala Gly Tyr Asn 

35 40 45 

Pro Val He Ala Asn Ser Pro Asp Glu Ala Leu Ser He Met Arg Asn 

50 55 60 

75 Pro Asp Gly Gly Cys Lys Pro Ala Val He Leu Met Asp Met Asn Phe 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870Al_t_> 



wo 99/29870 PCT/AU98/01023 

202/490 

65 70 75 80 

Ser Leu Ser Thr Sen Gly Arg Glu Gly Leu Glu Leu Leu Glu Lys Met 

85 90 95 

Gin He Phe Thr Ser Cys Pro Val He Leu Met Thr Ala Trp Ala Ser 
5 100 105 110 

He Pro Leu Ala Val Glu Gly Met Arg Leu Gly Ala Phe Asp Phe He 

115 120 125 

Gly Lys Pro Trp Asp Asn Asp Arg Leu Leu Arg Thr He Asp Thr Ala 
130 135 140 

10 Leu His Leu Ala Ala Pro Ser Ala Val Ala Asn Pro Ser Glu Gin Ser 
l-JS 150 155 160 

Asp Arg Asp Thr Ala Arg Gin Pro Lys Ala Thr Val Gin Glu Asn Asp 

165 170 175 

P^o Cys Ala His He He Gly Arg Ser Asp Ala He Cys Lys He Lys 
15 180 185 190 

Glu Arg He Arg Arg He Ala Pro Thr His Ala Ser Val Leu He Thr 

195 200 205 

Gly Glu Ser Gly Thr Gly Lys Glu Leu He Ala Glu Ala Leu His Arg 

210 215 220 

Gly Ser Lys Arg Ala Ser Ala Pro Phe Val Lys Val Asn Leu Gly Gly 
225 230 235 240 

He Pro Glu Ser Leu Phe Glu Ser Glu Leu Phe Gly His Lys Lys Gly 

245 250 255 

Ala Phe Thr Asn Ala Phe Ser Asp Arg Lys Gly Arg Phe Glu Leu AJ a 
^5 260 265 270 

Asp Gly Gly Thr He Phe Leu Asp Glu He Gly Glu Leu Pro Val Gly 

275 280 285 

Asn Gin Val Lys Leu Leu Arg Val Leu Gin Glu Gin Thr Phe Glu Pro 

290 295 300 

Leu Gly Glu Ser Val Ser His Arg Val Asp He Arg Val Val Ser Ala 
305 310 315 320 

Thr Asn Ala Ser Leu Glu Arg Met Val Ala Glu Gly Arg Phe Arg Glu 

325 330 335 

Asp Leu Tyr Tyr Arg He Asn Leu He His Leu His Leu Pro Pro Leu 

340 345 350 

Arg Glu Arg Gin Glu Asp He Gin Leu Leu Val Glu Ala Phe Ser Glu 

355 360 365 

Ala Phe Ala Gin Ser Asn Gly Leu Pro His Ala Val Trp Ser Ala Glu 

3'70 375 380 

Ala Met Arg Arg He Cys Ala Met Pro Leu Pro Gly Asn Val Arg Glu 
385 390 395 400 

Leu Lys Asn Val Val Glu Arg Thr Leu Leu Leu Ser Gly Ser Arg Glu 

405 410 415 

He Ser Ala Arg Asp Val Ala Asp Phe Gly Ser Gin Val Thr Ala Ala 

^120 425 430 

Asp His Ser Asp Glu Arg Ala Leu Thr Asp Met Glu Glu Ala Ala He 
4 35 4 4 0 4 41 



30 



35 



40 



45 



50 



60 



65 



70 



75 



Arg Glu Thr Leu Thr Lys Tyr Asn Gly Asn Val Ser Arg Ala Ala Arg 

^SO 455 460 

Ala Leu Gly Leu Ser Arg Ala Ala Leu Tyr Arg Arg Met Glu Lys Tyr 
^65 470 475 480 

Gly Leu 



55 (2) IlIFORMATION FOR SEQ ID NO: 285 



(i) SEQUEIICE CHARACTERISTICS: 

(A) LEIIGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIIJAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAI-IE/KEY: misc_feature 

(B) LOCATION 1. . .263 

(xi) SEQUENCE DESCRIPTION: SEQ ID 110:285 
Arg Ser Leu Gin Ser Phe Gin Asn Lys Lys His Ser Ser Met Leu Lys 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



203/490 

1 5 10 15 

He Lys Asn Leu His Ala Thr Val Gin Gly Lys Glu He Leu Lys Gly 

20 25 30 

He Asn Leu Glu He Asn Ala Gly Glu He His Ala He Met Gly Pro 
5 35 40 45 

Asn Gly Ser Gly Lys Ser Thr Leu Ser Ser Val Leu Val Gly His Pro 

50 55 60 

Ser Phe Glu Val Thr Glu Gly Glu Val Thr Phe Asn Gly He Asp Leu 
65 "70 75 80 

10 Leu Glu Leu Glu Pro Glu Glu Arg Ala His Leu Gly Leu Phe Leu Ser 

85 90 95 

Phe Gin Tyr Pro Val Glu He Pro Gly Val Ser Met Val Asn Phe Met 

100 105 110 

Arg Ala Ala Val Asn Glu His Arg Lys Ala He Gly Ala Glu Pro Val 
15 115 120 125 

Ser Ala Ser Asp Phe Leu Lys Met Met Arg Glu Lys Arg Ala He Val 

130 135 " 140 

Glu Leu Asp Asn Lys Leu Ala Ser Arg Ser Val Asn Glu Gly Phe Ser 
145 150 155 160 

20 Gly Gly Glu Lys Lys Arg Asn Glu He Fhe Gin Met Ala Met Leu Glu 

165 170 175 

Pro Lys Leu Ala He Leu Asp Glu Thr Asp Ser Gly Leu Asp He Asp 

180 185 190 

Ala Leu Arg He Val Ala Gly Gly Val Asn Arg Leu Arg Ser Pro Glu 
25 195 200 205 

Asn Ala Ala He Val He Thr His Tyr Gin Arg Leu Leu Glu Tyr lie 

210 215 220 

Lys Pro Asp Phe Val His Val Leu Tyr Lys Gly Arg He Val Lys Ser 
225 230 235 240 

30 Gly Gly Ala Glu Leu Ala Leu Thr Leu Glu Glu Lys Gly Tyr Asp Trp 

245 250 255 

He Lys Glu Glu He Gly Glu 
260 

35 (2) IMFORMATIOH FOR SEQ ID NO: 286 

(i) SEQUEIJCE CHARACTERISTICS: 

{A) LENGTH: 4 62 amino acids 
(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Hii) HYPOTHETICAL: YES 



45 



55 



60 



65 



70 



75 



vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromona s gingival is 



(ix) FEATURE: 
50 (A) HAIIE/KEY: misc_feature 

(B) LOCATION 1 ... 462 



(xi) SEQUENCE DESCRIPTION: SEQ ID I>]0:2 86 



Ser 


Met 


Ala 


Lys 


Glu 


Lys 


Thr 


He 


Tyr 


Val 


Cys 


Arg 


Ser 


Cys 


Gly 


Thr 


1 








5 










10 










15 




Lys 


Tyr 


Ala 


Lys 


Trp 


Gin 


Gly 


Asn 


Cys 


Asn 


Ala 


Cys 


Gly 


Glu 


Trp 


Asn 








20 










25 










30 






Cys 


He 


Asp 


Glu 


Glu 


Lys 


Val 


Pro 


Ala 


Pro 


Ala 


Ser 


Gly 


Lys 


His 


Ala 






35 










40 










45 








Ala 


Lys 


Ser 


Phe 


Met 


Pro 


Arg 


Glu 


Gin 


Asp 


Asn 


Arg 


Pro 


Arg 


Leu 


Leu 




50 










55 










60 










Gin 


Asp 


Val 


Glu 


Ser 


Gly 


Asp 


Glu 


Glu 


Arg 


He 


Arg 


Leu 


Gly 


Asp 


Glu 


65 










70 










75 










80 


Glu 


Phe 


Asp 


Arg 


Val 


Leu 


Gly 


Gly 


Gly 


He 


Val 


Lys 


Gly 


Ala 


Phe 


Val 










85 










90 










95 




Leu 


Leu 


Gly 


Gly 


Glu 


Pro 


Gly 


He 


Gly 


Lys 


Ser 


Thr 


Leu 


He 


Leu 


Gin 








100 










105 










HO 






Thr 


Val 


Leu 


Arg 


Leu 


Pro 


Gin 


Leu 


Arg 


Thr 


Leu 


Tyr 


Val 


Ser 


Gly 


Glu 






115 










120 










125 








Glu 


Ser 


Ala 


Arg 


Gin 


Leu 


Lys 


Met 


Arg 


Ala 


Glu 


Arg 


Leu 


Gly 


Gin 


Ala 




130 










135 










140 










Met 


Asn 


Gly 


Cys 


Tyr 


Val 


Tyr 


Cys 


Glu 


Thr 


Asn 


He 


Glu 


Arg 


He 


Leu 


145 










150 










155 










160 


Ser 


Arg 


Ala 


Glu 


Glu 


Leu 


Thr 


Pro 


Asp 


Leu 


Leu 


Val 


He 


Asp 


Ser 


He 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1J_> 



wo 99/29870 PCT/AU98/01023 

204/ 490 









1 65 










170 










175 






Gin 


Thr 


V;^l T\T Thr 

V a X J-^-J. Ill J. 


Glu 


Glu 


Met 


Glu 


Ser 


Ser 


Ala 


Gl V 


Ser 


Val 


Gl 








180 








185 










190 








Gin 


I1& 


Arg Glu Cys 


Ala 


Al a 


Leu 


Leu 


Leu 


- 








Thr 


Thr 


5 






1 95 






200 










205 










Gl V 


lie 


Pro Val lie 


Val 


He 


Gly 


Hi^ 


He 


Thr 


Lys 


Glu 


Gl V 


Ser 


He 






210 






215 










220 












Ala 


Gl V 


Pro Lys Val 


Leu 


Glu 


His 


He 


Val 


Asp 


Thr 


Val 


Leu 


Leu 


Phe 


10 


225 






230 










235 










24 0 


ir-' 


Gl V 


Ac-o T.-'*5 Hi«; 


His 


Leu 


Tvr 


Arg 


He 


Leu 


Arg 


Gl V 


Gin 


Lys 


Asn 








24 5 










250 










255 






A 

rg 


1 yr 


ijiiy iiix 




Gl u 


L'='U 


ni \j- 
iji y 


lis 


T\,-r 

■I y ^ 


Gl u 


riet 


Arg 


Gin 


A^ 
sp 








260 








2 65 










27 0 








Gly 


L^u 


A\ i_ y w J. y V d X 


Glu 


Asn 


Pro 


Ser 


Glu 


His 


Leu 


He 


Thr 


Ar o 


As n 


15 






27 5 






2 80 










285 










rg 


Glu 


ZlcT~i Taii Qcir' 


Gl \^ 


He 


Ala 


He 


AL a 


Val 


Ala 


Met 


Glu 


Gl -^r 
'o J. J- 


He 






2^0 




























Arg 


Pro 


He Leu He 


Glu 


Al a 




Al a 


Leu 


Va 1 


Ser 


Ser 


Ala 


He 




20 


305 






310 










31 5 










320 


Ala 


As n 


Pro Gin Arg 


Ser 


Ala 


Thr 


Gly 


Phe 


Asp 


He 


Ara 


Arg 


Met 


Asn 








325 










330 










335 






Met 


Leu 


Leu Ala Val 


Leu 


Glu 




Arg 


Ala 


Gl V 


Phe 


Lvs 


L=u 


He 


Gl n 








3-5 0 








3 4 5 










350 






25 




Asp 


vdj. irii^ 


As n 


11" 


Al a 


Gl -i/ 


G 1 - - 


He 


L* *s 


He 


Ala 


rti P 


Pro 






355 






360 










365 










Ala 


Thr 


Asp Leu Ala 


Val 


He 


Ser 


Ala 


Val 


Leu 


Ala 


Ser 


Ser 


Leu 


Asp 






37 0 






37 5 










3 8 <j 












lie 


Val 


He Pro Pro 


Al a 


Val 


Cys 


Met 


Thr 


Gl v 
ox y 




Val 


Gl 


Leu 


Ser 


30 


385 






390 










395 










4 00 


Gl V 


Glu 


lie Arg Pro 


Val 


Ser 


Arg 


He 


Glu 


Gin 


Arg 


He 


Thr 


Glu 


Ala 








405 










410 










415 






Arg 


Arg 


He Gly Phe 


Lys 


Glu 


He 


Leu 


Val 


Pro 


Al a 


Asp 


Asn 


Phe 


Arg 








420 








425 










4 30 






35 


Glrv 


Glu 


Asp Ala Gly 


Arg 


Phe 


Gly 


He 


Arg 


Leu 


Val 


Pro 


Val 


Arg 


Lys 






435 






440 










445 










Val 


Glu 


Glu Ala Phe 


Arg 


His 


Leu 


Phe 


Ser 


Lys 


Gly 


Arg 


Glu 










450 






455 










4 60 












(2) 


IigFORl lATIOlI FOR 


SEQ 


ID HO: 287 
















40 
































(i) 


SEQUEHCE CHARACTERISTICS: 






















(A) LEIJGTH: 373 amino 


acids 




















(B) TYPE: 


amino acid 
























(D) TOPOLOGY: 


linear 


















45 


































(ii) 


MOLECULE TYPE: 


protein 






















(iii> 


Hi POT H ET I C AL : YES 




















50 




(vi) 


ORIGINAL SOURCE: 


























(A) ORGAMISM: 


Porphyromonas gingivalis 












(ix) 


FEATURE : 
























55 






(A) NAME/ KEY: 


misc feature 


















(B) LOCATION H . . 37 3 






















(xi) 


SEQUENCE DESCRIPTION: SEQ ID MO: 287 










60 


Gly 


Ser 


Cys Gly Ser 


Ser 


Pro 


Ala 


He 


Gly 


Arg 


Ser 


Gly 




Ser 


Arg 


1 




5 










10 










15 






Cys 


Asp 


Arg Arg He 


Gly 


His 


Gly 


Leu 


Ser 


Thr 


Ser 


Ala 


Phe 


Arg 


Thr 








20 








25 










30 








Tyr 


Ala 


Gly Leu Val 


Cys 


His 


Ser 


Arg 


Leu 


Arg 


Asp 


Gin 


Ser 


Ser 


Cys 


65 






35 






40 










45 








Arg 


Val 
50 


Cys Gly Met 


Arg 


Pro 
55 


He 


Gly 


Gin 


Leu 


Phe 

60 


Phe 


Pro 


Ser 


Cys 




Pro 


Arg 


Arg Ser Ala 


Ser 


Asp 


Ala 


Gly 


Arg 


Thr 


Glu 


Arg 


He 


Gin 


Ser 




65 






70 










75 










80 




Pro 


Leu 


Gin Asn Leu 


Leu 


Gin 


Glu 


Met 


Gin 


Phe 


Ser 


Pro 


Thr 


Asn 


Arg 


70 






85 










90 










95 




Arg 


Asn 


Pro Phe Asp 


Asn 


Met 


Asn 


Ser 


Arg 


His 


Leu 


Thr 


He 


Thr 


He 








100 








105 










110 








lie 


Ala 


Gly Leu Ser 


Leu 


Phe 


Val 


Leu 


Thr 


Leu 


Gly 


Gly 


n- • = 
"J — 


Ser 


Val 


75 






115 






120 










125 








Ala 


Gin 


Gin Asp Thr 


Gin 


Trp 


Thr 


Leu 


Gly 


Gly 


Lys 


Leu 


Phe 


Thr 


Ser 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1J_> 



wo 99/29870 



PCT/AU98/01023 



205/490 



130 135 140 

Ala Trp lie Gin Arg Ser Ala Glu Tyr Gin Ala Leu Cys He Gin Ala 
145 150 155 160 

Tyr Asn IJe Ala Thr Glu Arg Val Asp Ala Leu Pro Ala Glu Arg Lys 
5 165 170 175 

Gin Gly Asp Arg Pro Tyr Ala He Val Thr A5p He Asp Glu Thr He 

180 185 190 

Leu Asp Asn Thr Pro Asn Ser Val Tyr Gin Ala Leu Arg Gly Lys Asp 
195 200 205 

10 Tyr Asp Glu Glu Thr Trp Gly Lys Trp Cys Ala Gin Ala Asp Ala Asp 
210 215 220 

Thr Leu Ala Gly Ala Leu Ser Phe Phe Leu His Ala Ala Asn Lys Gly 
225 230 235 240 

He Glu Val Phe Tyr Val Thr Asn Arg Arg Asp Asn Leu Arg Glu Ala 
15 245 250 255 

Thr Leu Gin Asn Leu Gin Arg Tyr Gly Phe Pro Phe Ala Asp Glu Glu 

2G0 265 270 

His Leu Leu Thr Thr His Gly Pro Ser Asp Lys Glu Pro Arg Arg Leu 
275 280 285 

20 Lys He Gin Glu Gin Tyr Glu He Val Leu Leu He Gly Asp Asn Leu 
290 295 300 

Gly Asp Phe His His Phe Phe Asn Thr Lys Glu Glu Ser Gly Arg Lys 
305 310 315 320 

Gin Ala Leu Gly Leu Thr Ala Gly Glu Phe Gly Arg His Phe He Met 
25 325 330 335 

Leu Fro Asn Pro Asn Tyr Gly Ser Trp Glu Pro Ala Trp T^^'r Gly Gly 

340 345 350 

Lys Tyr Pro Pro Leu Pro Glu Arg Asp Lys Ala Leu Lys Gin Leu His 
355 360 365 

30 Ser Gin Asn Ser Arg 
370 

(2} IMFORMATIOli FOR SEQ ID MO: 288 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

{ vi ) ORIGII JAL SOURCE : 
45 (A) ORGAinSM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) rJAI IE/KEY: misc_feature 

(B) LOCATIOIi 1 . . .4 26 



50 



(::i) SEQUEtlCE DESCRI PTIOI ) : SEQ ID HO: 288 



Arg Trp Gly Phe Ser Asn Phe Val Ala Met Ser Thr Asn He Asp Val 
15 10 15 

55 Gin Gin He Lys Gin Arg Phe Gly He He Gly Ser Ser Pro Leu Met 
20 25 30 

Glu His Ala He Arg Val Ala Ala Gin Val Ala Pro Thr Asp Met Ser 

35 40 45 

Val Leu Val Thr Gly Glu Ser Gly Ser Gly Lys Glu Phe Phe Pro Gin 
60 50 55 . 60 

He He His Tyr Tyr Ser Ala Arg Lys His His Ser Tyr He Ala Val 
65 7 0 7 5 8 0 

Asn Cys Gly Ala He Pro Glu Gly Thr He Asp Ser Glu Leu Phe Gly 
8 5 90 95 

65 His Arg Lys Gly Ser Phe Thr Gly Ala Val Ser Asp Arg Lys Gly Tyr 
100 105 ' HO 

Phe Glu Glu Ala Ser Gly Gly Thr He Phe Leu Asp Glu Val Gly Glu 

115 120 125 

Leu Pro Leu Pro Thr Gin Ala Arg Leu Leu Arg Val Leu Glu Thr Gly 
70 130 135 ' 140 

Glu Phe He Pro Val Gly Ala Ser Gin Ser Gin Lys Thr Asp Val Arg 
145 150 155 160 

He Val Ala Ala Thr Asn Val Asn Leu Lys Glu Ala Val Ala Asn G.ly 
165 170 175 

75 Lys Phe Arg Glu Asp Leu Phe Phe Arg Leu Asn Thr Val Pro He Glu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 PCT/AU98/01023 

206/ 490 



40 



45 



50 



55 



60 



65 



70 



75 









180 










185 










190 






Val 


Pro 


A] a 


Leu 


Arg 


Met 


Arg 


Pro 


Asp 


Asp 


Val 


Pro 


Leu 


Leu 


Phe 


Arg 






195 










200 










205 






Arg 


Phe 


Al a 


Ala 


Asp 


Ser 


Ala 


Glu 


Lys 


Tyr 


Arg 


Met 


Pro 


Pro 


Leu 


Arg 




210 










215 










220 








Leu 


Ser 


Asp 


Glu 


Ala 


Arg 


Thr 


He 


Leu 


Met 


Arg 


Tyr 


Arg 


Trp 


Pro 


Gly 


225 










230 










235 










240 


Asn 


Val 


Arg 


Glu 


Leu 


Arg 


Asn 


He 


Thr 


Asp 


Arg 


Leu 


Ser 


He 


Leu 


Glu 










2^35 










250 








255 




Glu 


Glu 


Arg 


Thr 


Val 


Ser 


Ala 


Glu 


Thr 


He 


Thr 


Arg 


Tyr 


Leu 


Asp 


Ala 








260 










265 








270 






Glu 


Gly 


Met 
275 


Gin 


Asp 


Leu 


His 


Pro 
280 


Val 


Val 


He 


Arg 


Arg 
285 


Asn 


Glu 


Thr 


Thr 


Glu 
290 


Ala 


Asp 


Ly^; 


Gin 


He 
295 


Pro 


His 


Tyr 


Glu 


Arg 
300 


Glu 


He 


He 


Tyr 


Gin 


Val 


Leu 


Tyr 


Asp 


Met 


Lys 


Lys 


Glu 


He 


Ala 


Asp 


Leu 


Lys 


Gly 


Met 


305 










310 










315 






320 


net 


Asn 


Arg 


Leu 


Ala 

325 


His 


His 


Glu 


Gin 


Pro 
330 


Ser 


Trp 


Pro 


Val 


Gly 
335 


Ser 


A.^p 


Val 


Trp 


Gly 


Asn 


Asp 


Asp 


Lys 


Arg 


Thr 


Ala 


Asp 


Pro 


Lys 


Trp 


Gly 








340 










345 










350 




Val 


Ser 


Thr 

355 


His 


Lys 


Ala 


Pro 


He 
360 


Ala 


Asn 


Ala 


Ala 


Glu 

365 


Pro 


Val 


Glu 


Pro 


He 
37 0 


Gin 


Glu 


Ala 


Ser 


Glu 
375 


Tyr 


Thr 


Glu 


Asp 


Pro 
380 


Val 


Ser 


Leu 


Glu 


Glu 


Val 


Glu 


Lys 


Lys 


Met 


He 


Ser 


Leu 


Ala 


Leu 


Glu 


Arg 


His 


Gly 


Gly 


385 










390 










395 








400 


Arg 


Arg 


Lys 


Gin 


Thr 


Ala 


Glu 


Glu 


Leu 


Lys 


He 


Ser 


Glu 


Arg 


Thr 


Leu 










4 05 










410 








415 




Tyr 


Arg 


Lys 


He 
420 


Lys 


Glu 


Tyr 


Gly 


Leu 
425 


Glu 















10 



15 



20 



25 



30 



( 2 ) I M FORMAT I ON FOR SEQ ID NO : 2 8 9 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 653 amino acids 

(B) TYPE: amino acid 
{D} TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) 1IAI4E/KEY: misc_feature 

(B) LOCATION 1 . . . 653 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289 



Lys 


Asn 


Leu 


Pro 


Arg 


Tyr 


Ala 


Pro 


Ala 


Phe 


Gly 


Arg 


Asn 


Arg 


Lys 


Arg 


1 








5 










10 










15 


Glu 


Gin 


Pro 


Asp 


Pro 


Asp 


Asn 


Asp 


Asp 


Ser 


Arg 


Gly 


Arg 


Ser 


His 


Arg 


Gin 






20 










25 










30 




Ser 


GJy 


Arg 


Ala 


Val 


Thr 


Val 


Val 


Cys 


Arg 


Arg 


He 


Gin 


Thr 


Pro 






35 










40 








45 








Leu 


Pro 


His 


Met 


He 


Leu 


Leu 


Phe 


Gly Gly 


Thr 


Thr 


Glu 


Gly Arg 


Ala 


Ala 


50 










55 










60 










Ala 


Arg 


Val 


Leu 


Asp 


Glu 


Ala 


Gly 


Ser 


Pro 


Phe 


Phe 


Tyr 


Ser 


Thr 


65 










7 0 








75 








80 


Lys 


Gly 


Asn 


Leu 


Gin 


Glu 


He 


Gin 


Ser 


Ser 


His 


Gly 


His 


Arg 


Leu 


Thr 


Gly 








85 










90 








95 




Ala 


Met 


Thr 


Val 


Ala 


Asp 


Met 


Val 


Ser 


Phe 


Cys 


Arg 


Lys 


Glu 


Glu 


He 






100 










105 








110 






Arg 


Leu 


He 


Val 


Asp 


Ala 


Ala 


His 


Pro 


Phe 


Ala 


Glu 


Glu 


Leu 


His 


Ala 




115 










120 










125 








Ser 


Val 


Ala 


Glu 


Ala 


Ser 


Glu 


Gin 


Thr 


Gly 


He 


Pro 


Val 


Val 


Arg 




130 










135 








140 








Tyr 


Glu 


Arg 


Gin 


Tyr 


Pro 


Pro 


Arg 


Glu 


Glu 


Gly 


He 


Val 


Trp 


Cys 


Ala 


145 










150 










155 






160 


Asn 


Tyr 


Asp 


Thr 


Ala 


Ala 


Glu 


Arg 


Met 


Leu 


Gly 


Asp 


Gly Val 


Gin 


Arg 










165 










170 










175 


Leu 


Leu 


Met 


Leu 


Thr 


Gly 


Val 


Asn 


Thr 


He 


Pro 


Lys 


Leu 


Ala 


Ala 


Phe 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I 



wo 99/29870 



PCT/AU98/01023 



207/490 



180 185 190 

Trp Lys Glu Arg Thr Thr Phe Cys Arg lie Leu Lys Arg A^p Glu Ser 

195 200 205 

Val Ala Leu Ala Glu Lys Asn Gly Phe Pro Ala Glu Arg lis Val Phe 
5 210 215 220 

Phe Glu Pro His Ala Asp Glu Glu Leu Met Gin Ala Val Arg Pro Asp 
225 230 235 240 

Ala lie lie Thr Lys Glu Ser Gly Glu Ser Gly Tyr Phe Arg Glu Lys 
245 250 255 

10 He Glu Ala Ala Arg Arg Met Gly He Arg He Tyr Ala Val Val Arg 
260 265 2*70 

Pro Pro Leu Pro Pro Ser Phe He Pro Val Gly Gly Pro Val Gly Leu 

275 280 285 

Arg Arg Ala Val Glu Arg Leu Val Pro Gly Phe Phe Ser Leu Arg Ser 
15 290 295 300 

Gly Phe Thr Thr Gly Thr Thr Ala Thr Ala Ala Val Val Ala Ala Met 
305 310 315 320 

Tyr Arg Leu Met Gly Leu Gly Ser Leu Ala Glu Ala Pro Val Glu Leu 
325 330 335 

20 Pro Ser Gly Glu He Val Ser Leu Pro He Ala Glu He Arg Glu Glu 
340 345 350 

Glu Asp Ala Val Val Ser Ala Val Leu Lys Asp Ala Gly Asp Asp Pro 

355 360 365 

Asp Val Thr Asn Gly Met Ala Val Cys Ala Thr He Arg Leu Asn Pro 
25 370 375 380 

Glu His Glu Glu Val Arg Phe Leu Gin Gly Glu Gly Val Gly Val Val 
385 390 395 400 

Thr Leu Pro Gly Leu Gly Leu Glu Val Gly Gly Pro Ala He Asn Leu 
405 410 415 

30 Val Pro Arg Arg Met Met Thr Ala Glu Val Arg Arg Leu Tyr Ala Gin 
420 425 430 

Gly Gly Val Asp He Thr He Ser Val Pro Glu Gly Arg Glu Ala Ala 

435 440 445 

Thr Gin Thr Phe Asn Pro Arg Leu Gly He Arg Asp Gly He Ser He 
35 450 455 460 

He Gly Thr Ser Gly Val Val Lys Pro Phe Ser Ala Glu Ala Phe Val 
465 470 475 480 

Gly Ala He Arg Lys Gin Val Gly He Ala Thr Ala Leu Gly Ala Asn 
485 490 495 

40 His He Val Leu Asn Ser Gly Ala Lys Ser Glu Arg Tyr Val Lys Gly 
500 505 510 

Ala Tyr Pro Ala Leu He Pro Gin Ala Phe Val Gin Tyr Gly Asn Phe 

515 520 525 

Val Gly Glu Ser Leu Ser Cys Val Ala Ser Phe Pro Ser Val Arg Ser 
45 530 535 54 0 

Val Thr Val Gly He Met Leu Gly Lys Ala Val Lys Leu Ala Glu Gly 
545 550 555 560 

Tyr l^eu Asp Thr His Ser Lys Lys Val Val Met Asn Arg Asp Phe Leu 
565 570 575 

50 His Glu Leu Ala Arg Gin Ala Gly Cys Ser Glu Asp He His Ala He 
580 585 590 

He Asp Ser Leu Asn Leu Ala Arg Glu Leu Trp Thr Met Pre Ser Ala 

595 600 605 

Glu Asp Ser Asp Arg Leu Leu Arg Lys He Ala Glu Arg Ser Trp Glu 
55 610 615 620 

Thr Cys Arg Pro Ser Val Pro Ser Ala Glu Leu Glu Leu Leu Leu He 
625 630 635 640 

Asp Glu Ser Gly Ala He Arg Phe Arg He Gly Gly Glu 
645 650 



60 



(2) IIJFORJ'IATIOIJ FOR SEQ ID MO: 2 90 



ti) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 4 51 amino acid^ 
65 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

70 (Hi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAMISM: Porphyromonas gingival is 

75 (iM) FEATURE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929e70A1_U> 



wo 99/29870 



PCT/AU98/01023 



208/490 



(A) MAI-IE/ KEY: mi £'C_f eat ur e 

(B) LOCATIOU 1 ... 451 

(xi) SEQUEIICE DESCRIPTION: SEQ ID 110:2 90 



Gin 


Gin 


Arg 


Pro 


His 


Arg 


He 


He 


Met 


Leu 


Arg 


Thr 


Phe 


Arg 


He 


Gly 


1 








5 










10 










1 5 




Gly 


He 


Hi5 


Pro 


Pro 


Glu 


Asn 


Lys 


Leu 


Ser 


Ala 


Gly 


Lys 


Pro 


Val 


Glu 








20 










25 










30 






Val 


Leu 


Fro 


He 


Pro 


Ser 


Gin 


Val 


Val 


He 


Pro 


Leu 


Gly 


Gin 


His 


He 






35 










4 0 










4 5 








Gly 


Ala 


Pro 


Al a 


Thr 


Al a 


Thr 


Val 


Lys 


Lys 


Gly 


Asp 


Glu 


Vdl 


Lys 


Val 




50 










55 










60 










Gly 


Thr 


He 


He 


Ala 


Gin 


Ala 


Gly 


Gly 


Phe 


Val 


Ser 


Al a 


Asn 


He 


His 


65 










7 0 










7 5 










8 0 


Ser 


Ser 


Val 


Ser 


Gly 


Lys 


Va 1 


Leu 


Lys 


He 


Asp 


Asn 


Val 


Tyr 


Asp 


Ser 










85 










90 










95 




Ser 


Gly 


Tyr 


Pro 


Lys 


Pro 


Ala 


Val 


Phe 


He 


Ser 


Val 


Glu 


Gly 


Asp 


Glu 








100 










105 










110 






Trp 


Glu 


Glu 


Gly 


He 


Asp 


Arg 


Ser 


Pro 


Al a 


He 


Val 


Ly^ 


Gl u 


Cys 


Asn 






115 










120 










12 5 








Leu 


Asp 


Ala 


Ly^ 


Glu 


He 


Val 


Ala 


Lys 


He 


Ser 


Ala 


Ala 


Gly 


He 


Val 




130 










135 










140 










Gly 


Leu 




Gly 


Ala 


Thr 


Phe 


Pro 


Thr 


Mis 


Val 


Ly^ 


Leu 


Ser 


Pro 


Pro 












150 










155 










1 60 


Pro 


Gly 


Asn 


Ly^ 


Ala 


Glu 


He 


Leu 


He 


He 


Asn 


Ala 


Val 


Glu 


Cys 


Glu 










165 










170 










175 




Pro 


Tyr 


Leu 


Thr 


Ser 


Asp 


His 


Val 


Leu 


Met 


Leu 


Glu 


His 


Gly 


Glu 


Glu 








180 










185 










190 






He 


Met 


He 


Gly 


Val 


Ser 


He 


Leu 


Met 


Lys 


Ala 


He 


Gin 


Val 


Asn 


Lys 






195 










200 










205 








Ala 


Val 


He 


Gly 


Val 


Glu 


Asn 


Asn 


Lys 


Lys 


Asp 


Ala 


He 


Ala 


His 


Leu 




210 










215 










220 










Thr 


Lys 


Leu 


Ala 


Thr 


Ala 


Tyr 


Pro 


Gly 


He 


Glu 


Val 


Met 


Pro 


Leu 


Lys 


225 










230 










235 










24 0 


Val 


Gin 


Tyr 


Pro 


Gin 


Gly 


Gly 


Glu 


Lys 


Gin 


Leu 


He 


Asp 


Ala 


Val 


He 










245 










250 










255 




Arg 


hys 


Gin 


Val 


Lys 


Ser 


Gly 


Ala 


Leu 


Pro 


He 


Ser 


Thr 


Gly 


Ala 


Val 








260 










265 










270 






Val 


Gin 


Asn 


Val 


Gly 


Thr 


Val 


Phe 


Ala 


Val 


Tyr 


Gl u 


Ala 


Val 


Gin 


Lys 






27 5 










280 










285 








Asn 


Lys 


Pro 


Leu 


Val 


Glu 


Arg 


He 


Val 


Thr 


Val 


Thr 


Gly 


Lys 


Lys 


Leu 




290 










295 










300 










Ser 


Arg 


Pro 


Ser 


Asn 


Leu 


Leu 


Val 


Arg 


He 


Gly 


Thr 


Pro 


He 


Al a 


Ala 


305 










310 










315 












Leu 


He 


Gi u 


Ala 


Ala 


Gly 


Gly 


Leu 


Pro 


Glu 


Asn 


Thr 


Gly 


Lys 


He 


He 










325 










330 










335 




Gly 


Gly 


Gly 


Pro 


Met 


Met 


Gly 


Arg 


Ala 


Leu 


Leu 


Ser 


Pro 


Asp 


Val 


Pro 








340 










345 










350 






Val 


Thr 


Lys 


Gly 


Ser 


Ser 


Gly 


Val 


Leu 


He 


Leu 


Asp 


Arg 


Glu 


Glu 


Ala 






355 










360 










365 








Val 


Arg 


Lys 


Pro 


Met 


Arg 


Asp 


Cys 


He 


Arg 


Cys 


Ala 


Lys 


Cys 


Val 


Gly 




370 










375 










380 










Val 


Cys 


Pro 


Met 


Gly 


Leu 


Asn 


Pro 


Ala 


Phe 


Leu 


Met 


Arg 


Asp 


Thr 


Leu 


385 










390 










395 










400 


Tyr 


Lys 


Ser 


Trp 


Glu 


Thr 


Ala 


Glu 


Lys 


Gly 


Asn 


Val 


Val 


Asp 


Cys 


He 










405 










410 










415 




Glu 


Cys 


Gly 


Ser 


Cys 


Ser 


Phe 


Thr 


Cys 


Pro 


Ala 


Asn 


Arg 


Pro 


Leu 


Leu 








420 










425 










4 30 






Asp 


Tyr 


He 


Arg 


Gin 


Ala 


Lys 


Lys 


Thr 


Val 


Met 


Gly 


He 


Gin 


Arg 


Ala 






435 










440 










445 








Arg 


Lys 


Gin 






























450 































10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 (2) IN FORMATION FOR SEQ ID 110:2 91 

(i) SEQUENCE CHARACTERISTICS: 

(Ai LEHGTH: 4B9 amino acids 
(B) TYPE: amino acid 
70 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

75 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 PCT/AU98/01023 

209/ 490 

(vi) ORIGINAL SOURCE: 

(A) ORGANISII: Porphyromonas gingival is 

(iy.) FEATURE: 
5 (A) MAIIE/KEV: mi £:c_f eature 

(B) LOCATIOri 1 ... ^89 

(Mi) SEQUEMCE DESCRIPTIOH: SEQ ID 110:2 91 

10 Met Asn Ser Gin Lys Lys Glu Ala Phe Asn Met Lys Arg lie Gin Leu 
1 5 10 15 

Thr Leu lie Ala Leu Phe Ala Ala Val Ala Gly Leu Val Ala Gin Asn 

20 25 30 

Ala Tyr Glu Gly Val lie Ser Tyr Lys He Ser Leu Asp Lys Thr Gly 
15 35 40 45 

Asn Lys Val Val Leu Asn Gly Ala Ala Asp Met Ser Asn Leu Lys Leu 

50 55 60 

Lys Ser Thr Gin Met He He Val Thr Pro He Leu Arg Ser Glu Asp 
65 10 75 80 

20 Gly Thr Ser Arg Val Glu Phe Pro Ser Val Val He Thr Gly Arg Asn 
85 90 55 

Arg Thr Lys Ala Leu Lys Arg Glu He Ala Phe Ser Ser Ala Leu Pro 

100 105 110 

Gin Ala Lys His Ala Ala Gin Tyr He Arg Arg His Asn Gly Lys Ser 
25 115 120 125 

Glu Gin Phe Ala Phe Thr Gly Glu His Ala Tyr Ala Ser Trp I'let Met 

130 135 140 

Asp Ala Lys Plie Val Val Arg Glu Glu Val Arg Gly Cys Ala Lys Cys 
1^5 150 155 160 

30 Pro Val Gly Leu Ser Ser Asn He Val Pro Phe Asp Pro Leu Phe Asn 
165 170 175 

Pro Ala Glu Ala Pro Tyr Leu Leu Ala His He Thr Pro Ala Glu Glu 

180 185 190 

Val Glu Lys Gin Arg Glu Ser Ser Phe Asp Ala Tyr He Asn Phe Lys 
35 195 200 205 

Val Asn Lys Ala Asp Val Leu Pro Glu Tyr Arg Asn Asn Lys Ala Glu 

210 215 220 

Leu Glu Lys He Lys Glu Phe Val Ser Thr Val Lys Ala Asn Pro Asn 
225 230 235 240 

40 Tyr Ser Val Asn Lys Met He He Glu Gly Phe Ala Ser Pro Glu Ala 
245 250 255 

Ser He Ala His Asn Lys Ala Leu Ser Glu Arg Arg Ala Lys Arg Leu 

260 265 270 

Ala Glu Glu Leu Val Arg Lys Tyr Gly Lys Thr Leu Pro Asn He Thr 
45 275 280 285 

Thr Glu Phe Gly Gly Glu Asp Trp Lys Gly Leu Lys Leu Ala He Glu 

2 90 2 95 300 

Lys Ser Asp He Ala Asp Arg Asp Arg Val Leu Glu He He Asn Ser 
305 310 " 315 320 

50 Asp Lys Tyr Ala Asp Asp Asp Ala Arg Glu Gin Ala Leu Lys Gin Leu 
325 330 335 

Ser Ser Tyr Arg Tyr He Leu Asp Gin He Tyr Pro Asn Leu Arg Arg 

340 345 350 

Asn Thr He Thr Met Gly Tyr He Val Arg Asp Tyr Thr Leu Glu Glu 
55 355 360 365 

Ala Arg Glu He He Lys Thr Ala Pro Lys Glu Leu Ser Glu Ala Glu 

370 375 380 

Met Tyr Arg Val Ala Met Ser Tyr Pro Glu Gly His Gin Glu Arg Leu 
385 390 395 400 

60 Phe Ala Leu Asn Thr Thr Leu Lys Tyr Phe Pro Glu Ser Val Thr Gly 
405 410 415 

Arg lie Asn Leu Ala Val Ala Ala Phe Asn Gly Gly Asp Val Gin Gin 

420 425 430 

Ala He Ala Leu Leu Ser Pro He Gin Thr Glu Lys Gly Val Ser Asn 
65 435 440 445 

He Leu Gly Ala Ala Tyr Ala Arg Thr Gly Asp Phe Ala Arg Ala Glu 

450 455 460 

Thr Phe Phe Arg Lys Ala Val Ala Glu Gly Asp Ala Asn Ala Gin Arg 
465 470 475 480 

70 A.sn Leu Asp Met Leu Leu Gly Lys Lys 
485 

(2) IIIFORMATION FOR SEQ ID KO:2 92 
75 (i) SEQUEIICE CHARACTERISTICS: 



BNSDOCID: <WO 9929870A1J_> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



PCT/AU98/01023 



210/ 490 



(A) LEMGTH: 384 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
10 (A) ORGAIIISM: Porphyromona s gingivali£ 

(ix) FEATURE: 

(A) NATvIE/KEY: misc_feature 

(B) LOCATION 1 . . . 384 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



(:ci) SEQUEIICE DESCRIPTIOll : SEQ ID MO: 292 



Asp 


Met 


Al a 


Glu 


Lys 


Arg 


Asp 


Tyr 


Tyr 


Glu 


Val 


Leu 


Gly 


Vai 


Ser 


Lys 


1 








5 










1 0 










1 5 




Ae'n 


Al a 


Thr 


Asp 


Asp 


Glu 


Leu 


Lys 


Lys 


Al a 


Tyr 


Arg 


Lys 


Lys 


Al a 


He 


















2 5 










30 






Gin 




His 


Pro 


Asp 


Lys 


Asn 


Pro 


Gly 


Asp 


Lys 


Glu 


Al a 


Gl u 


Glu 


His 






3 5 










4 0 










4 5 








Pne 


Lys 


Glu 


Val 


Ala 


Glu 


Al a 


Tyr 


Asp 


Va 1 


Leu 


Ser 


Asp 


Pro 


Gin 


Lys 




C A 




















60 










Ar<g 


Ser 


Gin 


Tyr 


Asp 


*_3 1 Tl 


Phe 


Gly 


His 


Al a 


Gly 


Leu 


Gly 


Gly 


Al a 


Al a 


c; -J 










7 n 










7 5 










8 0 


Gi y 


Gl y 


Gly 


Plie 


Ser 


Gly 


Gly 


Gly 


Met 


Ser 


Met 


Glu 


Asp 


lie 


Phe 


Ser 










85 










9 0 










Q 

r D 




Arg 


Pne 




Asp 


Leu 


tr n e 


Gl J- 


Gly 


Phe 


Gl y 


Gly 


Phe 


G 1 y 


Gly 


Phe 


Sex* 








r uu 










1 05 










110 






Asp 


Met 




Ci \r 

GiV 


GiV 


Ser 


Arg 


Arg 


Arg 


Va X 


Arg 


Arg 


Gly 


Ser 


Asp 


Leu 






inc. 
X i. O 










120 










125 








Arg 


Va 1 


Arg 


Val 


Lys 


Leu 


Se r 


Leu 


Al a 


Asp 


He 


Ser 


Lys 


Gly 


Val 


Gl u 




1 T n 
u u 




















14 0 










Lys 




Va 1 


Lys 


Val 


L y s 




J*" 1 n 

ox n 


V ax 


V a X 


Cys 


Ser 


Lys 


Cys 


Arg 


LjX y 


145 










150 










155 










160 


Asp 


Gly 


Thr 


Glu 


Glu 


Ala 


Asn 


Gly 


Lys 


Thr 


Thr 


Cys 


Gin 


Thr 


Cys 


His 










165 










170 










175 




Gly 


Thr 


Gly 


Val 


Val 


Thr 


Arg 


Val 


Ser 


Asn 


Thr 


Phe 


Leu 


Gly 


Ala 


Met 








180 










185 










190 






Gin 


Thr 


Gin 


Ser 


Thr 


Cys 


Pro 


Thr 


Cys 


His 


Gly 


Glu 


Gly 


Glu 


He 


He 






195 










200 










205 








Thr 


Lys 


Pro 


Cys 


Ser 


Lys 


Cys 


Lys 


Gly 


Glu 


Gly 


Val 


Glu 


He 


Gly 


Glu 




210 










215 










220 










Glu 


Val 


He 


Ser 


Phe 


His 


He 


Pro 


Ala 


Gly 


Val 


Ala 


Glu 


Gly 


Met 


Gin 


225 










2 30 










235 










240 


Met 


Ser 


Val 


Asn 


Gly 


Lys 


Gly 


Asn 


Ala 


Ala 


Pro 


Arg 


Gly 


Gly 


Val 


Asn 










245 










2 50 










255 




Gly 


Asp 


Leu 


He 


Val 


Vai 


He 


Ala 


Glu 


Glu 


Pro 


Asp 


Pro 


Asn 


Leu 


He 








260 










265 










27 0 






Arg 


Asn 


Gly 


Asn 


Asp 


Leu 


He 


Tyr 


Asn 


Leu 


Leu 


He 


Ser 


Val 


Pro 


Leu 






275 










280 










285 








Ala 


He 


Lys 


Gly 


Gly 


Ser 


Val 


Glu 


Val 


Pro 


Thr 


He 


Asp 


Gly 


Arg 


Ala 




290 










295 










300 










Ly^ 


He 


Arg 


lie 


Glu 


Ala 


Gly 


Thr 


Gin 


Pro 


Gly 


Lys 


Met 


Leu 


Arg 


Leu 


305 










310 










315 










320 


Arg 


Asn 


Lys 


Gly 


Leu 


Pro 


Ser 


Val 


Asn 


Gly 


Tyr 


Gly 


Met 


Gly 


Asp 


Gin 










325 










330 










335 




Leu 


Val 


Asn 


Val 


Asn 


Va 1 


Tyr 


He 


Pro 


Glu 


Ser 


He 


Asp 


Ala 


Lys 


Asp 








340 










345 








350 






Glu 


Gin 


Ala 


He 


Ala 


Ala 


Met 


Glu 


Asn 


Ser 


Asp 


Ser 


Phe 


Lys 


Pro 


Thr 






355 










360 










365 








Asp 


Ala 


Ala 


Arg 


Lys 


Asp 


He 


Asp 


Lys 


Lys 


Tyr 


Arg 


Glu 


Met 


Leu 


Asp 




370 










375 










380 











(2) lUFORMATIOJJ FOR SEQ ID IJO:293 



(i) SEQUEIICE CHARACTERISTICS: 
70 <A) LEMGTH: 309 amino acids 

(B} TYPE: amino acid 
(D) TOPOLOGY: linear 



75 



(ii) MOLECULE TYPE: protein 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



8NSDOCID: <WO 9929a70A1_l_> 





wo 99/29870 



PCT/AU98/01023 



211/490 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAIvIISM: Porphyromonas gingivalis 

iix) FEATURE: 

(A) NAI-tE/KEY: mi sc_f eat ure 
{ B) LOCATION 1 . . .309 



(xi) SEQUENCE DESCRIPTION: SEQ ID 110:293 



Arg 


Phe 


Asp 


Ser 


Thr 


Thr 


Asn 


Val 


Ser 


Gin 


He 


Asn 


Leu 


Arg 


Thr 


Glu 


1 








5 










10 










15 




Met 


hys 


Lys 


Leu 


He 


Leu 


Ala 


Thr 


Leu 


Gly 


Leu 


Met 


Ala 


He 


Ala 


I let 






20 










25 










30 






Leu 


Ser 


Cys 


Ser 


Ser 


Asn 


Asn 


Lys 


Asp 


Leu 


Glu 


Asn 


Lys 


Gly 


Glu 


Ala 






35 










40 










4 5 








Thr 


Leu 


Leu 


Val 


Thr 


Phe 


Gly 


Ser 


Ser 


Tyr 


Lys 


Ala 


Pro 


Arg 


Glu 


Thr 




50 










55 










60 










Tyr 


Ala 


Lys 


He 


Glu 


Lys 


Thr 


Phe 


Ala 


Ala 


Ala 


Tyr 


Pro 


Asp 


Gin 


Arg 


65 








70 










75 










80 


He 


Ser 


Trp 


Thr 


Tyr 


Thr 


Ser 


Ser 


He 


He 


Arg 


Lys 


Lys 


Leu 


Ala 


Gin 










85 










90 










95 




Gin 


Gly 


He 


Tyr 


He 


Asp 


Ala 


Pro 


Asp 


Glu 


Ala 


Leu 


Glu 


Lys 


Leu 


Ala 








100 










105 










HO 






Arg 


Leu 


Gly 


Tyr 


Lys 


Lys 


He 


Asn 


Vai 


Gin 


Ser 


Leu 


His 


Val 


He 


Pro 






115 










120 










125 








Gly 


Arg 


Glu 


Tyr 


Asp 


Glu 


Met 


He 


Asp 


Phe 


Va 1 


Asn 


Lys 


Phe 


Lys 


Al a 




130 










135 










140 










Ala 


His 


Ser 


Asp 


He 


Thr 


Val 


Lys 


Val 


Gly 


Ala 


Pro 


Leu 


Phe 


Asp 


Thr 


145 










150 










155 










160 




Glu 


Asp 


Met 


Arg 


Glu 


Val 


Ala 


Glu 


He 


Leu 


His 


Lys 


Arg 


Phe 


Gin 








165 










170 










175 




Gin 


Thr 


He 


Glu 


Lys 


Gly 


Glu 


Ala 


He 


Val 


Phe 


Met 


Gly 


His 


Gly 


Thr 








180 










185 










190 






Gl u 


His 


Ala 


Ala 


Asn 


Asp 


Arg 


Tyr 


Ala 


Arg 


He 


Asn 


Lys 


He 


Met 


Lys 






195 










200 










205 








Asn 


Tyr 


Ser 


Lys 


Phe 


Met 


He 


Val 


Gly 


Thr 


Val 


Glu 


Ser 


Asp 


Pro 


Ser 




210 










215 










220 










He 


Asn 


Asp 


Val 


He 


Ala 


Glu 


Leu 


Lys 


Glu 


Thr 


Gly 


Ala 


Thr 


Ala 


Val 


225 










230 








235 










240 


Thr 


Met 


Met 


Pro 


Leu 


Met 


Ser 


Val 


Ala 


Gly 


Asp 


His 


Ala 


Thr 


Asn 


Asp 










245 










250 










255 




Met 


Ala 


Gly 


Asp 


Glu 


Asp 


Asp 


Ser 


Trp 


Lys 


Thr 


Leu 


Leu 


Thr 


Asn 


Ala 








260 










265 










270 






Gly 


Tyr 


Thr 


Val 


Ser 


He 


Asp 


Lys 


Leu 


Asp 


Asn 


Gly 


Asn 


Phe 


Ser 


Ala 






275 










280 










285 








Leu 


Gly 


Asp 


He 


Glu 


Glu 


He 


Arg 


Asn 


He 


Trp 


Leu 


Lys 


His 


Met 


Lys 




2 90 










295 










300 










Ala 


Thr 


Ser 


Ala 


Arg 

























(2) INFORMATION FOR SEQ ID NO: 2 94 

(i) SEQUENCE CHARACTERISTICS: 

(Al LENGTH: 4 91 amino acids 
(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAI'IE/KEY: misc__f eature 
(Bl LOCATION 1 ... 491 

(;:i) SEQUENCE DESCRIPTION: SEQ ID NO: 294 

Arg Arg Asp Ala Arg Arg Gin Leu Val Arg Arg Arg Arg Lys Glu Leu 

15 10 15 

Ser Gly Thr Ala Asn Thr Arg Cys Arg Arg Arg Val Phe Pro Glu Arg 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSCXXDID: <WO 9929e70Al_l_> 



wo 99/29870 



PCT/AU98/01023 



212/490 



20 25 30 

Acq Ser Ala lie Arg Trp Cys Phe Pro Arg Leu Arg Asp Giy Leu Gin 

35 ^0 45 

Pro lie Arg Phe Gly Ser Leu Leu Ala Asn Pro Gin Gin Leu Ser Arg 
5 50 55 60 

Gly Val Arg Leu Arg Ser Phe Glu Ser Arg Arg Gin Pro He Cys Arg 
65 70 " 75 80 

Phe Gin Gly He Gin Gly hys Arg Arg Asp He Leu Gin Pro Glu Gly 
65 90 95 

10 Asn Ser Leu Leu Arg Thr His Ala His Ala Ser Arg Leu Gly Ala Gly 
100 105 110 

Arg Ser Tyr Arg Tyr Leu Ala Pro Arg Thr Thr Ser Arg Pro Pro Thr 

115 120 125 

Ala ?he Leu Leu Phe Ala Pro Met Thr Ser Val Ser His Leu Arg Thr 
15 130 135 140 

He Ser Val Ala Gly lie Leu Ala Ala Leu Gly Gly Ala Val Leu He 
1^5 150 155 160 

Leu Phe Gly Val Asn Leu Phe Leu Gly Ser Val Ala He Pro Met Ser 
165 170 175 

20 Glu He Plte Arg His Leu Phe Ser Asp Arg Pro Glu Gly Gly Glu Ala 
180 185 " 190 

Leu Val His Tyr Asn He Leu Trp Lys Ser Arg Leu Pro Glu Ala Leu 

195 200 205 

Thr Ala Ala Phe Ala Gly Ala Gly Leu Ser Val Ser Gly Leu Gin Met 
25 2.10 215 220 

Gin Thr Val Phe Arg Asn Pro Leu Ala Gly Pro Ser Val Leu Gly He 
225 230 235 240 

Ser Ser Gly Ala Ser Leu Gly Val Ala Leu Val Val Leu Leu Ser Gl^^- 
2-15 250 255 

30 Ser Leu Gly Gly Val Ala Leu Ser Ser Leu Gly Tyr Met Gly Glu Val 
260 265 270 

Ala llet Asn He Ala Ala Ala Val Gly Ser Leu Ala Val Met Gly Leu 

275 280 285 

He Val Phe Val Ser Thr Lys Val Arg Ser His Val Thr Leu Leu He 
35 290 295 300 

He Gly Val Met He Gly Tyr Val Ala Thr Ala Val He Gly Val Phe 
305 310 315 320 

Lys Phe Phe Ser He Glu Glu Asp He Arg Ala Tyr Val He Trp Gly 
325 330 335 

40 Leu Gly Ser Phe Ser Arg Ala Thr Asp Ser Gin Leu Ser Phe Phe Ala 
340 345 350 

He Leu Met Leu He Phe He Pro Ala Gly Met Leu Leu Val Lys Gin 

355 360 365 

Leu Asn Leu Leu Leu Leu Gly Glu Ser Tyr Ala Arg Asn Leu Gly Leu 
45 370 375 380 

Asn Thr Arg Arg Ala Arg Leu Leu Val He Ser Ser Ala Gly Leu Leu 
385 390 395 400 

He Ala Thr Val Tlir Ala Tyr Cys Gly Pro He Gly Phe Leu Gly Met 
4 05 410 415 

50 Ala Val Pro His Leu Ala Arg Val He Phe His Thr Ser Asp His Arg 
420 425 430 

He Leu Met Pro Ala Thr Cys Leu He Gly Ser Ala Leu Ala Leu Phe 

435 440 445 

Cys Asn He He Ala Arg Met Pro Gly Phe Glu Gly Ala Leu Pro Val 
55 450 455 460 

Asn Ser Val Thr Ala Leu Val Gly Ala Pro He He Val Thr Val Leu 
465 470 475 480 

Phe Arg Arg Arg Arg Phe Lys Glu Glu Thr Asp 
485 490 



60 



(2) IMFORMATIOH FOR SEQ ID 110:295 



(i) SEQUEllCE CHARACTERISTICS: 

{A) LENGTH: 763 amino acids 
65 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

70 (iii) HYPOTHETICAL: YES 

Ivi) ORIGIMAL SOURCE: 

(A) ORGANISM: Porphyromona s gingiva lis 

75 (ix) FEATURE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



213/ 490 



(A) MA] IE/KEY: mi sc_f eat ur e 

(B) LOCATION 1 . . .763 

(xi) SEQUENCE DESCRIPTION: SEQ ID IIO:295 

5 

His Phe Leu Val Thr He Met Arg Thr Lys Thr He Phe Phe Ala He 
15 10 15 

He Ser Phe He Ala Leu Leu Ser Ser Ser Leu Ser Ala Gin Ser Lys 
20 25 30 

10 Ala Val Leu Thr Gly Ser Val Ser Asp Ala Glu Thr Gly Glu Pro Leu 
35 40 45 

Ala Gly Ala Arg He Glu Val Lys His Thr Asn He Val Ala Gly Ala 

50 55 60 

Asp Ala Gly Gly His Phe Glu He Lys Asn Leu Pro Ala Gly Gin His 
15 65 70 75 80 

Thr He He Cys Ser Leu Gly Gly Tyr Gly Gin Lys Glu Glu Val Val 

85 90 95 

Ala He Glu Ala Gly Gin Thr Lys Thr He Ser Phe Ala Leu Arg Leu 
100 105 110 

20 Arg Thr Asn Asn Leu Glu Glu Val Val Val Thr Gly Thr Gly Thr Arg 
115 120 125 

Tyr Arg Leu Val Asp Ala Pro Val Ala Thr Glu Val Leu Thr Ala Lys 

130 135 140 

Asp He Ala Ser Phe Ser Ala Pro Thr Ser Glu Ala Leu Leu Gin Gly 
25 145 150 155 160 

Leu Ser Pro Ser Phe Asp Phe Gly Pro Asn Leu (let Gly Ser Phe llet 

165 170 175 

Gin Leu Asn Gly Leu Ser Ser Lys Tyr He Leu He Leu He Asp Gly 
180 185 190 

30 Lys Arg Val Tyr Gly Asp Val Gly Gly Gin Ala Asp Leu Ser Arg He 
195 200 205 

Ser Pro Asp Gin He Glu Arg He Glu Leu Val Lys Gly Ala Ser Ser 

210 215 220 

Ser Leu Tyr Gly Ser Asp Ala He Ala Gly Val He Asn Val He Thr 
35 225 230 235 240 

Lys Lys Asn Thr Asn Arg Leu Ser Ala Tyr Thr Ser His Arg He Ser 

245 250 255 

Lys Tyr Asn Asp Arg Gin Thr Asn Thr Ser Leu Asp He Asn He Gly 
260 265 270 

40 Lys Phe Ser Ser Asn Thr Asn Tyr Phe Phe Tyr His Thr Asp Gly Trp 
275 280 285 

Gin Asn Ser Pro Phe Glu He Lys Lys Lys Lys Gly Ser Gly Glu Pro 

290 295 300 

Val Leu Glu Glu Thr Tyr Lys Lys Thr Phe Arg Ala Gin Glu Asn Gin 
45 305 310 315 320 

Gly Val Ser Gin Ser Leu Ser Tyr Tyr Ala Thr Asn Asn Leu Ser Phe 

325 330 335 

Ser Gly Asn Val Gin Tyr Asn Lys Arg Gin He Phe Thr Pro Thr Phe 
340 345 350 

50 Ser Glu Lys Lys Ala Tyr Asp Met Asp Tyr Arg Ala Leu Thr Ala Ser 
355 360 365 

Leu Gly Thr Asn Tyr Leu Phe Pro Asn Gly Leu His Thr Leu Ser Phe 

370 375 380 

Asp Ala Val Tyr Asp Arg Phe Arg Phe Glj' Tyr Leu Tyr His Asp Lys 
55 385 390 395 400 

Asp Ser Ser Glu Ser Leu He Asn Asn Gin Gly Gin Thr Glu Gin Pro 

405 410 415 

Thr Phe Phe Pro Gly Gin Leu Arg Asn Lys Asn Asp Gin He Arg Tyr 
420 425 430 

60 Thr Ala Glu Ala Arg Gly Val Phe Thr Leu Pro Tyr Ala Gin Lys Leu 
435 440 445 

Thr Gly Gly Leu Glu Tyr Phe Arg Glu Glu Leu He Ser Pro Tyr Asn 

450 455 460 

Leu He Thr Asp Lys Ala Asp Ala Ser Thr Leu Ser Ala Tyr Val Gin 
65 465 470 475 480 

Asp Glu Trp Lys Pro Leu Asp Trp Phe Asn !iet Thr Ala Gly Phe Arg 

485 490 495 

Leu Val His His Gin Glu Phe Gly Thr Arg Met Thr Pro Lys Val Ser 
500 505 510 

70 He Leu Ala Lys Tyr Gly Pro Leu Asn Phe Arg Ala Thr Tyr Ala Asn 
515 520 525 

Gly Tyr Lys Thr Pro Thr Leu Lys Glu Leu Phe Ala Arg Asn Glu Leu 

530 535 540 

Thr Thr Met Gly Ser His Asn Leu Tyr Leu Gly Asn Ala Asp Leu Lys 
75 545 550 555 560 



BNSDOCID: <WO 9929870A1J_> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 PCT/AU98/01 023 

214/ 490 



45 



50 



55 



60 



65 



70 



75 



Pro 


Gin 


net 


Ser 


Asp 
565 


Tyr 


Tyr 


Ala 


Leu 


Gly 
57 0 


Leu 


Glu 


Tyr 


Asn 


Gin 
57 5 


Gly 


Pro 


He 


Ser 


Phe 
580 
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(2) ID FORMATION FOR SEQ ID 110:2 96 

30 (i) SEQUEIJCE CHARACTERISTICS: 

(A) LENGTH: 365 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
40 (A) ORGAI'IISH: Porphyromon as gingival! s 

(ix) FEATURE: 

(A) MAI-IE/KEY: misc^feature 

(B) LOCATIOIT 1. . .365 



(>:i) SEQUENCE DESCRIPTION; SEQ ID WO: 296 
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SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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20 (2) IMFORMATIOn FOR SEQ ID MO: 2 97 

U) SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH: 320 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 
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(vi) ORIGIIIAL SOURCE: 

(A) ORGAIIISM: Porphyromonas gingivalis 



(ix) FEATURE: 
35 {A) MAME/KEY: misc_feature 

(B) LOCATIOn 1 . . . 32 0 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 2 97 
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SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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(2) INFORMATION FOR SEQ ID HO: 2 98 

(i) SEQUENCE CHARACTERISTICS: 

{AJ LENGTH: 582 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

15 { iii ) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAIIISH: Porphyromonas gingival is 

20 (ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION 1 . . . 582 



(j;i) SEQUENCE DESCRIPTION: SEQ ID 110:2 98 
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(2) lUFORJ lATIOII FOR SEQ ID !JO:2 99 



(i) SEQUEtlCE CHARACTERISTICS: 

(A) LEHGTH: 985 amino acids 
30 (B) TYPE : amino acid 

{□) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

35 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAtllSM: Porphyrornonas gingiva lis 

40 (ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1. . . 985 



(>:i) SEQUEIICE DESCRIPTIOM: SEQ ID NO: 2 99 
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Asp 


Gin 


He 


Thr 


Ala 


Lys 


Ser 


Thr 


Asp 


Ala 


145 










150 










155 










160 


Asp 


Val 


Val 


Arg 


Leu 


Leu 


Lys 


Glu 


Lys 


Tyr 


Asn 


Ser 


Ala 


Val 


Glu 


Ala 










165 










170 










175 




Ser 


Phe 


Asn 


Val 


Leu 


Arg 


Ala 


Arg 


He 


Asp 


Ala 


Phe 


Gly 


Val 


Val 


Ala 








180 










185 










190 






Pro 


Asn 


Leu 


Gin 


Arg 


Leu 


Glu 


Gly 


Gin 


Gly 


Arg 


He 


Leu 


Val 


Glu 


Leu 






195 










200 










205 








Pro 


Gly 


Val 


Lys 


Asp 


Pro 


Glu 


Arg 


Val 


Arg 


Thr 


Leu 


Leu 


Gin 


Arg 


Ser 




210 










215 










220 










Ala 


Asn 


Leu 


Gin 


Phe 


Trp 


Arg 


Thr 


Tyr 


Lys 


Phe 


Gl u 


Glu 


Val 


Ser 


Gly 


225 










230 








235 










240 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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Asp Leu lie Ala Ala Aen Aep Arg Leu Ser Glu Leu Ala Met Asn Asn 

24b 250 2 55 

Thr Asp Ala Thr Pro Glu Thr Glu Pro Ala Thr Thr Asp Ser Val Ala 
260 265 270 

5 Ala Thr Ala Asp Ser Ala Ala Val Gin Ala Val Ala Asp Ser Ala Thr 
S^S 280 285 

Val Ala Gin Lys Glu Ala Lys Asp Ala Thr Arg Lys Asp Ala Leu Ph^ 

290 295 300 

Ser Leu Leu Thr Pro Val Asn Arg Gly Gly Ala Val Val Gly Val Ala 
lU 305 310 315 320 

Arg Arg Ala Asn Ilet Ala Gin He Ser Glu Met Leu Gin Gin Ala His 

325 330 335 

Asp Leu Lys Val Thr Arg Glu Asp Val Leu Phe Leu Trp Gly Ala Lys 

3-10 3^5 
Ala He Glu Asp Pro Glu Thr Lys Lys Glu Thr Asp Leu Tyr Glu Leu 

355 360 
Tyr Ala He Arg Thr Asn Arg Thr Gly Asp Pro Asp Leu Gly Glv Asd 

370 375 380 

Val Val Thr Ser Ala Lys Ser Asp He Gin Asn Asp Phe Gly Arg S-r 
385 390 395 400 

Glu Pro He Val Ser Met Thr Met Asn Glu Glu Gly Ala Arg Lys Trp 

405 410 415 

Ala Arg He Thr Lys Asp Asn Val Gly Arc Ala He Ala He Val Leu 

=^20 425 ^30 

Asp Gly Val Val Tyr Ser Ala Pro Asn Val Asn Asp Glu He Thr Gly 

^35 4 40 445 

Gly Arg Ser Glr. He Ser Gly His Phe Thr Val Glu Glu Ala Glv Asp 

450 455 460 

Leu Ala Asn Val Leu Asn Ser Gly Lys Met Asp Ala Thr Val Ser He 
465 470 475 480 

Glu Gin Glu Asn Val He Gly Pro Thr Leu Gly Ala Glu Ser He Lys 

485 490 495 

Ala Gly Phe Leu Ser Phe Leu Leu Ala Leu Val He Leu Met Cys Tyr 

500 505 510 

Met Cys Leu Ala Tyr Gly Phe Leu Pro Gly Leu He Ala Asn Gly Ala 

515 520 525 

Leu He Val Asn Ser Phe Phe Thr Leu Gly Val Leu Ala Ser Phe His 

530 535 540 

Ala Val Leu Thr Leu Ser Gly He Ala Gly Leu Val Leu Thr Leu Glv 
40 545 550 555 560 

Met Ala Val Asp Ala Asn Val Leu He Phe Glu Arg He Lys Glu Glu 

565 570 575 

Leu Arg Ala Gly Lys Thr Pro He Arg Ala Val Thr Asp Gly Tyr Gly 
.p. 580 585 590 

Asn Ala Phe Ser Ala He Phe Asp Ser Asn Val Thr Thr He He Thr 

595 600 605 

Gly He He Leu Phe Leu Tyr Gly Thr Gly Pro He Arg Gly Phe Ala 

610 615 ' 620 

Thr Thr Leu He He Gly Leu He Ala Ser Phe He Thr Ala Val Ph- 
50 625 630 635 640 

Leu Thr Arg He Val Phe Glu Lys Leu Ala Lys Lys Gly Arg Leu Asp 

^^5 650 655 

Lys He Thr Phe Thr Thr Ser He Thr Arg Asn Leu Leu Val Asn Pro 
660 665 670 

er Tyr Asn He Leu Gly Lys Arg Lys Thr Gly Phe He He Pro Val 
675 680 685 

He He He Val Leu Gly Leu He Ala Ser Phe Thr He Gly Leu Asn 

690 695 700 

Arg Gly He Glu Phe Ser Gly Gly Arg Asn Tyr Val Val Lys Phe Asp 
60 '?05 710 715 720 

Gin Pro Val Ser Ser Glu A3 a Val Arg Ser Ala Leu Ser Ser Pro Leu 

"^25 730 735 

Gin Glu Lys Val Leu Val Thr Ser He Gly Thr Glu Gly Thr Glu Val 
^P- "^-JO 745 750 

DD Arg He Ser Thr Asn Tyr Lys He Gin Glu Glu Ser Glu Glu Thr Glu 
755 760 765 

Ala Glu He Thr Asp Lys Leu Tyr Gin Ser Leu Lys Gly Phe Tvr Thr 

770 775 780 

Gin Gin Pro Thr Ala Asp Gin Phe Leu Asp Asn He He Ser Ser Gin 
/U 785 790 795 80n 

Lys Val Ser Pro Ser Met Ser Ser Asp He Thr Arg Gly Ala He Trp 

805 810 815 

Ala Val Leu Leu Ser Met He Phe Met Ala He Tyr He Leu He Ara 

820 825 830 

Phe Arg Asp He Ser Phe Ser Ala Gly Val Phe Val Ser Val Ala AJ a 
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10 



15 



20 



45 



50 



55 



60 



65 



70 



75 







835 










B40 










845 








Thr 


Thr 

850 


Phe 


Cys 


He 


He 


Ala 
855 


Leu 


Tyr 


Ala 


Leu 


Leu 
860 


Trp 


Lys 


He 


Leu 


Pro 


Phe 


Thr 


Met 


Glu 


He 


Asp 


Gin 


Asn 


Phe 


He 


Ala 


Ala 


He 


Leu 


Ala 


865 










870 










875 










880 


He 


He 


Gly 


Tyr 


Ser 
885 


Leu 


Asn 


Asp 


Thr 


Val 
890 


Val 


Val 


Phe 


Asp 


Arg 
895 


He 


Arg 


Glu 


Thr 


Met 
900 


Lys 


Leu 


Tyr 


Pro 


Asn 
905 


Arg 


Asp 


Arg 


Tyr 


Gin 
910 


Val 


He 


Asn 


Asp 


Ala 
915 


Leu 


Asn 


Ser 


Thr 


Leu 
920 


Gly 


Arg 


Thr 


Leu 


Asn 
925 


Thr 


Ser 


Leu 


Thr 


Thr 


Phe 


He 


Val 


Met 


Leu 


Val 


He 


Phe 


He 


Phe 


Gly 


Gly Ala 


Thr 




930 










935 










940 










Met 


Arg 


Ser 


Phe 


Thr 


Phe 


Ser 


He 


Leu 


Leu 


Gly 


He 


Val 


He 


Gly 


Thr 


9'15 










950 










955 










960 


Tyr 


Ser 


Thr 


Leu 


Phe 
965 


Val 


Ala 


Thr 


Pro 


Leu 
970 


Ala 


Tyr 


Glu 


He 


Gin 
975 


Lys 


Arg 




Leu 


Asn 
980 




Ala 


Ala 


Lys 


Lys 
985 

















(2} INFORMATION FOR SEQ ID NO: 300 



(i) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 104 6 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) J10LECULE TYPE: protein 

30 (iii) HYPOTHETICAL: YES 

tvi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

35 (ix) FEATURE: 

(A) HA! IE/ KEY: mis cofeature 

(B) LOCATION 1. . .104 6 



40 





(xi) SEQUENCE DESCRIPTION: SEQ : 


ID NO: 300 










Asp 


Phe 


Pro 


Trp 


Val 


Arg 


Val 


Lys 


Pro 


Glu 


Lys 


Lys 


Arg 


Lys 


Gin 


His 


1 








5 










10 










15 




Asn 


Ser 


Asn 


Asn 


Phe 


Lys 


Phe 


Asn 


Ala 


Lys 


Glu 


Lys 


Ser 


Met 


Lys 


Arg 








20 










25 










30 






Met 


Thr 


Leu 


Phe 


Phe 


Leu 


Cys 


Leu 


Leu 


Thr 


Ser 


He 


Gly 


Trp 


Ala 


Met 






35 










40 










45 








Ala 


Gin 


Asn 


Arg 


Thr 


Val 


Lys 


Gly 


Thr 


Val 


He 


S-r 


Ser 


Glu 


Asp 


Asn 




50 










55 










60 










Glu 


Pro 


Leu 


He 


Gly 


Ala 


Asn 


Val 


Val 


Val 


Val 


Gly 


Asn 


Thr 


Thr 


He 


65 










70 










75 










80 


Gly Ala 


Ala 


Thr 


Asp 


Leu 


Asp 


Gly 


Asn 


Phe 


Thr 


Leu 


Ser 


Val 


Pro 


Ala 










85 










90 










95 




Asn 


Ala 


Lys 


Met 


Leu 


Arg 


Val 


Ser 


Tyr 


Ser 


Gly 


Met 


Thr 


Thr 


Lys 


Glu 








100 










105 










110 






Val 


Ala 


He 


Ala 


Asn 


Val 


Met 


Lys 


He 


Val 


Leu 


Asp 


Pro 


Asp 


Ser 


Lys 






115 










120 










125 






Val 


Leu 


Glu 


Gin 


Val 


Val 


Val 


Leu 


Gly 


Tyr 


Gly 


Thr 


Gly 


Gin 


Lys 


Leu 




130 










135 










140 










Ser 


Thr 


Val 


Ser 


Gly 


Ser 


Val 


Ala 


Lys 


Val 


Ser 


Ser 


Glu 


Lys 


Leu 


Ala 


145 










150 










155 










160 


Glu 


Lys 


Pro 


Val 


Ala 


Asn 


He 


Met 


Asp 


Ala 


Leu 


Gin 


Gly 


Gin 


Val 


Ala 










165 










170 










175 




Gly 


Met 


Gin 


Val 


Met 


Thr 


Thr 


Ser 


Gly 


Asp 


Pro 


Thr 


Ala 


Val 


Ala 


Ser 








180 










185 










190 






Val 


Glu 


He 


His 


Gly 


Thr 


Gly 


Ser 


Leu 


Gly 


Ala 


Ser 


Ser 


Ala 


Pro 


Leu 






195 










200 










205 








Tyr 


He 


Val 


Asp 


Gly 


Met 


Gin 


Thr 


Ser 


Leu 


Asp 


Val 


Val 


Ala 


Thr 


Met 




210 










215 










220 










Asn 


Pro 


Asn 


Asp 


Phe 


Glu 


Ser 


Met 


Ser 


Val 


Leu 


Lys 


Asp 


Ala 


Ser 


Ala 


225 










230 










235 










240 


Thr 


Ser 


He 


Tyr 


Gly 


Ala 


Arg 


Ala 


Ala 


Asn 


Gly 


Val 


Val 


Phe 


He 


Gin 










245 










250 










255 




Thr 


Lys 


Lys 


Gly 


Lys 


Met 


Ser 


Gl u 


Arg 


Gly 


Arg 


He 


Thr 


Phe 


Asn 


Ala 








260 










265 










270 






Ser 


Tyr 


Gly 


He 


Ser 


Gin 


He 


Leu 


Asn 


Thr 


Lys 


Pro 


Leu 


Asp 


Asn 


Met 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 







275 










280 










285 








! let 


Thr 


Gly 


Asp 


Glu 


Leu 


Leu 


Asp 


Phe 


Gin 


Va 1 


Lys 


Ala 


Gly 


Phe 


Trp 




2 90 










2 95 










300 










Gly 


Asn 


As n 


Gin 


Thr 


Val 


Gin 


Lys 


Val 


Lys 


Asp 


Met 


He 


Leu 


Ala 


Gly 


305 










310 










315 










320 


Al a 


Gl u 


Asp 


Leu 


Tyr 


Gl y 


Asn 


Tyr 


Asp 


Ser 


Leu 


Lys 


Asp 


Glu 


Tyr 


Gly 










325 










330 










335 




Lys 


Thr 


Leu 


Phe 


Pro 


Va 1 


Asp 


Phe 


Asn 


His 


Asp 


Ala 


Asp 


Trp 


Leu 


Lys 








3-3 0 










34 5 










350 






Al a 


L&u 


Pne 


T - "c? 
Xjy S 


Thr 


Ml a 


Pro 


Thr 


Ser 


Gin 


Gly 


Asp 


i 1 e 


Ser 


Phe 


Ser 






355 










360 










365 








Gly 


Gly 


Ser 


Gin 


Gly 


Thr 


Ser 


Tyr 


Tyr 


Al a 


Ser 


He 


Gly 


Tyr 


Phe 


Asp 




370 










375 






380 










Gl n 


Gl u 


Gly 


Met 


Ala 


Arg 


Glu 


Pro 


Ala 


Asn 


Phe 


Lys 


Arg 


Tyr 


Ser 


Gly 


38 5 










390 










395 










4 00 


Arg 


Leu 


Asn 


Phe 


Glu 


Ser 


Arg 


He 


Asn 


Gl u 


Trp 


Leu 


Lys 


Val 


Gly 


Ala 










4 05 










410 










415 




Asn 


Leu 


Ser 


Gly 


Al a 


He 


Ala 


Asn 


Arg 


Arg 


Ser 


Ala 


Asp 


Tyr 


Phe 


Gly 








420 










425 










430 






Lys 


Tyr 


Tyr 


I let 


Gly 


Ser 


Gly 


Thr 


Phe 


Gly 


Val 


Leu 


Thr 


Met 


Pro 


Arg 






^ 35 










44 0 










4 45 








Tyr 


Tyr 


Asn 


Pro 


Phe 


Asp 


Val 


Asn 


Gly 


Asp 


Leu 


Ala 


Asp 


Val 


Tyr 


Tyr 




•3 50 










4 55 










4 60 










M €; t 


Tyr 




Al a 


Thr 


Arg 


Pro 


Ser 


I let 


Thr 


Glu 


Pro 


Tyr 


Phe 


Al a 


Lys 


'J 65 










4 7 0 










4 7 5 










4 80 


1 1 e t 


Arg 


Pro 


Phe 


Ser 


Ser 


Glu 


Ser 


His 


Gin 


Al a 


Asn 


Val 


Asn 


Gly 


Phe 










4 8 5 










4 90 










4 95 




Al a 


Gin 


lie 


Thr 


Pro 


He 


Lys 


Gly 


Leu 


Thr 


Leu 


Lys 


Ala 


Gin 


Al a 


Gly 








5 00 










505 










510 






Va.1 


Asp 


lie 


Thr 


Asn 


Thr 


Arg 


Thr 


Ser 


Ser 


Lys 


Arg 


Met 


Pro 


Asn 


Asn 






515 










520 










525 








Pro 


T:,-r 


Asp 


Ser 


Thr 


Pro 


Leu 


Gly 


Glu 


Arg 


Arg 


Glu 


Arg 


Ala 


Tyr 


Arg 




530 










535 










540 










Asp 


Yal 


Ser 


Lys 


Ser 


Phe 


Thr 


Asn 


Thr 


Ala 


Glu 


Tyr 


Lys 


Phe 


Ser 


He 


54 5 










550 










555 










560 


Asp 


Glu 


Lys 


His 


Asp 


Leu 


Thr 


Ala 


Leu 


Met 


Gly 


His 


Glu 


Tyr 


He 


Glu 










565 










570 










575 




Tyr 


Glu 


Gly 


Asp 


Val 


He 


Gly 


Ala 


Ser 


Ser 


Lys 


Gly 


Phe 


Glu 


Ser 


Asp 








580 










585 










590 






Lys 


Leu 


Met 


Leu 


Leu 


Ser 


Gin 


Gly 


Lys 


Thr 


Gly 


Asn 


Ser 


Leu 


Ser 


Leu 






595 










600 










605 








Pro 


Glu 


His 


Arg 


Val 


Ala 


Glu 


Tyr 


Ala 


Tyr 


Leu 


Ser 


Phe 


Phe 


Ser 


Arg 




610 










615 










620 










Phe 


Asn 


Tyr 


Gly 


Phe 


Asp 


Lys 


Trp 


Met 


Tyr 


He 


Asp 


phe 


Ser 


Val 


Arg 


625 










630 










635 










64 0 


Asn 


Asp 


Gin 


Ser 


Ser 


Arg 


Phe 


Gly 


Ser 


Asn 


Asn 


Arg 


Ser 


Ala 


Trp 


Phe 










64 5 










650 










655 




Tyr 


Ser 


V d 1 


Gly 


G 1 y 


I let 


Phe 


Asp 


He 


Tyr 


Asn 


Lys 


Phe 


He 


Gin 


Glu 








660 










665 










67 0 






Ser 


Asn 


Trp 


Leu 


Ser 


Asp 


Leu 


Arg 


Leu 


Lys 


Met 


Ser 


Tyr 


Gly 


Thr 


Thr 
















680 










685 








Gly 


Asn 


Ser 


Glu 


He 


Gly 


Asn 


Tyr 


Asn 


His 


Gin 


Al a 


Leu 


Va 1 


Thr 


Val 




6^0 










C Q t 

o y D 










7 00 










Asn 


As n 


1 yr 


Thr 


uX U 


Asp 


Ala 


Met 


Gly 


Leu 


Ser 


He 


Ser 


Thr 


Al a 


Gly 


T n 










Tin 










715 










72 0 


Asn 


Pro 


Asp 


Leu 


Ser 


Trp 


Glu 


Lys 


Gin 


Ser 


Gin 


Phe 


Asn 


Phe 


Gly 


Leu 










725 










730 










7 35 




Ala 


Ala 


Gly 


Al a 


Phe 


Asn 


Asn 


Arg 


Leu 


Ser 


Ala 


Glu 


Val 


Asp 


Phe 


Tyr 








7 4 0 










74 5 










7 50 






Va 1 


Arg 


Thr 


Thr 


Asn 


Asp 


Met 


Leu 


He 


Asp 


Val 


Pro 


Met 


Pro 


Tyr 


He 






7 55 










760 










765 








Ser 




Phe 


Phe 


Ser 


Gl n 


Tyr 


Gin 


Asn 


Val 


Gly 


Ser 


Met 


Lys 


Asn 


Thr 














7 7 5 










7 80 










Gly 


Val 


Asp 


Leu 


Ser 


Leu 


Lys 


Gly 


Thr 


He 


Tyr 


Gin 


Asn 


Lys 


Asp 


Trp 


785 










790 










7 95 










800 


Asn 


Val 


Tyr 


Ala 


Ser 


Ala 


Asn 


Phe 


Asn 


Tyr 


Asn 


Arg 


Gin 


Glu 


He 


Thr 










805 










810 










815 




Lys 


Leu 


Phe 


Phe 


Gly 


Leu 


Asn 


Ly^ 


Tyr 


I let 


Leu 


Pro 


Asn 


Thr 


Gly 


Thr 








820 










82 5 










830 






lie 


Trp 


Glu 


He 


Gly 


Tyr 


Pro 


Asn 


Ser 


Phe 


Tyr 


Met 


Ala 


Glu 


Tyr 


Ala 






835 










840 










845 








Gly 


lie 


Asp 


Lys 


Lys 


Thr 


Gly 


Lys 


Gin 


Leu 


Trp 


Tyr 


Val 


Pro 


Gly 


Gin 




8 50 










855 










860 










Val 


Asp 


Ala 


Asp 


Gly 


Asn 


Lys 


Val 


Thr 


Thr 


Ser 


Gin 


Tyr 


Ser 


Ala 


Asp 


865 










870 










075 










880 
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25 



30 



Leu 


Glu 


Thr 


Arg 


He 


Asp 


Lys 


Ser 


Val 


Thr 


Pro 


Pro 


He 


Thr 


Gly 


Gly 










885 










8 90 










895 




Fhe 


Ser 


Leu 


Gly 


Ala 


Ser 


Trp 


hys 


Gly 


Leu 


Ser 


Leu 


Asp 


Ala 


Asp 


Phe 








900 










905 










910 






Ala 


Tyr 


He 


Val 


Gly 


Lys 


Trp 


Met 


lie 


Asn 


Asn 


Asp 


Arg 


Tyr 


Phe 


Thr 






915 










920 










925 








Glu 


Asn 


Ala 


Gly 


Gly 


Leu 


Het 


Gin 


Leu 


Asn 


Lys 


Asp 


Lys 


Met 


Leu 


Leu 




930 










935 










940 










Asn 


Ala 


Trp 


Thr 


Glu 


Asp 


Asn 


Lys 


Glu 


Thr 


Asp 


Val 


Pro 


Lys 


Leu 


Gly 


945 










950 










955 










960 


Gin 


Ser 


Pro 


Gin 


Phe 


Asp 


Thr 


His 


Leu 


Leu 


Glu 


Asn 


Ala 


Ser 


Phe 


Leu 










965 










97 0 










975 




Arg 


Leu 


Lys 


Asn 


Leu 


Lys 


Leu 


Thr 


Tyr 


Val 


Leu 


Pro 


Asn 


Ser 


Leu 


Phe 








980 










985 










990 






Ala 


Gly 


Gin 


Asn 


Val 


He 


Gly 


Gly 


Ala 


Arg 


Val 


Tyr 


Leu 


Het 


Ala 


Arg 






995 










1000 








1005 






Asn 


Leu 


Leu 


Thr 


Val 


Thr 


Lys 


Tyr 


Lys 


Gly 


Phe 


Asp 


Pro 


Glu 


Ala 


Gly 




1010 








1015 








1020 








Gly 


Asn 


Val 


Gly 


Lys 


Asn 


Gin 


Tyr 


Pro 


Asn 


Ser 


Lys 


Gin 


Tyc 


Val 


Ala 



10 



15 



20 1025 1030 1035 1040 

Gly He Gin Leu Ser Phe 
1045 



f2) IIIFORMATIOII FOR SEQ ID 110:301 

(i) SEQUEIJCE CHARACTERISTICS: 

(A) LEMGTH: 869 amino acids 

(B) TYPE: amino acid 
[D) TOPOLOGY: linear 

(ii) IIOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



35 (vi) ORIGINAL SOURCE: 

(A) ORGAI II SH: Po rphyromonas gingivals 

(i/i) FEATURE: 

(A) MAllE/KEY: mi sc_f eature 
40 (B) LOCATION 1...869 

ixl) SEQUEIJCE DESCRIPTIOIl: SEQ ID HO: 301 



45 


Trp 


His 


Arg 


Asn 


He 


Phe 


He 


Phe 


Ala 


Ser 


Thr 


Phe 


Ser 


Pro 


Lys 


Asn 


1 








5 










10 










15 






Met 


Leu 


Pro 


Leu 
20 


Pro 


Tyr 


Arg 


Tyr 


Ala 

25 


Lys 


Thr 


Glu 


His 


Leu 
30 


Phe 


Leu 




Ala 


Lys 


Gly 


Tyr 


Cys 


Lys 


Asn 


Pro 


He 


Thr 


Asn 


He 


He 


He 


Leu 


Phe 


50 






35 










40 










45 








Met 


Lys 
50 


Lys 


Lys 


Asn 


Phe 


Leu 

55 


Leu 


Leu 


Gly 


He 


Phe 
60 


Val 


Ala 


Leu 


Leu 




Thr 


Phe 


He 


Gly 


Ser 


Het 


Gin 


Ala 


Gin 


Gin 


Ala 


Lys 


Asp 


Tyr 


Phe 


Asn 




65 










70 










75 










80 


55 


Phe 


Asp 


Glu 


Arg 


Gly 


Glu 


Ala 


Tyr 


Phe 


Ser 


Phe 


Lys 


Val 


Pro 


Asp 


Arg 










85 










90 










95 






Ala 


Val 


Leu 


Gin 
100 


Glu 


Leu 


Ala 


Leu 


He 
105 


Met 


Ser 


He 


Asp 


Glu 
HO 


Phe 


Asp 




Pro 


Val 


Thr 


Asn 


Glu 


Ala 


He 


Ala 


Tyr 


Ala 


Ser 


Glu 


Glu 


Glu 


Phe 


Glu 


60 






lis 










120 










125 








Ala 


Phe 
130 


Leu 


Arg 


Tyr 


Gly 


Leu 
135 


Lys 


Pro 


Thr 


Phe 


Leu 
140 


Thr 


Pro 


Pro 


Ser 




Met 


Gin 


Arg 


Ala 


Val 


Glu 


Met 


Phe 


Asp 


Tyr 


Arg 


Ser 


Gly 


Glu 


Lys 


Tyr 




145 










150 










155 










160 


65 


Gl u 


Trp 


Asn 


Ala 


Tyr 


Pro 


Thr 


Tyr 


Glu 


Ala 


Tyr 


He 


Ser 


Met 


Met 


Glu 








165 










170 










175 






Glu 


Phe 


Gin 


Thr 
180 


Lys 


Tyr 


Pro 


Ser 


Leu 
185 


Cys 


Thr 


Thr 


Ser 


Val 
190 


He 


Gly 




Lys 


Ser 


Val 


Lys 


Asp 


Arg 


Lys 


Leu 


Het 


He 


Cys 


Lys 


Leu 


Thr 


Ser 


Ser 


70 






195 










2 00 










205 








Ala 


Asn 
210 


Thr 


Gly 


Lys 


Lys 


Pro 
215 


Arg 


Val 


Leu 


Tyr 


Thr 
220 


Ser 


Thr 


Met 


His 




Gly A£p Glu 


Thr 


Thr 


Gly 


Tyr 


Val 


Val 


Leu 


Leu 


Arg 


Leu 


He 


Asp 


His 




225 










230 










235 










240 


75 


Leu 


Leu 


Ser 


Asn 


Tyr 


Glu 


Ser 


Asp 


Pro 


Arg 


He 


Lys 


Asn 


He 


Leu 


Asp 










245 










250 








255 
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Lys Thr Glu Val Trp He Cys Pro Leu Thr Asn Pro Asp Gly Ala Tyr 

260 265 . 270 

Arg Ala Gly Asn His Thr Val Gin Gly Ala Thr Arg Tyr Asn Ala Asn 
275 280 285 

5 Asn Val Asp Leu Asn Arg Asn Phe Lys Asp Asp Val Ala Gly Asp His 
290 295 300 

Pro Asp Gly Lys Pro Trp Gin Pro Glu Ala Thr Ala Phe Met Asp Leu 
305 310 315 320 

Glu Gly A^n Thr Ser Phe Val Leu Gly Ala Asn lie His Gly Gly Thr 
10 325 330 335 

Glu Val Vjil Asn Tyr Pro Trp Asp Asn Lys Lys Glu Arg His Ala Asp 

3-10 345 350 

Asp Glu Trp Tyr Lys Leu He Ser Arg Asn Tyr Ala Ala Ala Cys Gin 
355 360 365 

15 Ser He Ser Ala Ser Tyr Met Thr Ser Glu Thr Asn Ser Gly He 11^ 

370 375 380 

Asn Gly Ser Asp Trp Tyr Val He Arg Gly Ser Arg Gin Asp Asn Ala 
3S5 390 395 400 

Asn Tyr Phe His Arg Leu Arg Glu He Thr Leu Glu He Ser Asn Thr 

^05 410 415 

Lys Leu Val Pro Ala Ser Gin Leu Pro Lys Tyr Trp Asn Leu Asn Lys 

420 425 430 

Glu Ser Leu Leu Ala Leu He Glu Glu Ser Leu Tyr Gly He His Gly 

435 440 445 

Thr Val Thr Ser Ala Ala Asn Gly Gin Pro Leu Lys Cys Gin He Leu 

450 455 460 

He Glu Asn His Asp Lys Arg Asn Ser Asp Val Tyr Ser Asp Ala Thr 
465 470 475 480 

Thr Gly Tyr Tyr Val Arg Pro He Lys Ala Gly Thr Tyr Thr Val Lys 

485 490 495 

Tyr Lys Ala Glu Gly Tyr Pro Glu Ala Thr Arg Thr He Thr lie Lys 

SOO . 505 510 

Asp Lys Glu Thr Val He Ilet Asp He Ala Leu Gly Asn Ser Val Pro 

515 520 525 

Leu Pro Val Pro Asp Phe Thr Ala Ser Pro Met Thr He Ser Val G1-- 

530 535 540 

Glu Ser Val Gin Phe Gin Asp Gin Thr Thr Asn Asn Pro Thr Asn Trp 
545 550 555 560 

Glu Trp Thr Phe Glu Gly Gly Gin Pro Ala Ilet Ser Thr Glu Gin Asn 

565 570 575 

Pro Leu Val Ser Tyr Ser His Pro Gly Gin Tyr Asp Val Thr Leu Lys 

580 585 590 

Val Trp Asn Ala Ser Gly Ser Asn Thr He Thr Lys Glu Lys Phe He 

595 600 605 

Thr Val Asn Ala Val Met Pro Val Ala Glu Phe Val Gly Thr Pro Thr 

6^0 615 620 

Glu He Glu Glu Gly Gin Thr Val Ser Fhe Gin Asn Gin Ser Thr Asn 
625 630 635 64 0 

Ala Thr Asn Tyr Val Trp He Phe Asp Gly Gly Thr Pro Ala Thr Ser 

645 650 655 

Glu Asp Glu Asn Pro Thr Val Leu Tyr Ser Lys Ala Gly Gin Tyr Asp 

660 665 670 

Val Thr Leu Lys Ala He Ser Ala Ser Gly Glu Thr Val Lys Thr Lys 

675 680 685 

Glu Lys Tyr He Thr Val Lys Lys Ala Pro Val Pro Ala Pro Val Ala 

690 695 700 

Asp Phe Glu Gly Thr Pro Arg Lys Val Lys Lys Gly Glu Thr Val Thr 
'^Ob 710 715 720 

Pfie Lys Asp Leu Ser Thr Asn Asn Pro Thr Ser Trp Leu Trp Val Phe 

725 730 735 

Glu Gly Gly Ser Pro Ala Thr Ser Thr Glu Gin Asn Pro Val Val Thr 

74 0 7 45 7 50 

Tyr Asn Glu Thr Gly Lys Tyr Asp Val Gin Leu Thr Ala Thr Asn Glu 

755 760 765 

Gly Gly Ser Asn Val Lys Lys Ala Glu Asp Tyr He Glu Val He Leu 

770 775 780 

Asp Asp Ser Val Glu Asp He Val Ala Gin Thr Gly He Val He Arn 
785 790 795 800 

Pro Gin Asn Gly Thr Lys Gin He Leu He Glu Ala Asn Ala Ala He 

805 810 815 

Lys Ala He Val Leu Tyr Asp He Asn Gly Arg Val Val Leu Lys Thr 

820 825 830 

Thr Pro Asn Gin Leu Arg Ser Thr Val Asp Leu Ser He Leu Pro Glu 
_c 840 845 

/O Gly He Tyr Thr He Asn He Lys Thr Glu Lys Ser Ala Arg Thr Glu 
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25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



wo 99/29870 
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223/490 



850 855 860 

Lys lie His lie Gly 
865 

5 (2) IN FORMAT lOI I FOR SEQ ID 110:302 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids^ 

(B) TYPE: amino acid 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



15 



50 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivali^ 



ti:-:) FEATURE: 
20 (A) NAJIE/KEY: misc_feature 

(B) LOCATION 1 ... 106 



25 





iy.i) 


SEQUENCE DESCRIPTION: SEQ ID NO: 3 02 












Leu 


Ser 


Tyr 


Ser 


Gly 


Glu 


Ser 


Asp 


Ala 


Lys 


Glu 


Ser 


Asp 


Gin 


Asn 


cys 


1 






5 










10 










15 




Arq 


Lys 


Cys 


Thr 


Phe 


He 


Gly 


Phe 


Glu 


Lys 


Arg 


Val 


Asn 


Thr 


Met 


Arg 






2 0 










25 










30 






Leu 


lie 


Lys 


Ala 


Phe 


Leu 


Val 


Gin 


Leu 


Leu 


Leu 


Leu 


Pro 


He 


Phe 


Phe 






35 










40 










45 








Tyr 


Lys 


Arg 


Phe 


He 


Ser 


Pro 


Leu 


Thr 


Pro 


Pro 


Ser 


Cys 


Arg 


Phe 


Thr 




50 










55 










60 










Pro 


Ser 


Cys 


Ser 


Ser 


Tyr 


Ala 


He 


Glu 


Ala 


Leu 


Arg 


Lys 


Tyr 


Gly 


Pro 


65 








70 










75 










80 


Gly 


Lys 


Gly 


Leu 


Leu 


Leu 


Ser 


He 


Lys 


Arg 


He 


Leu 


Arg 


Cys 


His 


Pro 






85 










90 










95 




Trp 


Gly 


Gly 


Ser 


Gly 


Tyr 


Asp 


Pro 


Val 


Pro 




















100 










105 

















30 



35 



40 (2) INFORMATION FOR SEQ ID NO: 303 

(i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 amino acids 

(B) TYPE: amino acid 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vil ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 



(ix) FEATURE: 
55 (A) N7\1^E/KEY: misc_feature 

(B) LOCATION 1 . . .8 61 

{>ix) SEQUEI'ICE DESCRIPTION: SEQ ID NO: 303 



60 


Leu 
1 


Glu 


Met 


Ala 


Tyr 
5 


Asp 


Phe 


Thr 


Gin 


Thr 
10 


Phe 


Arg 


Asn 


Ser 


Leu 
15 


Glu 




Tyr 


Ser 


Tyr 


Gin 


Glu 


Ala 


Thr 


Arg 


Leu 


Gly 


Val 


Val 


Ala 


Val 


Thr 


Gin 






20 










25 










30 








Asp 


Met 


Leu 


Val 


Leu 


Gly 


He 


He 


Arg 


Asp 


Gly 


Asp 


Asn 


Gly 


Ala 


He 


65 




35 








40 










45 










Asp 


He 
50 


Met 


Arg 


His 


Tyr 


Gly 
55 


He 


Asn 


Leu 


Tyr 


Glu 

60 


Leu 


Lys 


Arg 


Leu 




li- 


Glu 


Leu 


Glu 


Ala 


He 


Ala 


Glu 


Ser 


Leu 


Pro 


Ala 


Ser 


Pro 


Glu 


Gly 




es 










7 0 










75 










80 


70 


Ser 


Pro 


He 


Phe 


Thr 
85 


Pro 


Ser 


Ala 


Arg 


Glu 
90 


Ala 


He 


Asp 


Asp 


Ala 
95 


Thr 




Asp 


He 


Cys 


Ala 
100 


Asp 


Met 


Glu 


Asp 


Glu 
105 


Ala 


Val 


Ser 


Pro 


Val 
110 


His 


Leu 




Leu 


Leu 


Ser 


He 


Leu 


Asn 


Ser 


Thr 


Gin 


Glu 


Ser 


Leu 


Val 


Gin 


Lys 


He 


75 






115 










120 










125 
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Phe 


Met 


Lys 


Gin 


Gly 


He 


Lys 


Tyr 


Asp 


Thr 


He 


Leu 


Ser 


Asp 


Tyr 


Phe 




130 










135 










140 










Gly 


GJ n 


Arg 


Asn 


Pro 


Ser 


Gl u 


Gly 


Lys 


Ser 


Pro 


Ser 


Glu 


Met 


Glu 


He 


145 










150 










155 










160 


Leu 


Asp 


Gly 


Tyr 


Gin 


Asp 


Asn 


Asp 


Phe 


Asp 


Asp 


Glu 


Glu 


Asp 


Glu 


Ser 










165 










170 










1 75 




Ser 


Pro 


Pro 


Ser 


Gly 


Asn 


Ser 


Gly 


Thr 


Gly 


Gly 


Gly 


Ser 


Gly 


Asp 


Al a 








180 










185 










1 90 






Pro 


Glu 


Gin 


Asn 


Thr 


Gly 


Gly 


Gly 


Asp 


Thr 


Thr 


Thr 


Thr 


Thr 


Arg 


Ser 






195 










200 










205 








Gly 


Gly 


Asp 


Thr 


Pro 


Ala 


Leu 


Asp 


Thr 


Phe 


Gly 


Thr 


Asp 


He 


Thr 


Ala 




210 










215 










220 










I -let 


Ala 


Ala 


Ala 


Gly 


Lys 


Leu 


Asp 


Pro 


Val 


Val 


Gly 


Arg 


Glu 


Gin 


Gl u 


22 5 










230 










235 










240 


He 


Gl u 


Arg 


Val 


He 


Gin 


He 


Leu 


Ser 


Arg 


Arg 


Lys 


Lys 


Asn 


Asn 


Pro 










245 










250 










255 




Val 


Leu 


He 


Gly 


Glu 


Pro 


Gly 


Val 


Gly 


Lys 


Ser 


Ala 


He 


Val 


Glu 


Gly 








260 










265 










270 






Leu 


Ala 


Glu 


Arg 


He 


Val 


Asn 


Arg 


Lys 


Val 


Ser 


Arg 


He 


Leu 


Phe 


Asp 






275 










280 










285 








Lys 


Arg 


He 


lie 


Ser 


Leu 


Asp 


Leu 


Ala 


Gin 


Met 


Val 


Ala 


Gly 


Thr 


Lys 




2 90 










295 










3 0 Ci 










Tyr 


Arg 


Gly 


Gin 


Phe 


Glu 


Glu 


Arg 


Leu 


Lys 


Ala 


Val 


Leu 


Asp 


Glu 


Leu 


305 










310 










315 










320 


Lys 


Lys 


Asn 


Pro 


Gin 


He 


He 


Leu 


Phe 


He 


Asp 


Glu 


He 


His 


Thr 


He 










325 










330 










335 




Val 


Gly 


Al a 


Gly 


Ser 


Ala 


Ala 


Giy 


Ser 


Met 


Asp 


Thr 


Ala 


Asn 


Met 


Leu 








340 










345 










350 






Ly^ 


Fro 


Ala 


Leu 


Ala 


Arg 


Gly 


Gin 


Val 


Gin 


C*'s 


He 


Gly 


Ala 


Thr 


Thr 






355 










360 










365 








Leu 


Asp 


Glu 


Tyr 


Arg 


Lys 


Asn 


He 


Glu 


Lys 


Asp 


Gly 


Ala 


Leu 


Glu 


Arg 




370 










375 










380 










Arg 


Phe 


Gin 


Lys 


Val 


Pro 


He 


Ala 


Pro 


Ser 


Thr 


Ala 


Glu 


Glu 


Thr 


Leu 


385 










390 










395 










400 


Thr 


He 


Leu 


Gin 


Asn 


He 


Lys 


Glu 


Lys 


Tyr 


Glu 


Asp 


Tyr 


His 


Gly 


Val 










4 05 










410 










415 




Arg 


Tyr 


Thr 


Asp 


Glu 


Ala 


He 


Lys 


Ala 


Ala 


Val 


Gl u 


Leu 


Thr 


Asp 


Arg 








420 










425 










430 






Tyr 


Val 


Ser 


Asp 


Arg 


Phe 


Phe 


Pro 


Asp 


Lys 


Ala 


He 


Asp 


Ala 


Met 


Asp 






435 










440 










445 








Glu 


Ala 


Gly 


Ala 


Ser 


Val 


His 


He 


Thr 


Asn 


Val 


Val 


Ala 


Pro 


Lys 


Glu 




4 50 










455 










460 










He 


Glu 


He 


Leu 


Glu 


Ala 


Glu 


Leu 


Ala 


Ser 


Val 


Arg 


Glu 


Asn 


Lys 


Leu 


4 65 










4 7 0 










4 75 










4 80 


Ser 


Ala 


Val 


Lys 


Ala 


Gin 


Asn 


Tyr 


Glu 


Leu 


Ala 


Ala 


Ser 


Phe 


Arg 


Asp 










4 85 










4 90 










4 95 




Gin 


Glu 


Arg 


Arg 


Thr 


Gin 


Gin 


Gin 


He 


Ala 


Glu 


Glu 


Lys 


Lys 


Lys 


Trp 








500 










505 










510 






Glu 


Glu 


Gin 


Met 


Ser 


Lys 


His 


Arg 


Glu 


Thr 


Val 


Asp 


Glu 


Asn 


Val 


Val 






515 










520 










525 








Al a 


His 


Val 


Val 


Ala 


Leu 


Met 


Thr 


Gly 


Val 


Pro 


Ala 


Glu 


Arg 


Leu 


Ser 




5 30 










535 










540 










Thr 


Gly 


Glu 


Gly 


Glu 


Arg 


Leu 


Arg 


Thr 


I-Ist 


Ala 


Asp 


Asp 


Leu 


Lys 


Thr 


545 










550 










555 










560 


Lys 


Val 


Val 


Gly 


Gin 


Asp 


Thr 


Ala 


He 


Glu 


Lys 


Met 


Val 


His 


Ala 


He 










565 










57 0 










57 5 




Gl n 


Arg 


Asn 


Arg 


Leu 


Gly 


Leu 


Arg 


Asn 


Glu 


Lys 


Lys 


Pro 


He 


Gly 


Ser 








580 










585 










590 






Phe 


Leu 


Phe 


Leu 


Gly 


Pro 


Thr 


Gly 


Val 


Gly 


Lys 


Thr 


Tyr 


Leu 


Ala 


Lys 






595 










600 










605 








Lys 


Leu 


Ala 


Glu 


Tyr 


Leu 


Phe 


Glu 


Asp 


Glu 


Asn 


Ala 


Met 


He 


Arg 


Val 




610 










615 










620 












I let 


Ser 


Gl u 


Tyr 


i''ie t 


Ui. U 


Lys 


Phe 


Ser 


Val 


Ser 


Arg 


Leu 


Val 


Gly 


625 










630 










635 










64 0 


Ala 


Pro 


Pro 


Gly 


Tyr 


Val 


Gly 


Tyr 


Glu 


Glu 


Gly 


Gly 


Gin 


Leu 


Thr 


Glu 










645 










650 










655 




Arg 


Val 


Arg 


Arg 


Lys 


Pro 


Tyr 


Ser 


Val 


Val 


Leu 


Leu 


Asp 


Glu 


He 


Glu 








660 










665 










67 0 






Ly^ 


Ala 


His 


Ala 


Asp 


Val 


Phe 


Asn 


Leu 


Leu 


Leu 


Gl n 


Val 


Met 


Asp 


Glu 






675 










680 










685 








Gly 


Gin 


Leu 


Thr 


Asp 


Ser 


Leu 


Gly 


Arg 


Arg 


Val 


Asn 


Phe 


Lys 


Asn 


Thr 




690 










695 










700 










Val 


He 


He 


He 


Thr 


Ser 


Asn 


Val 


Gly 


Thr 


Arg 


Gin 


Leu 


Lys 


Asp 


Phe 


705 










710 










715 










720 


Giy 


Gin 


Gly 


He 


Gly 


Phe 


Arg 


Ser 


Glu 


Lys 


Asp 


Glu 


Glu 


Ala 


Asn 


Lys 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 
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10 



20 



25 



45 



50 



55 



60 



65 



70 



75 











725 










730 


Glu 


His 


Ser 


Arg 


Ser 


Val 


lie 


Gin 


Lys 


Ala 








7^0 










745 




Pro 


Glu 


Phe 


Leu 


Asn 


Arg 


Leu 


Asp 


Asp 


He 






755 










760 






Gly 


Lys 


Thr 


Glu 


He 


Arg 


Arg 


Met 


Val 


Asp 




770 










775 








Leu 


Air, 


Arcj 


He 


His 


Arg 


Ala 


Gly 


Tyr 


Asp 


785 










790 










Ala 


Lys 


Asp 


Val 


He 


Ala 


Thr 


Lys 


Gly 


Tyr 










805 










810 


Arg 


Fro 


Leu 


Lys 


Arg 


Thr 


Leu 


Gin 


Asn 


Glu 








820 










825 




A-p 


Leu 


He 


Leu 


Ser 


Gly 


Gin 


He 


Glu 


Lys 






335 










840 






Ser 


Al« 


Arg 


Asp 


Gly 


Glu 


Tie 


He 


Val 


Gin 







735 


Lys 


Thr 


Phe 




750 




Phe 


Asp 


Gin 


765 






Leu 


Lys 


Ala 


Leu 


Thr 


Asp 


Gin 


Tyr 


Gly 






815 


Asp 


Arg 


Leu 




830 




Thr 


Leu 


Thr 



700 
Leu Val 

795 800 



15 335 840 84^ 

;iu Gin AIe 

850 " ' 855 860 



(2) IMFORMATIOII FOR SEQ ID MO: 304 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LEIiGTH: 4 56 amino acid^ 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) IIOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



30 (vi) ORIGIIiAL SOURCE: 

{A) ORGAMISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) HAl IE/KEY: misc_feature 
35 (B) LOCATIOM 1 . . .456 

SEOUEMCE DESCRI PTIOIf : SEQ ID 110:304 



40 



Arg 


Gly 


Gly 


Gin 


He 


Arg 


Arg 


His 


His 


Thr 


Asp 


Ser 


Ser 


Arg 


Gly 


Ser 


1 


















10 










15 




Asp 


Ser 


Lys 


Ala 


Cys 


Arg 


Phe 


Ser 


Phe 


His 


Ala 


Glu 


Thr 


He 


Gly 


Phe 








20 










25 










30 






Ser 


Asn 


His 


Gin 


Arg 


Ala 


Lys 


Met 


Asn 


Tyr 


Leu 


Tyr 


He 


Leu 


He 


Thr 






35 










40 










45 








Leu 


Leu 


Leu 


Ser 


Gly 


Phe 


Phe 


Ser 


Gly 


Ala 


Glu 


He 


Ala 


Phe 


Leu 


Ser 




50 










55 










60 










Ser 


Asp 


Lys 


Leu 


Arg 


Leu 


Glu 


Leu 


Asp 


Arg 


Asn 


Arg 


Gly 


Asp 


Leu 


Thr 


65 










70 










75 










80 


•31 y 


Arg 


Al a 


Leu 


Asn 


Leu 


Leu 


Tyr 


Arg 


His 


Pro 


Asp 


Gin 


Leu 


Val 


Thr 










85 










90 










95 




Thr 


Leu 


Leu 


Val 


Gly 


As n 


Asn 


He 


Val 


Leu 


Val 


Val 


Tyr 


Gly 


Leu 


Leu 








100 










105 










110 






I let 


Ala 


Gly 


Leu 


Leu 


Ala 


Ala 


Pro 


Leu 


Ala 


Gin 


Trp 


He 


Asp 


Asn 


Asp 






115 










120 










125 








Ala 


net 


He 


Val 


Val 


Leu 


Gin 


Ser 


Val 


Leu 


Ser 


Thr 


He 


He 


He 


Leu 




130 










135 










140 










Phe 


Thr 


Gly 


Glu 


Phe 


Leu 


Pro 


Lys 


Ala 


He 


Phe 


Lys 


Thr 


Asn 


Ala 


Asn 


145 










150 










155 










160 


Met 


Met 


Met 


Arg 


Val 


Phe 


Ala 


Leu 


Pro 


He 


Val 


Ala 


He 


Tyr 


Tyr 


Leu 










165 










170 










175 




Leu 


Tyr 


Pro 


Leu 


Ser 


Lys 


Leu 


Phe 


Thr 


Gly 


Leu 


Ser 


Arg 


Ser 


Phe 


He 








180 










185 










190 






Arg 


Leu 


Val 


Asp 


Lys 


Asn 


Tyr 


Val 


Pro 


Thr 


Thr 


Val 


Gly 


Leu 


Gly 


Arg 






195 










2 00 










205 








Val 


Asp 


Leu 


Asp 


His 


Tyr 


Leu 


Ala 


Glu 


Asn 


Met 


Ser 


Gly 


Glu 


Asn 


Glu 




21 0 










215 










2 20 










Gin 


Asn 


Asp 


Leu 


Thr 


Thr 


Glu 


Val 


Lys 


He 


He 


Gin 


Asn 


Ala 


Leu 


Asp 


2 2 5 










230 










235 










24 0 


Phe 


Ser 


Gly 


He 


Gin 


Val 


Arg 


Asp 


Cys 


Met 


He 


Pro 


Arg 


Asn 


Glu 


Met 










245 










250 










255 




He 


Ala 


Cys 


Glu 


Leu 


Gin 


Thr 


Asp 


He 


Glu 


Val 


Leu 


Lys 


Thr 


Thr 


Phe 








260 










265 










270 






He 


Asp 


Thr 


Gly 


Leu 


Ser 


Lys 


He 


He 


He 


Tyr 


Arg 


Gin 


Asn 


He 


Asp 






275 










280 










285 








Asp 


Val 


Val 


Gly 


Tyr 


He 


His 


Ser 


Ser 


Glu 


Met 


Phe 


Arg 


Gly 


Gin 


Asp 
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290 










295 




Trp 


Gin 




Arg 


He 


Asn 


Thr 




305 










310 






Ala 


Asn 


Lys 


Leu 


Met 


Arg 


Leu 


O 










325 








He 


Val 


He 


Asp 
34 0 


Glu 


Leu 


Gly 




Asp 


Leu 


Val 
355 


Glu 


Glu 


He 


Phe 


10 


hrg 


Lys 
370 


He 


lie 


Ala 


Lys 


Gin 
375 




Gly 


Arg 


Met 


Glu 


He 


Asp 


Asp 




385 










390 






Pro 


Glu 


Ser 


Asp 


Asp 


Tyr 


Leu 


15 










405 






His 


Gin 


Asn 


He 
420 


Pro 


Gin 


Ala 




Thr 


Phe 


Thr 


He 


Leu 


Arq 


Ser 


20 






435 










Met 


Ser 
4 50 


lie 


Asp 


Asp 


Gin 


Ser 
455 



300 



Thr 


Val 


Phe 


Val 


Pro 


Glu 


Ser 


1 let 


Tyr 








315 










320 


Leu 


Met 


Gin 


Arg 


Lys 


Lys 


Ser 


He 


Al a 






330 










335 




Gly 


Thr 


Ala 


Gly 


Met 


Va 1 


Thr 


Leu 


Glu 




34 5 










350 






Gly 


Asp 


He 


Glu 


Asp 


Glu 


His 


Asp 


Thr 


360 










365 








Leu 


Gly 


Pro 


His 


Thr 


Tyr 


Leu 


Val 


Ser 










380 










Val 


Asn 


Glu 


Arg 


Phe 


Gly 


Leu 


Ser 


Leu 








395 










4 0 0 


Thr 


Val 


Ala 


Gly 


Phe 


He 


Leu 


Asn 


Ser 






410 










415 




As n 


Glu 


Val 


Val 


Glu 


He 


Ala 


Pro 


Tyr 




425 










430 






Ser 


Ser 


Thr 


Lys 


He 


Glu 


Leu 


Val 


Lys 


440 










445 








Asn 



















(2) IIJFORMATIOIJ FOR SEQ ID HO: 305 

25 (i) SEQUEllCE CHARACTERISTICS: 

(A) LENGTH: 299 amino acids 

(B) TYPE: amino acid 
( D ) TO POLOG V : linear 

30 (ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 



ivi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Porphyromonas gingival is 



40 



(ix) FEATURE: 

(A) JIATIE/KEY: misc_feature 

(B) LOCATIOIJ 1 . . .299 

(xi) SEQUEMCE DESCRIPTION: SEQ ID MO: 305 





Leu 


Lys 




Glu 


He 


Thr 


Met 


Lys 


Gin 


Asn 


Tyr 


Phe 


Lys 


Arg 


Val 


Cys 


45 


1 








5 










10 










15 




Ser 


Leu 


Leu 


Trp 

2 0 


Leu 


Val 


Leu 


Pro 


Met 
25 


Leu 


He 


Met 


Pro 


Leu 

30 


Glu 


Val 




Ala 


Ala 


Gin 
35 


Glu 


He 


He 


Pro 


Asn 
4 0 


Glu 


Gl u 


Val 


Leu 


Glu 

45 


Ser 


Leu 


Thr 




Phe 


Val 


Ala 


Pro 


Val 


Glu 


Glu 


Thr 


Asp 


Ala 


He 


Glu 


Ala 


Glu 


Val 


Glu 


50 




50 










55 








60 












Ala 


Leu 


Gin 


Glu 


He 


Val 


Ala 


Thr 


Glu 


Glu 


He 


Ala 


Glu 


Gin 


Ala 


Val 




65 










70 










75 










80 




Arg 


Ser 


Tyr 


Thr 


Tyr 


Thr 


Val 


Tyr 


Arg 


Asp 


Gly 


Val 


Lys 


He 


Ala 


Ser 


55 










85 










90 










95 




Gly 


Leu 


Thr 


Glu 
100 


Pro 


Thr 


Phe 


Leu 


Asp 
105 


Glu 


Asp 


Val 


Pro 


Ala 
110 


Gly 


Glu 




His 


Thr 


Tyr 
115 


Cys 


Val 


Glu 


Val 


Gin 
120 


Tyr 


Gin 


Gly 


Gly 


Val 
125 


Ser 


Asp 


Lys 


60 


Val 


Cys 


Val 


Asp 


Val 


Glu 


Val 


Lys 


Asp 


Phe 


Lys 


Pro 


Val 


Thr 


Asn 


Leu 




130 










135 








140 












Thr 


Gly 


Thr 


Ala 


Ser 


Asn 


Asp 


Glu 


Val 


Ser 


Leu 


Asp 


Trp 


Asp 


Gly 


Val 




145 










150 










1 55 










160 




Glu 


Glu 


Lys 


Ala 


Glu 


Glu 


Pro 


Ala 


Ser 


Asp 


Lys 


Ala 


Val 


Ser 


Tyr 


Asn 


65 










165 










170 










175 




Val 


Tyr 


Lys 


Asn 
180 


Gly 


Thr 


Leu 


He 


Gly 
185 


Asn 


Thr 


Ala 


Glu 


Thr 
190 


His 


Tyr 




Val 


Glu 


Thr 

195 


Gly 


Val 


Ala 


Asn 


Gly 
200 


Thr 


Tyr 


He 


Tyr 


Glu 
205 


Val 


Glu 


Val 


70 


Lys 




Pro 


Asp 


Gly 


Val 


Ser 


Pro 


Lys 


Val 


Ala 


Val 


Thr 


Val 


Thr 


Va i 




210 










215 








220 












Thr 


Asn 


Ser 


Ser 


Leu 


Ser 


Asn 


Val 


Asp 


Gly 


Gin 


Ala 


Pro 


Tyr 


Thr 


Leu 




225 










230 










235 










24 0 




Arg 


Val 


Glu 


Gly 


Lys 


Lys 


He 


He 


Ala 


Glu 


Ala 


His 


Gly 


Met 


He 


Thr 


75 










245 










250 










255 




Leu 


Tyr 


Asp 


He 


Asn 


Gly 


Arg 


Thr 


Val 


Ala 


Val 


Ala 


Fro 


Asn 


Arg 


Leu 
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260 265 210 

Glu Tyr Met Ala Gin Thr Gly Phe Tyr Ala Val Arg Phe Asp Val Gly 

275 280 285 

Asn Lys His His Val Ser Lys He Gin Val Arg 
5 290 295 

(2) ItlFORllATIOM FOR SEQ ID MO:306 

(i) SEQUEIICE CHARACTERISTICS: 
10 (A) LEDGTH: 377 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



15 



20 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIIIAL SOURCE: 

(A) ORGANISM: Porphyromona s gingiva lis 



(i::) FEATURE: 

(A) IJAME/KEY: mi sc_f ea ture 

(B) LOCATIOM 1 ... 377 

25 {;:i} SEQUENCE DESCRI PTIOIJ : SEQ ID HO: 30"; 



30 



Leu 


Phe 


Asn 


His 


Lys 


Lys 


Ser 


Trp 


Tyr 


Glu 


Thr 


Phe 


Gin 


Phe 


Tyr 


Leu 


1 








5 










10 










15 




Val 


Val 


Phe 


Arg 


Ser 


Gly 


Phe 


He 


Leu 


Leu 


Cys 


Glu 


Asn 


Thr 


Leu 


Ala 








20 










25 










30 






Gin 


Gin 


Lys 


Thr 


Glu 


Glu 


Phe 


Ala 


Pro 


Val 


Ser 


Asp 


Leu 


Arg 


Ala 


Glu 






35 










40 










45 








Ala 


Tyr 


Gly 


Ser 


Thr 


Val 


Phe 


Leu 


His 


Trp 


Thr 


Pro 


Pro 


Tyr 


Asp 


Asn 




50 










55 










60 










Pro 


Met 


He 


Pro 


Leu 


Ser 


Glu 


Ser 


Phe 


Glu 


Ser 


Gly 


He 


Pro 


Ala 


He 


65 










70 










75 










80 


Trp 


Lys 


Thr 


He 


Asp 


Ala 


Asp 


Gly 


Asp 


Gly 


Tyr 


Asn 


Trp 


Met 


His 


Leu 










85 










90 










95 




Thr 


Asn 


Phe 


Thr 


Gly 


Gin 


Ser 


Gly 


Leu 


Cys 


Val 


Ser 


Ser 


Ala 


Ser 


Tyr 








100 










105 










110 






He 


Gly 


Gly 


Val 


Gly 


Ala 


Leu 


Thr 


Pro 


Asp 


Asn 


Tyr 


Leu 


He 


Thr 


Pro 






115 










120 










125 








Glu 


Leu 


Lys 


Leu 


Pro 


Thr 


Asp 


Ala 


Leu 


Val 


Glu 


He 


He 


Tyr 


Trp 


Val 




130 










135 










140 










Cys 


Thr 


Gin 


Asp 


Leu 


Thr 


Ala 


Pro 


Ser 


Glu 


His 


Tyr 


Ala 


Val 


Tyr 


Ser 


145 










150 










155 










160 


Ser 


Ser 


Thr 


Gly 


Asn 


Asn 


Ala 


Ala 


Asp 


Pile 


Val 


Asn 


Leu 


Leu 


Tyr 


Glu 










165 










170 










175 




Glu 


Thr 


Leu 


Thr 


Ala 


Lys 


Arg 


He 


Gin 


Ser 


Pro 


Glu 


Leu 


He 


Arg 


Gly 








180 










185 










190 






Asn 


Arg 


Thr 


Gin 


Gly 


Val 


Trp 


Tyr 


Gin 


Arg 


Lys 


Val 


Val 


Leu 


Pro 


Asn 






195 










200 










205 








Asp 


Thr 


Lys 


Tyr 


Val 


Ala 


Phe 


Arg 


His 


Phe 


Asn 


Ser 


Thr 


Asp 


Asn 


Phe 




210 










215 










220 










Trp 


Leu 


Asn 


Leu 


Asp 


Glu 


Val 


Ser 


He 


Leu 


Tyr 


Thr 


Pro 


Leu 


Pro 


Arg 


225 










230 










2 35 










240 


Arg 


Ala 


Pro 


Cys 


Pro 


His 


Pro 


Gly 


Gly 


Tyr 


Thr 


Tyr 


Ser 


Val 


Phe 


Arg 










2^5 










250 










255 




Asp 


Gly 


Gin 


Lys 


He 


Ala 


Ser 


Gly 


Leu 


Ser 


Ala 


Leu 


Ala 


Tyr 


He 


Asp 








260 










265 










270 






Thr 


Asp 


Val 


Pro 


Tyr 


Gly 


Thr 


Gin 


Asp 


Tyr 


Cys 


Val 


Gin 


Val 


Asn 


Tyr 






275 










280 










285 








Leu 


Gin 


Gly 


A^p 


Ser 


Tyr 


Lys 


Val 


Cys 


Lys 


Asn 


He 


Val 


Val 


Ala 


Asn 




290 










2S5 










300 










Ser 


Ala 


Asn 


He 


Tyr 


Gly 


Ala 


Asp 


Lys 


Pro 


Phe 


Ala 


Leu 


Thr 


Val 


Val 


305 










310 










315 










320 


Gly 


Lys 


Thr 


He 


Val 


Ala 


Ser 


Ala 


Phe 


Lys 


Gly 


Glu 


He 


Thr 


Leu 


Tyr 










325 










330 










335 




Asp 


He 


Arg 


Gly 


Arg 


Leu 


He 


Ala 


Ser 


Gly 


"J 


Asp 


Thr 


Leu 


Arg 


Tyr 








340 










345 










350 






Lys 


Ala 


Glu 


Asn 


Gly 


Phe 


Tyr 


Leu 


He 


Lys 


He 


Gin 


Val 


Asn 


Gly 


Thr 






355 










360 










365 








Val 


Tyr 


Thr 


Glu 


Lys 


He 


Gin 


He 


Gin 


















370 










37 5 
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(2) INFORMATION FOR SEQ ID 140:307 

(i) SEQUEIJCE CHARACTERISTICS: 

(A) LENGTH: 84 9 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) llOLECLfLE TYPE: protein 

10 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

15 (i::) FEATURE: 

(A) NAI^IE/KEY: mi s c_f ea t ure 

( B) LOCATION 1 . . .84 9 



20 



35 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 07 



Phe Gly He Ser Pro Ser Met Lys Lys Ser Phe Leu Leu Ala He Val 
1 5 10 15 

Met Leu Phe Gly He Ala Met Gin Gly His Ser Ala Pro Val Thr Ly." 
2 0 2 5 30 

25 Glu Arg Ala Leu Ser Leu Ala Arc Leu Ala Leu Arg Gin Val Ser Leu 
35 4 0 .15 

Arg Met Gly Gin Thr Ala Val Ser Asp Ly^ He Ser He Asp Tvr Val 

SO 55 60 

Tyr Arg Gin Gly Asp Ala Glu Arg Gly He Thr Ser Gin Glu Glu Gly 
30 65 70 75 80 

Ser Pro Ala Tyr Phe Tyr Val Ala Asn Arg Gly Asn Asn Glu Gly Tyr 

85 90 95 

Ala Leu Val Ala Ala Asp Asp Arg He Pro Thr He Leu Ala Tyr Ser 

100 105 110 

Pro He Gly Arg Phe Asp Met Asp Ser Met Pro Asp Asn Leu Arg Met 

115 120 125 

Trp Leu Gin He Tyr Asp Gin Glu He Gly Leu He Leu Ser Gly Lys 

130 135 140 

Ala Gin Leu Asn Glu Glu He Leu Arg Thr Glu Gly Val Pro Ala Glu 
1^5 150 155 160 

Vai His Ala Leu Met Asp Asn Gly His Phe Ala Asn Asp Pro Met Arg 

165 170 175 

Trp Asn Gin Gly Tyr Pro Trp Asn Asn Lys Glu Pro Leu Leu Fro Asn 
.p. 180 185 190 

45 Gly Asn His Ala Tyr Thr Gly Cys Val Ala Thr Ala Ala Ala Gin He 
155 200 205 

Met Arg Tyr His Ser Trp Pro Leu Gin Gly Glu Gly Ser Phe Asp Tyr 

210 215 220 

His Ala Gly Ser Leu Val Gly Asn Trp Ser Gly Thr Phe Gly Glu Met 
230 235 240 

Tyr Asp Trp He Asn Met Pro Gly Asn Pro Asp Leu Asp Asn Leu Thr 

24S 250 255 

Gin Ser Gin Val Asp Ala Tyr Ala Thr Leu Met Arg Asp Val Ser Ala 

260 265 270 

Ser Val Ser Met Ser Phe Tyr Glu Asn Gly Ser Gly Thr Tyr Ser Val 

275 280 285 

Tyr Val Val Gly Ala Leu Arg Asn Asn Phe Ara Tyr Lys Arg Ser Leu 

290 295 " 300 

Gin Leu His Val Arg Ala Leu Tyr Thr Ser Gin Glu Trp His Asp Met 
60 305 310 315 320 

He Arg Gly Glu Leu Ala Ser Gly Arg Pro Val Tyr Tyr Ala Gly Asn 

325 330 335 

Asn Gin Ser He Gly His Ala Phe Val Cys Asp Gly Tyr Ala Ser Asp 
r^r- 340 345 350 

DO Gly Thr Phe His Phe Asn Trp Gly Trp Gly Gly Val Ser Asn Gly Ph- 
355 360 365 

Tyr Lys Leu Thr Leu Leu Ser Pro Thr Ser Leu Gly He Gly Gly Glu 

3"/0 375 380 

Gly He Gly Phe Thr He Tyr Gin Glu He He Thr Gly He Glu Pro 
385 390 396 400 

Ala Lys Thr Pro Ala Glu Ala Gly Thr Asp Ala Leu Pro He Leu Ala 

405 410 415 

Leu Lys Asp He Glu Ala Glu Tyr Lys Ser Glu Ser Gly Leu Asn Val 

^20 425 430 

Gly Tyr Ser He Tyr Asn Thr Gly Glu Glu Gin Ser Asn Leu Asp Leu 



75 
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435 440 4*15 

Gly Tyr Arg Leu A£:n Lys Ala Asp Gly Glu Val lie Glu Val Lys Thr 

450 455 460 

Ser Ser lie Asn lie Ser Tcp Tyr Gly Tyr Gly Glu His Pro Glu Ser 
5 465 470 475 480 

Phe Ser Leu Ala Pro A^n Gin Leu Ser Gin Gly lie Asn Thr He Thr 

4 85 4 90 4 95 

Leu Leu Tyr Arg Arg Thr Gly Thr Glu Gin Trp Glu Pro Val Arg His 
500 505 510 

10 Ala Gin Gly Gly Tyr Val Asn Ser He Lys Val Asn Thr Thr Asp Pro 
515 520 525 

Asn Asn Val Val Val Thr Val Asp Asn Asn Glu Gly Lys Leu Ser He 

530 535 540 

Val Pro Asn Ser Phe Val Ala Asp Leu Asn Ser Tyr Glu His Ser Thr 
15 545 550 555 560 

He Thr Val Gin Phe Asn Ser Asp Ser Pro Asp Glu He Arg Thr Pro 

565 570 575 

Val Ala Phe Ala Leu Ser Thr Gly Ala Thr Ala Asp Asp Val He Ser 
580 505 590 

20 Leu Gly Trp Val Met Ala Glu Val Pro Gly Gly Ser Ser Asn Tyr Pro 
595 600 605 

Val Val Trp Ser Lys Asp Val Leu Thr Leu Ser Glu Gly Asp Tyr Thr 

610 615 620 

Leu Trp Tyr Arq Phe Ser He Asn Asn Gin Lys Asp Glu Trp Lys Lys 
25 62 5 630 635 6)0 

He Gly Ser Val Ser Val Lys Thr Pro Thr Glu Tyr Thr His Pro Leu 

64 5 650 655 

Phe Glu Val Gly His Asn Gin Tiii: Ser Thr Tyr Thr Leu Asp Met Ala 
660 665 670 

30 His Asn Arg Val Leu Pro Asp Phe Thr Leu Lys Asn Leu Gly Leu Pro 
675 680 685 

Phe Asn Gly Glu Leu Val Val Val Phe Arg Gin Thr Gin Ser Ser Ser 

690 695 700 

Gly Ser Leu Trp Ala Ala Gin Glu Thr Val His He Lys Gin Gly Glu 
35 705 710 715 720 

Thr Phe Val Tyr Lys Pro Val Val Glu Gly Pro He Pro Asp Gly Ser 

725 730 735 

Tyr Arg Ala Thr Leu His Ala Phe Val Asn Gly Gin Gin Gin Leu Tyr 
740 745 750 

40 Leu L^-s Gly Lys Arg Asn Tyr Thr Val Lys He Val Asn Gly Thr Ala 
755 760 765 

Val Glu Ala He Glu Ser Ser Glu Glu He Arg Val Phe Pro Asn Pro 

770 775 780 

Ala Arg Asp Tyr Val Glu He Ser Ala Pro Cys He Pro Gin Glu Thr 
45 785 790 795 800 

Ser He He Leu Phe Asp Leu Ser Gly Lys He Val Met Lys Asn Ser 

805 810 815 

Leu Ser Ala Gly His Gly Arg Met Asp Val Ser Arg Leu Pro Asn Gly 
820 825 830 

50 Ala Tyr He Leu Lys Val Asp Gly Tyr Thr Thr Lys He Asn He Val 
835 840 845 

His 

55 (2) inPORIIATlOtl FOR SEQ ID 110:308 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 amino acids 

(B) TYPE: amino acid 
60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



65 



(vi) ORIGINAL SOURCE: 

{A) ORGAtJISH: Porphyromonas gingiva lis 



(ix) FEATURE: 
70 (A) NAUE/hlEY: misc_feature 

(B) LOCATIOIJ 1 ... 295 

(Mi) SEQUENCE DESCRIPTION: SEQ ID HO: 308 

75 Thr Lys Lys Thr Leu Met Lys Lys Leu Phe Leu Ser Leu Thr Ser Leu 
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10 



15 



20 



25 



30 



35 



40 



45 



65 



70 



75 



1 








5 










10 










1 5 




Val 


Met 


Val 


Phe 


Ala 


Val 


Ala 


Ser 


Cys 


Asp 


He 


He 


Asp 


Lys 


Asp 


Gin 








20 










25 










30 




Thr 


Leu 


Leu 


Pro 


Ala 


Pro 


Thr 


Asn 


Val 


Thr 


Pro 


Asp 


Asn 


Pro 


Asp 


Asp 






35 










AO 










4 5 






Asn 


Pro 


Ser 


Glu 


He 


Asp 


He 


Thr 


Gin 


Thr 


His 


Thr 


Glu 


Lvs 


T\T 


Val 




50 










55 










60 






Leu 


Ala 


Glu 


Glu 


Phe 


Thr 


Gly 


Gin 


Ly^ 


Cys 


Leu 


Asn 


Cys 


Pro 


Lys 


Gly 












70 










75 










80 


His 


Arg 


Ly^ 


Leu 


Ala 


Ala 


Leu 


Lys 


Glu 


Gin 


Tyr 


Gly 


Lys 


Arg 


Leu 


Thr 










85 










90 








95 




Val 


Val 


Gly 


He 
100 


His 


Ala 


Gly 


Pro 


Gly 
105 


Ser 


Leu 


Val 


Pro 


Pro 
110 


Leu 


Fhe 


Arg 


Thr 


Glu 
115 


Ala 


Gly 


Asp 


Ala 


Tyr 
120 


Tyr 


Ser 


Lys 


Phe 


Ala 
125 


Asn 


Asn 


Thr 


Pro 


Leu 
130 


Pro 


Ala 


Leu 


Met 


Val 
135 


Ser 


Arg 


Lys 


Lys 


Phe 
140 


Gly 


Ser 


Ser 


T-/r 


Val 


Tyr 


Asp 


Lys 


Ser 


Tyr 


Lys 


Thr 


Trp 


Asp 


Val 


Pro 


He 


Ala 


Glu 


Gin 


1^5 










150 










155 










160 


Met 


Glu 


Gin 


Lys 


Ala 
165 


Lys 


He 


Asn 


He 


Phe 
17 0 


Ala 


Val 


Ala 


Glu 


Tyr 
17 5 


Thr 


Asp 


Thr 


Gin 


Lys 


He 


Lys 


Val 


Thr 


Val 


Lys 


Gly 


L' -s 


He 


Leu 


Glu 


J J. J 








180 










185 








1 90 






Asn 


Tlir 


Leu 

195 


Pro 


Lys 


Ser 


r-)et 


Val 
200 


Gin 


Val 


Tyr 


Leu 


Leu 

2 05 


Glu 


Asp 


Lys 


Leu 


He 
210 


Ala 


Pro 


Gin 


Val 


Asp 
215 


Gly 


Asn 


Thr 


Thr 


Val 

22 0 


Glu 


Asn 


Tyr 


Glu 


His 


A^n 


His 


Val 


Leu 


Arg 


Gly 


Ala 


Val 


Asn 


•31y 


He 


Trp 


Gly 


Glu 


Glu 


S 










230 










235 








24 0 


Phe 


Val 


Asn 


Leu 


Lys 
2^5 


Asp 


Tyr 


Leu 


Tyr 


Thr 
250 


Tyr 


Ala 


Val 


Glu 


Pro 
255 


Leu 


Ser 


Gly 


Met 


Ser 

260 


Phe 


Val 


Ala 


Glu 


Asn 
265 


Tyr 


Ser 


He 


Val 


Ala 

270 


Phe 


Val 


Tyr 


Asp 


Val 


Gin 


Thr 


Phe 


Glu 


Val 


Tyr 


Asp 


Val 


Val 


His 


Val 


Lys 


He 






275 










280 










285 






Asn 


Pro 
290 


Gin 


Ser 


Asp 


Gly 


Lys 
295 





















(2) IMFORMATIOIJ FOR SEQ ID MO: 309 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 230 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

50 (vi) ORIGINAL SOURCE: 

(A) ORGAUISM: Porphyromonas gingivalis 

(i::) FEATURE: 

(A) MWIE/KEY: misc_fsature 
55 {B) LOCATION 1...230 

(::i) SEQUENCE DESCRIPTION: SEQ ID NO: 309 

Thr Asn Lys Lys Glu Glu Thr Met Lys Lys Ser Ser Val Val Ala Ser 
OU 1 5 10 15 

Val Leu Ala Val Ala Leu Val Phe Ala Gly Cys Gly Leu Asn Asn M-r 

^0 25 30 

Ala Lys Gly Gly Leu He Gly Ala Gly Val Gly Gly Ala He Gly Ala 

35 40 45 

Gly Val Gly Asn Val Ala Gly Asn Thr Ala Val Gly Ala He Val Gl"- 

^C' 55 60 

Thr Ala Val Gly Gly Ala Ala Gly Ala Leu He Gly Lys Lys Met Asp 

"70 75 80 

Lys Gin Lys Lys Glu Leu Glu Ala Ala Val Pro Asp Ala Thr He Gin 

8 5 90 95 

Thr Val Asn Asp Gly Glu Ala He Leu Val Thr Phe Asp Ser Gly He 

100 105 110 

Leu Phe Ala Thr Asn Ser Ser Thr Leu Ser Pro Asn Ser Arc Thr Ala 

11- 120 125 

Leu Thr Lys Phe Ala Ala Asn Met Asn Lys Asn Pro Asp Thr Asp He 
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130 135 140 

Arg lie Val Gly His^ Thr Asp Asn Thr Gly Ser Asp Lys lie Asn Asp 
1^5 150 155 160 

Pro Leu Ser Giu Arg Arg Ala Ala Ser Val Tyr Ser Phe Leu Asn Ser 
5 165 no 175 

Gin Gly Val Ser Met Ser Arg Met Ala Ala Glu Gly Arg Gly Ser Hi^ 

180 185 190 

Glu Pro Val Ala Asp Asn Ser Thr Val Ala Gly Arg Ser Ala Asn Arg 
195 200 ' 205 

10 Arg Val Glu Val Tyr lie Leu Pro Asn Ala Lys Met lie Glu Gin Ala 
210 215 220 

Gin Gin Gly Thr Leu Lys 
225 230 

15 (2) INFORMATION FOR SEQ ID MO: 310 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3-3 2 amino acids 

(B) TYPE: amino acid 
20 ( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



25 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(vi) ORIGINAL SOURCE: 

(A) ORGAUISJl: Porphyromonas gingiva lis 



(i:-;) FEATURE: 
30 (A) NAl-lE/KEY: misc_feature 

(B) LOCATION 1 ... 342 



(.Ki) SEQUENCE DESCRIPTION: SEQ ID NO: 310 



Asn 


Arg 


Asn 


Arg 


Asn 


Met 


Ser 


Lys 


Lys 


Ssr 


He 


Leu 


Leu 


Leu 


Cys 


Cys 


1 








5 










10 










15 




Ser 


Leu 


Cys 


Phe 


He 


Ser 


Ala 


Thr 


Lys 


Ala 


Val 


Thr 


Pro 


Vai 


Arg 


Asn 








20 










25 










30 






Val 


Arg 


Asn 


Ser 


Gin 


Val 


Asn 


Ser 


Lys 


Ala 


Lys 


Thr 


Giu 


Arg 


Thr 


Lys 






35 










40 










45 








Pro 


Ser 


Asp 


Ser 


Val 


Arg 


Tyr 


He 


Ser 


Asn 


Met 


He 


Ala 


Asp 


Arg 


Leu 




50 










55 










60 










Glu 


Phe 


Arg 


Asn 


Lys 


He 


Ser 


Ser 


Glu 


Lys 


Glu 


Val 


Arg 


Lys 


Ala 


Glu 


65 










70 










75 










80 


Tyr 


Glu 


Asn 


Arg 


Leu 


Ala 


Met 


Glu 


Ala 


Leu 


Asn 


Tyr 


Pro 


Ala 


He 


Asp 










85 










90 










95 




Leu 


Tyr 


Gly 


Glu 


Asp 


Ser 


Trp 


Ser 


Glu 


Tyr 


Va 1 


Asn 


Pro 


Phe 


Val 


Gly 








100 










105 










110 






Ala 


Gly 


Thr 


Asp 


Val 


Glu 


He 


Pro 


Asn 


Ser 


Tyr 


Asp 


He 


Asp 


Cys 


Ser 






115 










120 










125 








Ser 


Phe 


Val 


Met 


Pro 


Val 


Glu 


Asp 


Lys 


Gin 


Val 


Thr 


Ser 


Gin 


Phe 


Gly 




130 










135 










140 










Tyr 


Arg 


Arg 


Arg 


Phe 


Gly 


Arg 


Met 


His 


Tyr 


Gly 


He 


Asp 


Leu 


Ser 


Vai 


1*15 










150 










155 










160 


Asn 


Arg 


Gly 


Asp 


Thr 


He 


Arg 


Ala 


Ala 


Phe 


Asp 


Gly 


Lys 


Val 


Arg 


Val 










165 










170 










175 




Arg 


Ser 


Tyr 


Glu 


Ala 


Arg 


Gly 


Tyr 


Gly 


Tyr 


Tyr 


He 


Val 


Leu 


Arg 


His 








180 










185 










1 90 






Pro 


Asn 


Gly 


Leu 


Glu 


Thr 


Val 


Tyr 


Gly 


His 


Met 


Ser 


Arg 


Gin 


Leu 


val 






195 










200 










205 








Asp 


Glu 


Asn 


Gin 


He 


Val 


Arg 


Ala 


Gly 


Gin 


Pro 


He 


Gly 


Leu 


Gly 


Gly 




210 










215 










220 










Ser 


Thr 


Gly 


Arg 


Ser 


Thr 


Gly 


Pro 


His 


Leu 


His 


Phe 


Glu 


Thr 


Arg 


Phe 


225 










230 










235 










24 0 


Met 


Gly 


He 


Pro 


He 


Asn 


Pro 


Ser 


Thr 


He 


He 


Asp 


Phe 


Asp 


Asn 


Gly 










2^5 










250 










255 




Val 


Pro 


Leu 


Arg 


Asp 


He 


Tyr 


Thr 


Phe 


Lys 


Arg 


Gly 


Ser 


Asn 


Ser 


Arg 








260 










265 










270 






Tyr 


Ala 


Lys 


Ala 


Ser 


Lys 


Tlir 


Ser 


Ser 


Arg 


Tyr 


Ala 


Lys 


Lys 


Gly 


Lys 






275 










280 










285 








Lys 


Gly 


Arg 


Gin 


Ala 


Ser 


Ser 


Pro 


Met 


Thr 


Tyr 


Arg 


He 


Lys 


Lys 


Gly 




290 










295 










300 










Asp 


Thr 


Leu 


Glu 


Thr 


He 


Ala 


Lys 


Arg 


His 


Gly 


Thr 


Ser 


Val 


Gin 


Lys 


305 










310 










315 










320 


Leu 


Cys 


Ala 


Thr 


Asn 


Gly 


He 


Gly 


Lys 


Ser 


Lys 


He 


Leu 


Thr 


Pro 


Gly 
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325 330 335 

Lys Ala Leu Avg lie Lys 
340 

5 (2) IllFORIIATIOH FOR SEQ ID NO: 311 

(i) SEQUEIJCE CHARACTERISTICS: 

(A) LENGTH: 15 9 amino acids 

(B) TYPE: amino acid 
10 (D) TOPOLOGY; linear 

(ii) tlOLECULE TYPE: protein 

Uii) HYPOTHETICAL: YES 

(vi) ORIGIIIAL SOURCE: 

(A) ORGANISri: Po rphyromonas gingivalis 



15 



(i>:) FEATURE: 
20 {A) NAI-IE/KEY: misc_feature 

(B) LOCATION 1 . . .159 



65 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311 



25 



30 



35 



40 



45 



{!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 95 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

55 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromona s gingivalis 

60 (i::) FEATURE: 

(A) NAI-IE/KEY: mi^c_feature 

(B) LOCATION 1 . . . 395 



Pro 


Ser 


Lys 


Thr 


He 


He 


Lys 


Thr 


Met 


Ala 


L:,-s 


He 


Asn 


Phe 


Tyr 


Ala 


1 








5 










10 










15 




Giu 


•3iV 


Val 


Ser 


Leu 


Pro 


Arg 


He 


Arg 


Arg 


Arg 


He 


Val 


Gly 


Lys 


Trp 








2 0 










25 










3 0 






He 


Ala 


Glu 


Val 


Cys 


Ser 


Arg 


Tyr 


Gly 


Lys 


Ala 


Val 


Gly 


Glu 


He 


Ser 






35 










40 










45 








Tyr 


Leu 


Phe 


Cys 


Asp 


Asp 


Glu 


Tyr 


He 


Leu 


Lys 


Ala 


Asn 


Gin 


Glu 


Phe 




50 










55 










60 










Leu 


A^p 


His 


Asp 


Tyr 


Tyr 


Thr 


Asp 


He 


He 


Thr 


Phe 


Asp 


Ser 


Cys 


Giu 


65 










70 










75 










80 


Ala 


Asp 


Thr 


Val 


Asn 


Gly 


Asp 


Leu 


Leu 


He 


Ser 


Leu 


Asp 


Thr 


Val 


Arg 










85 










90 










95 




Ser 


Asn 


Ala 


Arg 


Ala 


Leu 


Asp 


Leu 


Arg 


Tyr 


Glu 


Asp 


Glu 


Leu 


His 


Arg 








100 










105 










110 






Val 


He 


He 


His 


Gly 


He 


Leu 


His 


Leu 


Cys 


Gly 


Leu 


Lys 


Asp 


Lys 


Ser 






115 










120 










125 








Lys 


Lys 


Asp 


Glu 


Ala 


Gin 


Met 


Arg 


Ala 


Ala 


Glu 


Glu 


Lys 


Ala 


Leu 


Val 




130 










135 










140 










Met 


Leu 


Arg 


Glu 


Thr 


He 


Gly 


Ser 


Glu 


Leu 


Ser 


Leu 


Leu 


His 


Thr 




14S 










150 










155 












(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 312 

















(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312 



Lys Ser Lys Thr Met Lys Val Lys Tyr Leu Met Leu Thr Leu Val Gly 
15 10 15 

Ala He Ala Leu Asn Ala Ser Ala Gin Glu Asn Thr Val Pro Ala Thr 
20 25 30 

70 Gly Gin Leu Pro Ala Lys Asn Val Ala Phe Ala Arg Asn Lys Ala Gly 
35 40 45 

Ser Asn Trp Phe Val Thr Leu Gin Gly Gly Val Ala Ala Gin Phe Leu 

50 55 60 

Asn Asp Asn Asn Asn Lys Asp Leu Met Asp Arg Leu Gly Ala He Gly 
75 65 7 0 7 5 ftCi 
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Ser 


Leu 


Ser 


Va 1 


Gly 


Lys 


Tyr 


His 


Ser 


Pro 


Phe 


Phe 


Ala 


Thr 


Arg 


Leu 












85 










90 










95 






Gin 


lie 


Asm 


Gly 


Gly 


Gin 


Ala 


His 


Thr 


Phe 


Leu 


Gly 


Lys 


Asn 


Gly 


Glu 










100 










105 










110 






5 


Gin 


Glu 


He 


Asn 


Thr 


Asn 


Phe 


Gly 


Ala 


Ala 


His 


Phe 


Asp 


Phe 


Met 


Phe 








115 










120 










125 










A^p 


Val 


Val 


Asn 


Tyr 


Phe 


Ala 


Pro 


Tyr 


Arg 


Glu 


Asn 


Arg 


Phe 


Phe 


His 






130 










135 










140 










10 


Leu 


He 


Pro 


Trp 


Val 


Gly 


Val 


Gly 


Tyr 


Gin 


His 


Lys 


Phe 


He 


Gly 


Ser 


145 










150 










155 










160 




Glu 


Trp 


Ser 


Lys 


Asp 


Asn 


Val 


Glu 


Ser 


Leu 


Thr 


Ala 


Asn 


Val 


Gly 


Val 












165 










170 










175 






Met 


Met 


Ala 


Phe 


Arg 


Leu 


Gly 


Lys 


Arg 


Val 


Asp 


Phe 


Val 


He 


Glu 


Ala 










180 










185 










190 






15 


Gin 


Ala 


Ala 


His 


Ser 


Asn 


Leu 


Asn 


Leu 


Ser 


Arg 


Ala 


Tyr 


Asn 


Ala 


Lys 








195 










200 










205 










Lys 


Thr 


Fro 


Val 


Phe 


Glu 


Asp 


Pro 


Ala 


Gly 


Arg 


Tyr 


Tyr 


Asn 


Gly 


Phe 






210 










215 










220 










20 


Gin 


Gly 


I let 


Ala 


Thr 


Ala 


Gly 


Leu 


Asn 


Phe 


Arg 


Leu 


Gly 


Ala 


Val 


Gly 


225 










230 










235 










240 




Phe 


Asn 


Ala 


He 


Unk 


Pro 


Met 


Asp 


Tyr 


Ala 


Leu 


He 


Asn 


Asp 


Leu 


Asn 












24 5 










250 










255 






Gly 


Gin 


He 


Asn 


Arg 


Leu 


Arg 


Ser 


Glu 


Val 


Glu 


Glu 


Leu 


Ser 


Lys 


Arg 


25 








260 










265 










270 






Pro 


Val 


Ser 


Cys 


Pro 


Glu 


" j' s 


Pro 


Glu 


Val 


Thr 


Pro 


Val 


Thr 


Lys 


Thr 








275 










280 










285 










Gl u 


Asn 


He 


Leu 


Thr 


Glu 


Lys 


Ala 


Val 


Leu 


Phe 


Arg 


Phe 


Asp 


Ser 


His 






2 90 




















300 










30 


Val 


Val 


Asp 


Lys 


Asp 


Gin 


Leu 


He 


Asn 


Leu 


Tyr 


Asp 


Val 


Ala 


Gin 


Phe 


305 










310 










315 










320 




Val 


Lys 


GlTJ 


Thr 


Asn 


Glu 


Pro 


He 


Thr 


Val 


Val 


Gly 


Tyr 


Ala 


Asp 


Pro 












325 










330 










335 






Thr 


Gly 


Asn 


Thr 


Gin 


Tyr 


Asn 


Glu 


Lys 


Leu 


Ser 


Glu 


Arg 


Arg 


Ala 


Lys 


35 








340 










345 










350 






Ala 


Val 


Val 


Asp 


Val 


Leu 


Thr 


Gly 


Lys 


Tyr 


Gly 


Val 


Pro 


Ser 


Glu 


Leu 








355 










360 










365 










lie 


Ser 


Val 


Glu 


Trp 


Lys 


Gly 


Asp 


Ser 


Thr 


Gin 


Pro 


Phe 


Ser 


Lys 


Lys 






370 










37 5 










380 










40 


Ala 


Trp 


Asn 


Arg 


Val 


Val 


He 


Val 


Arg 


Ser 


Lys 












385 










390 










395 













(2) IIJ FORMATION FOR SEQ ID NO: 313 

( i ) SEQUENCE CHARACTERISTICS : 
45 (A) LEMGTH: 387 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



50 



55 



60 



65 



70 



75 



Hi) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

(ix) FEATURE: 

<A) NAflE/KEY: mi sc_f ea t ure 

(B) LOCATIOII 1 . . . 387 





C-ii) 


SEQUENCE DESCRIPTION: SEQ ID NO: 313 










Tyr 


Lys 


Met 


Thr 


Tyr 


Arg 


He 


Met 


Lys 


Ala 


Lys 


Ser 


Leu 


Leu 


Leu 


Ala 


1 


















10 










15 




Leu 


Ala 


Gly 


Leu 


Ala 




Thr 


Phe 


Ser 


Ala 


Thr 


Ala 


Gin 


Glu 


Ala 


Thr 








20 










25 










30 






Thr 


Gin 


Asii 


Lys 


Ala 


Gly 


Met 


His 


Thr 


Ala 


Phe 


Gin 


Arg 


Asp 


Lys 


Ala 






35 










40 










45 








Ser 


Asp 


His 


Trp 


Phe 


lie 


Asp 


He 


Ala 


Gly 


Gly 


Ala 


Gly 


Met 


Ala 


Leu 




50 










55 










60 










Ser 


Gly 


Trp 


Asn 


Asn 


Asp 


Val 


Asp 


Phe 


Val 


Asp 


Arg 


Leu 


Ser 


He 


Val 


65 










70 










75 










80 


Pro 


Thr 


Phe 


Gly 


He 


Gly 


Lys 


Trp 


His 


Glu 


Pro 


Tyr 


Phe 


Gly 


Thr 


Arg 










85 










90 










95 




Leu 


Gin 


Phe 


Thr 


Gly 


Phe 


Asp 


He 


Tyr 


Gly 


Phe 


Pro 


Gin 


Gly 


Ser 


Lys 








100 










105 










HO 
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Glu Arg Asn His Asn Tyr Phe Gly Asn Ala His Leu Asp Phe Met Phe 

lis 120 125 

Asp Leu Thr Asn Tyr Phe Gly Val Tyr Arg Pro Asn Arg Val Ph- His 
J. 130 135 1^0 

O He He Pro Trp Ala Gly He Gly Phe Gly Tyr Lys Phe His Ser Glu 
145 150 155 160 

Asn Ala Asn Gly Glu Lys Val Gly Ser Lys Asp Asp Met Thr Gly Thr 

165 170 175 

Val Asn Val Gly Leu Met Leu Lys Phe Arg Leu Ser Arg Val Val Asp 
10 180 185 190 

Phe Asn He Glu Gly Gin Ala Phe Ala Gly Lys Met Asn Phe He Gly 

195 200 205 

Thr Lys Arg Gly Lys Ala Asp Phe Pro Val Met Ala Thr Ala Gly L-u 
210 215 220 

15 Thr Phe Asn Leu Gly Lys Thr Glu Trp Thr Glu He Val Pro M-t Asp 
225 230 235 240 

Tyr Ala Leu Val Asn Asp Leu Asn Asn Gin He Asn Ser Leu Arg Gl-y 

245 250 255 

Gin Val Glu Glu Leu Ser Arg Arg Pro Val Ser Cys Pro Glu Cvs Pro 
^O 260 ' 265 270 

Glu Pro Thr Gin Pro Thr Val Thr Arg Val Val Val Asp Asn Val Val 

275 280 285 

Tyr Phe Arg He Asn Ser Ala Lys He Asp Arg Asn Gin Glu He Asn 
^- 290 295 300 

Val Tyr Asn Thr Ala Glu Tyr Ala Lys Thr Asn Asn Ala Pro II- L— 
305 310 315 

Val Val Gly Tyr Ala Asp Glu Lys Thr Gly Thr Ala Ala Tyr Asn Met 

325 330 335 

Lys Leu Ser Glu Arg Arg Ala Lys Ala Val Ala Lys Met Leu Glu L'-^s 

340 345 350 

Tyr Gly Val Ser Ala Asp Arg He Thr He Glu Trp Lys Gly Ser Ser 

355 360 365 

Glu Gin He Tyr Glu Glu Asn Ala Trp Asn Arg He Val Val M«t Thr 
„^ 370 375 ' 380 

OO Ala Ala Glu 
385 

(2) iriFORHATIOM FOR SEQ ID NO: 31 4 

40 (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



45 



50 



55 



60 



65 



70 



75 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

( V i ) OR I G I N AL SOU RCE : 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAT-IE/ KEY: mis cofeature 

(B) LOCATION 1 . . .195 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314 



Val 


He 


Gly 


He 


He 


Met 


Glu 


Phe 


Phe 


Met 


Leu 


Phe 


He 


Ala 


Ala 


Val 


1 








5 










10 










15 




Phe 


Val 


Asn 


Asn 
20 


Val 


Val 


Leu 


Ser 


Gin 
25 


Phe 


Leu 


Gly 


He 


C' 'S 

30 


Pro 


Phe 


Leu 


Gly Val 


Ser 


Lys 


Lys 


Val 


Asp 


Thr 


Ser 


He 


Gly 


Met 


Gly Ala 


Ala 


Val 




35 










40 








45 








Thr 


Phe 


Val 


Leu 


Ala 


Leu 


Ala 


Thr 


Leu 


Val 


Thr 


Phe 


Leu 


He 


Gin 




50 










55 










60 








Lys 


Phe 


Val 


Leu 


Asp 


Arg 


Phe 


Gly 


Leu 


Gly 


Phe 


Met 


Gin 


Thr 


He 


Ala 


65 










70 








75 










8 0 


Phe 


He 


Leu 


Val 


He 
85 


Ala 


Ala 


Leu 


Val 


Gin 

90 


Met 


Val 


Glu 


He 


He 
95 


Leu 


LV3 


Lys 


Val 


Ser 


Pro 


Pro 


Leu 


Tyr 


Gin 


Ala 


Leu 


Gly 


Val 


Phe 


Leu 


Pro 








100 










105 








110 






Leu 


He 


Thr 


Thr 


Asn 


Cys 


Cys 


Val 


Leu 


Gly 


Val 


Ala 


He 


Leu 


Val 


He 


Gin 




115 










120 








125 






Lys 
130 


Asp 


Tyr 


Thr 


Leu 


Leu 
135 


Gin 


Ser 


Phe 


Val 


Tyr 
140 


Ala 


He 


Ser 


Thr 
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Ala He Gly Phe Thr Leu Ala Met Val Thr Phs Ala Gl y He Arg Glu 
145 150 155 160 

Gin Leu Asp Met Thr Aeu Leu Pro Lys Ala Met Lys Gly He Pro Ser 
1G5 170 175 

5 Ala Leu Leu Ala Ala Gly He Leu Ala Met Ala Phe Met Gly Phe Ser 
180 185 190 

Gly He Ala 
155 

10 (2) INFORMATION FOR SEQ ID NO: 315 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



20 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival i£ 



(ix) FEATURE: 
25 (A) NAME/ KEY: misc_f sature 

(B) LOCATION 1 ... 876 



(::i) SEQUENCE DESCRIPTION: SEQ ID NO: 315 



Tyr 


Arg 


Ser 


Tyr 


Arg 


Gly 


He 


Gly 


Ser 


Gly 


Thr 


His 


Ser 


Pro 


Asn 


Leu 


1 








5 










10 










15 




Lys 


Asn 


Arg 


Leu 


Lys 


Arg 


He 


Gly 


He 


Arg 


He 


Pro 


Asn 


Arg 


His 


Tyr 








20 










25 










30 






He 


His 


He 


Lys 


Pro 


He 


Lys 


Pro 


Lys 


Asn 


Lys 


Met 


Lys 


Gin 


Leu 


Asn 






35 










40 










45 








He 


He 


Ser 


Phe 


He 


He 


Ala 


Phe 


Leu 


Phe 


Leu 


Gly 


Thr 


Ser 


Ala 


Ser 




50 










55 










60 










Ala 


Gin 


Gin 


Ser 


Gly 


Gly 


Ser 


Val 


Thr 


Gly 


Thr 


Val 


Val 


Asp 


Lys 


Ser 


65 










70 










75 










80 


Ser 


Lys 


Glu 


Pro 


He 


Ala 


Tyr 


Val 


Gin 


Val 


Phe 


Val 


Lys 


Gly 


Thr 


Thr 










85 










90 










95 




Leu 


Gly 


Thr 


Ser 


Thr 


Asp 


Ala 


Asn 


Gly 


Asn 


Tyr 


Ser 


He 


Lys 


Gly 


He 








100 










105 










110 






Pro 


Ser 


Gly 


Asn 


Gin 


Thr 


He 


Val 


Ala 


Arg 


Leu 


Met 


Gly 


Tyr 


Ser 


Thr 






115 










120 










125 








Gys 


Glu 


Glu 


Lys 


Val 


His 


He 


Glu 


Lys 


Gly 


Gly 


Ser 


Arg 


His 


Val 


Asp 




130 










135 










140 










Leu 


Tyr 


Leu 


Thr 


Glu 


Glu 


He 


Leu 


Ser 


Leu 


Asp 


Gly 


Val 


Val 


Val 


Ser 


1^5 










150 










155 










160 


Ala 


Asn 


Arg 


Asn 


Glu 


Thr 


Phe 


Arg 


Arg 


Gin 


Ala 


Pro 


Ser 


Leu 


Val 


Thr 










165 










170 










175 




Va 1 


Leu 


Ser 


Pro 


Glu 


Leu 


Phe 


Leu 


Lys 


Thr 


Asn 


Ser 


Thr 


Asn 


Leu 


Ser 








180 










185 










190 






Gin 


Gly 


Leu 


Lys 


Phe 


Gin 


Pro 


Gly 


Leu 


Arg 


Val 


Glu 


Asp 


Asn 


Cys 


Gin 






195 










200 










205 








Asn 


Cys 


Gly 


Phe 


Asn 


Gin 


Val 


Arg 


He 


Asn 


Gly 


Leu 


Glu 


Gly 


Ala 


Tyr 




210 










215 










220 










Ser 


Gin 


He 


Leu 


He 


Asp 


Ser 


His 


Pro 


He 


Phe 


Ser 


Ser 


Leu 


Ala 


Gly 


225 










230 










235 










240 


Val 


Tyr 


Gly 


Leu 


Glu 


Gin 


Met 


Pro 


Ala 


Asn 


Met 


He 


Glu 


Arg 


Val 


Glu 










245 










2 50 










255 




Val 


He 


Arg 


Gly 


Gly 


Gly 


Ser 


Ala 


Leu 


Phe 


Gly 


Ser 


Asn 


Ala 


Val 


Gly 








260 










265 










270 






Gly 


Val 


He 


Asn 


Val 


He 


Thr 


Lys 


Glu 


Pro 


Leu 


Arg 


Asn 


Ser 


Ala 


Glu 






275 










280 










285 








He 


Ser 


His 


Ser 


Thr 


Met 


Thr 


Phe 


Asp 


His 


Ala 


Lys 


Gly 


Trp 


Gly 


Ser 




2 90 










295 










300 










Phe 


Gin 


Asn 


Thr 


Thr 


Gin 


Phe 


Asn 


Gly 


Ser 


Met 


Leu 


Thr 


Glu 


Asp 


Arg 


305 










310 










315 










320 


Ly^ 


Ala 


Gly 


Val 


Met 


Val 


Phe 


Gly 


Gin 


His 


Asn 


Tyr 


Arg 


Pro 


Gly 


Gin 










325 










330 










335 




Asp 


He 


Asp 


Gly 


Asp 


Asn 


Phe 


Thr 


Glu 


Leu 


Pro 


Asn 


Leu 


Arg 


Asn 


Arg 








340 










345 










350 






Ser 


Leu 


Gly 


Phe 


Arg 


Ser 


Tyr 


Tyr 


Lys 


Thr 


Gly 


Leu 


Tyr 


Ser 


Lys 


Ala 






355 










360 










365 









SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO_9929870A1_ 



wo 99/29870 PCT/AU98/01023 

236/ 490 



Thr 


Leu 


Gl u 


Tyr 


His 


Ser 


!let 


Gin 


Glu 


Tyr 


Arg 


Arg 


Gly 


Gly 


Asp 


Arg 




37 o 










375 










3b 0 










Leu 


Asp 


Asn 


Pro 


Pro 


Phe 


Glu 


Ala 


Gin 


He 


Al a 


Glu 


Tyr 


Leu 


Gin 


His 


385 










3 90 










395 








4 00 


Tyr 


He 


As n 


Gly 


'ji V 


Ser 


Phe 


Lys 


Phe 


Asp 


Gin 


Gly 


Phe 


Ser 


Gly 


Gly 










405 










410 










415 




hys 


Asp 


F'he 


Phe 


Ser 


Leu 


Tyr 


Ala 


Ser 


Ala 


Gin 


Asp 


Val 


Gin 


Arg 


Arg 








4 2 0 










'i25 










430 


Ser 


Tyr 


Tyr 


Gly 


Gly 


Gly 


Asp 


Tyr 


Thr 


Glu 


Asn 


Leu 


Leu 


Asn 


Gly 


Al a 






4 35 










4 40 










445 








Va 1 


Gin 


Ser 


Gly 


Ser 


Thr 


Glu 


Ser 


Asp 


Glu 


Tyr 


Asn 


Asp 


Ala 


Phe 


Thr 




-1 50 










455 










460 








Ala 


Leu 


Thr 


Ser 


Tyr 


Gly 


Thr 


Thr 


Lys 


Gly 


Phe 


Asp 


Leu 


Gin 


Gly 


Gly 


A 65 










470 










•175 






4 80 


Gly 


Met 


Tyr 


Arg 


His 


Thr 


Phe 


Gly 


Glu 


Asn 


Trp 


Asp 


Phe 


Thr 


Gly 


Gly 










4 85 










4 90 










4 95 




Leu 


Gl u 


Tyr 


He 


Tyr 


Gly 


Gin 


Leu 


Asp 


Asp 


Arg 


Ser 


Gly 


Tyr 


Arg 


Pro 








500 










505 










510 




Ser 


Lys 


lie 


Asp 


Gin 


Asn 


Thr 


Ser 


Thr 


Phe 


Ser 


Gin 


Tyr 


Asp 


Gin 


Leu 






515 










520 










525 








Gl u 


Tyr 


Lys 


Thr 


Gl u 


Lys 


Leu 


Ser 


Ala 


Leu 


He 


Gly 


Ala 


Arg 


He 


Asp 




530 










535 










540 








Tyr 


v'al 


Leu 


Leu 


Asn 


Gin 


Asp 


Gly 


Lys 


Arg 


Tyr 


He 


Asp 


Pro 


Leu 


Phe 


Oh ^ 










55 0 










555 








560 


He 


Phe 


Ser 


Pro 


Arg 


Ala 


Asn 


Val 


Arg 


Tyr 


Asn 


Pro 


Asn 


Lys 


Asn 


Leu 










565 










57 0 










575 




Ser 


Phe 


Arg 


Leu 


Ser 


Tyr 


Ser 


Glu 


Gly 


Phe 


Arg 


Ala 


Pro 


Gin 


Tyr 


Phe 








580 










585 








590 




A.-p 


Glu 


Asp 


Leu 


His 


Val 


Glu 


Leu 


Ala 


Gly 


Gly 


Thr 


Pro 


He 


Ser 


Arg 






595 










600 








605 






Val 


Leu 


Ser 


Pro 


Asn 


Leu 


Lys 


Glu 


Glu 


Arg 


Ser 


Arg 


Ser 


He 


Ser 


Ala 




610 










615 










620 










Ser 


Phe 


Asp 


Tyr 


Tyr 


His 


Arg 


Ala 


Asp 


Glu 


Trp 


Gin 


Phe 


Asn 


He 


Met 


625 










630 










635 










64 0 


Gly 


Gl u 


Ala 


Piie 


Ser 


Thr 


Phe 


He 


Ser 


Asn 


Gin 


Phe 


Lys 


Pro 


Ser 


Asp 










64 5 










650 








655 


Lys 


Val 


Glu 


Thr 


Thr 


Ser 


Asp 


Gly 


Lys 


Glu 


Trp 


He 


He 


Arg 


Thr 


He 








660 










665 










670 






Tyr 


Asn 


Asp 


Lys 


Asp 


Gly 


Val 


Ser 


Lys 


Val 


Tyr 


Gly 


Val 


Asn 


Leu 


Glu 






675 










680 






685 








Gly 


Arg 


He 


Ala 


Tyr 


Asn 


Lys 


Ser 


Phe 


Asp 


Leu 


Gin 


Leu 


Gly 


Gly 


Thr 




690 










695 










700 






Trp 


Gin 


Arg 


Ser 


Arg 


Tyr 


Gly 


Ser 


He 


Tyr 


Thr 


Ala 


Val 


Glu 


Ala 


Asp 


7 05 










710 










715 










720 


Lys 


Thr 


Thr 


Gly 


Gin 


Ala 


Glu 


He 


Ser 


Val 


Lys 


Asp 


Tyr 


Val 


Arg 


Thr 










725 










730 










735 




Pro 


Asn 


Leu 


Tyr 


Gly 


Tyr 


Phe 


Val 


Ala 


Thr 


Val 


Arg 


Pro 


Thr 


Glu 


His 








7 4 0 










745 








750 






Phe 


Ala 


He 


Asn 


Leu 


Ser 


Gly 


Thr 


Phe 


Thr 


Gly 


Lys 


Met 


Asp 


Val 


Val 






755 










760 










765 






His 


Gl u 


Ala 


Tyr 


Glu 


Gly 


Asp 


He 


Pro 


Ala 


Glu 


His 


He 


Ala 


Pro 


Asp 




77 0 










775 










780 










Gly 


Ser 


Phe 


Asp 


Phe 


Glu 


Met 


Asn 


Gly 


Gin 


Gin 


Phe 


Lys 


Gly 




Al a 


785 










790 










795 






800 


Glu 


Gly 


His 


Ala 


Lys 


Leu 


Val 


Lys 


Thr 


Pro 


Ala 


Phe 


Ala 


Asp 


He 


Asp 










805 










810 










815 




Leu 


Lys 


Leu 


Ser 


His 


Asp 


Phe 


His 


Leu 


Ala 


Ser 


Thr 


Met 


Thr 


Leu 


Glu 








820 










825 










830 






Leu 


Asn 


Ala 


Gly 


He 


Gin 


Asn 


He 


Phe 


Asn 


Ser 


Tyr 


Gin 


Lys 


Asp 


Thr 






835 










840 










845 




Asp 


Lys 


Gly 


Pro 


Gly 


Arg 


Ala 


Ser 


Thr 


Tyr 


Val 


Tyr 


Gly 


Pro 


Met 


Gin 




850 










855 








860 








Pro 


Arg 


Arg 


He 


Phe 


Val 


Gly 


Thr 


Lys 


He 


Asn 


Phe 










865 










870 








875 












(2) 


ItlFORMATIOIJ 


FOR 


SEQ 


ID 110:316 

















10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



(i) SEQUEIJCE CHARACTERISTICS: 

(A) LENGTH: 8 99 amino acids 
70 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

75 (iii) HYPOTHETICAL: YES 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 PCT/AU98/01023 

237/490 



10 



(vi ) ORIGIDAL SOURCE: 

( A ) ORG Al i 1 Sn : F c r ph y r oriion g i ng i v a 1 i 3 

(i;-:) FEATURt:: 

{A) IIAIlD/hXY : tr.i^c_f eature 
( B) LOCATIOtI 1 ... 899 

(Mi) SEQUEUCE DESCRIFTlOtl: SEQ ID IIO:316 



lie Leu A?n His Leu Arg Lys" Thr Met Tyr Lys Lys lie lie Ala Val 

1 5 10 15 

Ala Ala Leu Phe Cys Ala Ser lie Gly lie Leu hys Gly Gin Ser Ser 
20 25 30 

15 As-p Leu Thr Pro Gin A?p Thr lie Tyr Ser Pro Glu lie Ser Tyr Ala 
35 ^0 45 

hys I'ro I]e His Lys Thr lie Ala Ser lie Glu lie Glu Gly Met Arg 

50 55 60 

Ser Phe A.vp Asp Phe Val Leu Arg Asn Leu Ser Gly Leu Ala Val Gly 

20 65 70 75 eo 

Asp Glu Val Leu lie Pro Gly Asp Ala Met Ser Ala Ala Val Asn Arg 

85 90 95 

lie Met Arg Gin Gly Tyr Piie Ser Asn Val Arg lie lie Ala Asp Lys 
100 105 110 

25 Tyr Val Gly Asn Lys Val Tyr Leu Lys He He Val Thr Glu Arg Pro 
115 120 125 

Arg He Ser Lys Val Thr Phe Ser Gly Val Lys Lys Ser Glu Arg Glu 

130 135 140 

Asp Leu Glu Met Lys He Gly Leu Arg Glu Gly He Gin Met Thr Arg 
30 145 150 155 160 

Asn Asn Glu Asp Lys Val Arg Gin He Val Gin Lys Tyr Phe Ser Glu 

165 170 175 

Lys Gly Tyr Arg Asp Ala Ser He Arg He Thr Gin Glu Pro Asp Leu 
180 185 190 

35 Ser Lys Asp Gly Phe Val Asn Val Leu He Ser He Glu Lys Lys Ser 
195 200 205 

Lys Thr Lys Val Asn Glu He Tyr Phe Ser Gly Asn Lys Ala Leu Ser 

210 215 220 

Asn His Lys Leu Arg Met Ala Met Lys Asn Thr Asn Ala Lys Phe Ser 
40 225 230 235 240 

Leu Arg Lys His He Arg Ser Ser Phe Leu Lys Leu Phe Ser Thr His 

245 250 255 

Lys Phe Val Glu Glu Ser Tyr Arg Glu Asp Leu Val Arg Leu He Glu 
260 265 270 

45 Lys Tyr Gin Glu Tyr Gly Tyr Arg Asp Ala Glu He Leu Thr Asp Ser 
275 280 285 

Val Val Lys Ala Pro Asp Gly Lys Arg Val Asp He Tyr Leu Asn He 

290 295 300 

Glu Glu Gly Gin Lys Tyr Tyr He Lys Asp Val Asn Phe Val Gly Asn 
50 305 310 315 320 

Ser Gin Tyr Pro Ser Glu Tyr Leu Glu Arg Val Leu Gly He Lys Ser 

325 330 335 

Gly Asp Val Tyr Asn Gin Arg Arg Leu Ala Lys Arg Leu Asn Glu Asp 
340 345 350 

55 Glu Asp Ala Val Gly Asn Leu Tyr Tyr Asn Asn Gly Tyr He Phe Ala 
355 360 365 

Trp Val Asp Pro Val Glu Thr Asn Val Val Gly Asp Ser Val Ser Leu 

370 375 3B0 

Asp He Arg He Ala Glu Gly Lys Gin Ala Asn He Asn Lys Val He 
60 385 390 395 400 

He Lys Gly Asn Thr Val Val Tyr Glu Asp Va3 Val Arg Arg Glu Leu 

4 05 410 4 15 

Tyr Thr Lys Pro Gly Gin Leu Phe Ser Arg Glu Asp He He Asn Ser 
420 425 430 

65 He Arg Leu He Asn Gin Leu Gly His Phe Asp Ala Glu Lys Ser He 
4 35 4 40 4 45 

Pro Arg Pro He Pro Asn Pro Glu Thr Gly Thr Val Asp He Glu Tyr 

4 50 4 55 4 60 

Asp Leu Val Pro Arg Ser Ser Asp Gin Leu Glu Leu Ser Val Gly Trp 
70 465 470 475 480 

Ser Gin Ser Gly Leu Leu Phe Arg Gly Ala He Lys Phe Thr Asn Phe 

485 490 495 

Ser Val Gly Asn Leu Leu His Pro Ser Met Tyr Lys Lys Gly He He 
500 505 510 

75 Pro Gin Gly Asp Gly Gin Thr Leu Ser Leu Ser Ala Gin Thr Asn Gly 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDCXID: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



238/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



70 







515 










52 0 










52 5 










Tyr 


Tyr 


Gin 


Gin 


Tyr 


Ser 


Val 


Thr 


PJie 


Met 


Asp 


Pre 


Trp 


Phe 


Gly 




530 










535 










5^0 








Gly 


Lyr 


Arg 


Pro 


Asp 


[•let 


Phe 


Ser 


Fhe 


Ser 


Ala 


Phe 


Tyr 


Ser 


Lys 


Thir 












550 










555 








560 


Thr 


Ala 


lie 


Asp 


Ser 


Lys 


Phe 


Tyr 


Asn 


Ser 


Asn 


Ala 


Gly 


Asn 


Tyr 


Tyr 










565 










57 0 








575 




M£-n 


Ai a 


Tyr 


Tyr 


Asn 


Ser 


Tyr 


Tyr 


Asn 


As n 


Tyr 


Asn 


Ser 


Tyr 


Tyr 


Asn 








580 










585 










590 




Gly 


Met 


Ser 


Asn 


Tyr 


Thr 


Gly 


Asp 


Leu 


Tyr 


Thr 


Gin 


Ala 


Ser 


Asp 


Pre 






5 95 










600 










605 








Asp 


Arg 


Ser 


Leu 


Gin 


Het 


Leu 


Gly 


Thr 


Ser 


He 


Gly 


Tyr 


Gly 


Lys 


Arg 




61 0 










615 










620 






Leu 


Thr 


Trp 


Pro 


Asp 


Asn 


Trp 


Phe 


Gin 


He 


Tyr 


Thr 


Ser 


Leu 


Asn 


Tyr 


625 










630 










635 










640 


Thr 


Tyr 


Tyr 


Arg 


Leu 


Arg 


Asn 


Trp 


Ser 


Tyr 


Asn 


Thr 


Phe 


Gin 


Asn 


Phe 










64 5 










650 










655 




His 


His 


Gly 


Ser 


Ala 


Asn 


Asp 


Leu 


Asn 


Leu 


Gl u 


Leu 


Arg 


Leu 


Ser 


Arg 








660 










665 










670 




Thr 


Ser 


lie 


Asp 


Asn 


Pro 


He 


Tyr 


Thr 


Arg 


Ser 


Gly 


Ser 


Asp 


Phe 


Met 






67 5 










680 










685 








Val 


Ser 


Val 


Ala 


Ala 


Thr 


Leu 


Pro 


Tyr 


Ser 


Leu 


Trp 


Asp 


Asn 


His 


Asp 




690 










695 










700 






Tyr 


Ala 


Ser 


Gin 


Asn 


Leu 


Ser 


Val 


Ser 


Asp 


Arg 


Tyr 


Arg 


Phe 


He 


Glu 


705 










710 










715 








720 


Tyr 


His 


Lys 


Trp 


Lys 


Phe 


Arg 


Gly 


Arg 


Val 


Phe 


Thr 


Pro 


Leu 


Leu 


Asn 










725 










730 










735 




Pro 


Ala 


Thr 


His 


Lys 


Tyr 


Thr 


Pro 


Val 


Leu 


Met 


Ser 


Arg 


Val 


Glu 


Gly 








740 










745 










750 




Ala 


Val 


Leu 


Gly 


Ser 


Tyr 


Asn 


Ser 


Asn 


Lys 


Lys 


Ser 


Pro 


Phe 


Gly 


Thr 






755 










760 










765 






Fhe 


Tyr 


Het 


Gly 


Gly 


Asp 


Gly 


Met 


Ser 


Ser 


Tyr 


Tyr 


Gly 


Gly 


Tyr 


Met 




770 










775 










780 






Asn 


Glu 


Thr 


He 


Gly 


Leu 


Arg 


Gly 


Tyr 


Lys 


Asn 


Gly 


Ser 


He 


Ala 


Gly 


7 85 










790 










795 








800 


Asn 


Asn 


Tyr 


Asp 


Tyr 


Ala 


Tyr 


Ala 


Tyr 


Met 


Arg 


Leu 


Thr 


Met 


Glu 


Leu 










805 










810 










815 




Arg 


Phe 


Pro 


He 


Leu 


Phe 


Glu 


Asn 


Ser 


Phe 


Asn 


Ala 


Trp 


Leu 


Leu 


Ala 








820 










825 








830 






Phe 


Ala 


Glu 


Ala 


Gly 


Asn 


Ala 


Trp 


Arg 


Ser 


He 


Asp 


Asn 


Tyr 


Asn 


Pro 






835 










840 










845 






Phe 


Asn 


Leu 


Lys 


Arg 


Ser 


Ala 


Gly 


Val 


Gly 


Leu 


Arg 


Val 


Thr 


Leu 


Pro 




850 










855 










860 










Met 


Val 


Gly 


Met 


Leu 


Gly 


He 


Asp 


Trp 


Gly 


Tyr 


Gly 


Phe 


Asp 


Arg 


Pro 


865 










870 










875 








880 


Asp 


Asn 


Ser 


Leu 


Gin 


Arg 


Gly 


Gly 


Ser 


Asn 


Val 


His 


Phe 


Val 


Leu 


Gly 










885 










890 










895 




Gl ri 


Glu 


Phe 





























(2) INFORj-lATION FOR SEQ ID HO: 317 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 amino acids 
55 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

60 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAhUSM: Porphyromonas gingivalif 

65 (ix) FEATURE: 

{A) NAI^E/KEY: misc_feature 

(B) LOCATION I ... 177 



(Ki) SEQUENCE DESCRIPTION: SEQ ID NO: 317 



Gin He Asn Met Asn Gly Asp Met Lys Arg Phe Leu He Leu He Gly 

15 10 15 

Phe Ala Leu Ala Val Ala Phe Ser Gly Phe Ser Gin Lys Phe Ala Leu 
71- 20 25 30 

/D Val Asp Met Glu Tyr He Leu Arg Asn He Pro Asp Tyr Glu Met Met 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_L> 




wo 99/29870 PCT/AU98/01023 

239/490 



10 



15 



20 



45 



50 



55 



60 



65 



70 



40 





Glu 


Gin 


Leu 


Gill 


Gin 


Val 


Ser 


Lys 


Lys 


Trp 


Gin 


As n 


Glu 


He 


Glu 




50 










55 










60 










Ala 


Leu 


Gl u 


Asn 


Glu 


Ala 


Gin 


Ser 


Met 


Tyr 


Lys 


Lys 


Tyr 


Gl n 


Ser 


Asp 


65 










7 0 










75 










CO 


Leu 


Va 1 


Phe 


Leu 


Ser 


Ala 


Ala 


Gin 


Lys 


Lys 


Thr 


• J J. n 


- 

(j1 u 


--lu 


Al a 


1 1 — 










Q C 










90 










95 




Val 


Lys 


L-/5 


Gl u 


Gi n 


Gin 


Ala 


Ser 


Glu 


Leu 


Lys 


Arg 


Lys 


Tyr 


Phe 


Gly 








100 










105 










110 






Pro 


Gl u 


Gly 


Glu 


Leu 


Tyr 


Lys 


Lys 


Arg 


Ser 


Asp 


Leu 


Met 


Lys 


Pro 


He 






115 










120 










125 








Gin 


Asp 


Giu 


He 


Trp 


Asn 


Ala 


He 


Lys 


Gl u 


He 


Ala 


Lys 


Arg 


Asn 


Asn 




130 










135 










140 










Tyr 


Gin 


Met 


Val 


Leu 


Asp 


Arg 


Gly 


Thr 


Ser 


Gly 


He 


He 


Phe 


Ala 


Ser 


145 










150 










155 










160 


Pro 


Ser 


He 


Asp 


He 


Ser 


Asp 


Leu 


Val 


Leu 


Ser 


Lys 


Met 


Gly 


Phe 


Ser 










165 










170 










175 




Lys 

































(2} IIIFORMATIon FOR SEO ID lJO:318 



(i) SEQUEIICE CHARACTERISTICS: 

(A) LEtJGTH: 17 0 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

30 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAIJISM: Po rphyromonas gingivalis 

35 (ix) FEATURE: 

(A) IJAIIE/KEY: misc_feature 
{ B) LOCATION 1 ... 170 



40 





(xi ) 


SEQUENCE DESCRIPTION: SEQ ID !10:31G 


1 










Arg 


He 


Asn 


Lys 


Gin 


His 


Glu 


Met 


Lys 


Lys 


Phe 


Phe 


Leu 


Met 


Leu 


Leu 


1 






5 










10 










15 




Met 


Ala 


Leu 


Pro 


Leu 


Ser 


Leu 


Leu 


Ala 


Gin 


Lys 


Val 


Ala 


Val 


Val 


Asn 








20 










25 










30 






Thr 


Glu 


Glu 


He 


He 


Ser 


Lys 


Met 


Pro 


Glu 


Gin 


Val 


Ala 


Ala 


Thr 


Lys 






35 










40 










45 








Gin 


Leu 


Asn 


Glu 


Leu 


Ala 


Glu 


Lys 


Tyr 


Arg 


Leu 


Asp 


Leu 


Lys 


Ser 


Met 




50 










55 










60 










Asp 


Asp 


Glu 


Phe 


Ala 


Lys 


Lys 


Thr 


Glu 


Glu 


Phe 


Val 


Lys 


Glu 


Lys 


Asp 


65 










70 










75 










SO 


Ser 


Leu 


Leu 


Glu 


Asn 


He 


Arg 


Asn 


Arg 


Arg 


Gin 


Gin 


Glu 


Leu 


Gin 


Asp 










85 










90 










95 




He 


Gin 


Thr 


Arg 


Tyr 


Gin 


Gin 


Ser 


Tyr 


Gin 


Thr 


Met 


Gin 


Glu 


Asp 


Leu 








100 










105 










110 






Gin 


Lys 


Arg 


Gin 


Gin 


Gin 


Leu 


Phe 


Ala 


Pro 


He 


Gin 


Gin 


Lys 


Val 


Ala 






115 










120 










125 








Asp 


Ala 


He 


Lys 


Lys 


Val 


Gly 


Asp 


Glu 


Glu 


Asn 


Cys 


Ala 


Tyr 


He 


Met 




130 










135 










140 










Glu 


Ala 


Gly 


Met 


Met 


Leu 


Tyr 


Thr 


Gly 


Ala 


Thr 


Ala 


He 


Asp 


Leu 


Thr 


145 








150 










155 










160 


Ala 




Val 


hys 


Ala 


Lys 


Leu 


Gly 


lie 


Lys 






















165 










170 















(2) INFORHATIOIJ FOR SEQ ID NO: 319 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 
75 (vi) ORIGINAL SOURCE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



8NSDOCID; <WO 9929870A1_I_> 




15 



20 



25 



30 



WO 99/29870 PCT/AU98/01023 

240/490 

(A) ORGAIIISIl: Porphyromonar ginaivaiis- 

(ix) FEATURE: 

(A) liAIfE/KEY: mi^c_feature 
5 (B) LOCATION 1 ... 828 

SEQUEUCE DESCRI PTIOtI : SEQ ID !IO:319 

lie liet L-ys Olu Ala He Pro Arg Ly^ A-n Ly:? Tyr He Lys Leu Asn 
10 1 5 ic i5 

Gly He Tyu Arg Leu Ser Phe He Leu Leu Cys Cys Leu Leu Cys Ser 

20 25 30 

Gin Ala Ala Met Ala Gin Gly V5.I Arg Val Ser Gly Tyr Val Leu Asp 

35 40 4b 

Arg Gly Glu Lys Pro He Pro Phe Ala Gly Val Lys Val Arg Gly Thr 

50 55 60 

Gly Thr Gly Ala Thr Thr Asn Leu Lys Gly Tyr Tyr Glu Phe Arg Met 

70 75 80 

Lys Ala Thr Thr A£-p Ser He Thr He Glu Phe Ser Ser Met Gly Tyr 

85 90 95 

Gin Gly Val Ser Arg Ser Phe Pro Ser Leu Thr Lys Asp Thr Arg Leu 

100 105 110 

Asn Val Arg Leu Ala Glu Ala Glu Ilet Glu Leu Ser Ser Val Thr Val 

115 120 125 

Gin Ala Thr Lys Arg Arg Leu Asn Thr Met Glu Arg Val Asn Thr Arg 

130 135 
Asp Leu Arg Val Asn Ala Gly Pro Thr Gly Gly Val Glu Ser L'-u He 
1-^5 150 155 160 

Ser Thr Tyr Ala Gly Val Thr Gin Asn Asn Glu Leu Ser Ser Gin Tyr 

1^5 170 175 

Ser Val Arg Gly Gly Ser Tyr Asp Glu Asn Met Val Tyr Val Asn Gly 

180 185 190 

Val Glu Val Tyr Arg Pro Leu Leu Val Arg Ser Ala Gin Gin Glu Gly 
ot- 200 205 

OD Leu Ser Phe Val Asn Pro Asp Leu Thr Gin Ser Val Gin Phe Ser Ala 
210 215 220 

Gly Gly Phe Thr Ala Asp Tyr Gly Asp Lys I let Ser Ser Val Leu Asp 
225 230 235 240 

He Arg Tyr Lys Gin Pro Gin Glu Lys Glu Gly Ala Val Leu Leu Gly 

245 250 255 

Met Leu Gin Ser Ser Ala Tyr Tyr Gly Ser Ser Ala Gly Ala Phe Ser 

260 265 270 

Gin He Thr Gly Val Arg Tyr Lys Ser Ala Lys Ser Leu Leu Gly Thr 

275 280 285 

Thr Asp Thr Lys Ala Glu Tyr Asp Pro He Tyr Ala Asp Gly Gin Thr 

2?0 295 300 

Phe Met Thr Tyr Arg Phe Ser Pro Lys Leu Ser Val Ser Phe Leu Gly 
^'='5 310 315 320 

Asn He Ser Gin Thr Arg Tyr Lys Phe Val Pro Gin Thr Arg Glu Thr 

325 330 335 

Sor Phe Gly Thr Leu Ser Asp Ala Lys Lys Leu Lys He Phe Phe Asp 

340 345 
Gly Gin Glu Gin Asp Arg Phe Leu Thr Tyr Phe Gly Ala Phe Ser Met 

355 360 365 

Asn Phe Val Pro Asp Asp Lys Gin Arg His Thr Val Thr Leu Ser Ala 

370 375 380 

Phe Asn Ser Asn Glu Arg Glu Thr Tyr Asp He Gin Gly Glu Tyr Ph° 
385 390 395 400 

Leu Asn Asp Val Gin Leu Gly Ala Asp Gly Thr Ala Ser Met Ala Ser 

405 410 415 

Gly Ser Glu Asn Ser Asn Gly Leu Gly He Gly Arg Asn His Glu His 

420 425 430 

Ala Arg Asn Arg Leu Ser Tyr Arg Val Leu Asn Met Gly Tyr Arg Gly 

435 440 445 

Glu Met Lys Leu Asn Glu Lys His Arg Leu Gin Ala Gly Val Ser Ala 

450 455 
Gin Met Glu Lys He Ala Asp His He Ser Glu Trp Glu Arg Arg Asp 
470 475 480 

Ser Val Gly Tyr Asn Leu Pro His Ser Glu Thr Val Leu Leu Met Tyr 

485 490 495 

Asn Asn Leu Tyr Ala Asp Thr Gin Met Arg Gly Thr Arg Leu Ser Ala 

500 505 510 

Phe Val Gin Asp Arg Phe Asn Phe Ser Met Gly Gly Gly Thr Phe Ser 

515 520 525 

Leu He Pro Gly He Arg Ala Ser Trp Trp Ser Phe Asn Lys Glu Leu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



40 



45 



50 



55 



60 



65 



70 



75 



BNSDOCID: <WO 9929870A1_I > 




wo 99/29870 PCT/AU98/01023 

241/490 



40 



45 



65 



70 



75 



530 535 54 0 





Leu 


Vai 


Ser 


Pro 


Arg 


lie 


Ser 


Val 


Gly 


Tyr 


Ser 


Pro 


Glu 


Ser 


As J") 


Pro 




5-1 5 










550 










555 










560 




Ala 


Leu 


Vaa 


Leu 


Arg 


Ala 


Ala 


Al a 


Gly 


Leu 


Tyr 


Tyr 


Gin 


Ala 


F ro 


Phe 


D 










565 










57 0 










57 5 






Tyr 


Lyc 


Gl u 


Leu 
58 0 


Arg 


Gin 


Thr 


His 


Lys 
585 


Asp 


Al a 


Gl u 


Gly 


Asn 
590 


Asn 


Va 1 






Val 


Leu 
595 


As n 


G 1 LI 


Ly- 


He 


Arg 
600 


Ser 


Gin 


--ly 


Al a 


Fhe 

605 


His 


He 


Leu 


lU 


Al a 


Gl y 
610 


Ala 


A-p 


Tyr 


Thr 


Fhe 
61 5 


Glu 


I let 


Gly 


Giy 


Arg 
620 


L'j-s 


Tyr 


Lys 


Phe 




Thr 


Al a 


Gl u 


Al a 


Tyr 


Tyr 


Ly^ 


Ser 


Leu 


Fhe 


Asn 


He 


Asn 


Pro 


Tyr 


He 




625 










630 










635 










6^0 


15 


lie 


Giu 


Asn 


Val 


Lys 


He 


Arg 


Tyr 


Leu 


Gly 


Glu 


Asn 


He 


Gly 


Ser 


Gly 










5 










650 










655 






T-T 


Ala 


Ala 


Gly 
660 


lie 


Asp 


Leu 


Lys 


Leu 
665 


Phe 


Gly 


Glu 


Leu 


Val 
670 


Pro 


Gly 




Val 


A^p 


Ser 


Trp 


Leu 


Thr 


Ala 


Ser 


He 


He 


Lys 


Ala 


Arg 


Gin 


Lys 


Leu 


on 






67 5 










680 










685 








A-p 


Gly 
690 


Tyr 


Gly 


Ser 


Leu 


Pro 
695 


Leu 


Met 


Asn 


Ala 


Pro 
700 


Thr 


Tyr 


Asn 


Phe 




Ser 


Phe 


Phe 


Leu 


Gin 


Glu 


Tyr 


Val 


Pro 


Gly 


Asn 


Lys 


Arg 


He 


Thr 


Ala 




7 05 










710 










715 










720 




Thr 


Leu 


Arg 


Ala 


Ala 


Leu 


Ser 


Gly 


Gly 


Leu 


Pro 


Gin 


Leu 


Asn 


Pro 


Ser 










725 










7 30 










735 






Ly^ 


Gly 


Leu 


Ser 
74 0 


Ser 


Pro 


Ala 


Phe 


Thr 
7^15 


Ala 


Pro 


Ala 


Tyr 


Lys 
750 


Arg 


Val 




Asp 


Leu 


Gly 


Val 


Met 


Tyr 


Ly~ 


Trp 


Leu 


Asp 


Pro 


Asp 


Asp 


Ser 


Phe 


Ala 


30 






755 










760 










765 








Giy 


Arg 
770 


Ser 


Lys 


Trp 


Leu 


Met 
775 


Gly 


Val 


Lys 


Gly 


Ala 
780 


Tyr 


lie 


Gly 


Ala 




A-p 


Leu 


Phe 


Asn 


Leu 


Phe 


Asp 


Met 


Thr 


Asn 


Val 


Asn 


Ser 


Tyr 


Tyr 


Trp 




785 










790 










7 95 










800 


35 


Val 


Ser 


Asp 


Ala 


Tyr 


Gin 


Gin 


Gin 


Tyr 


Ala 


Val 


Pro 


Asn 


Tyr 


Leu 


Thr 










805 










810 










815 






Arg 


Arg 


Gin 


Phe 
820 


Asn 


Leu 


Arg 


Leu 


Leu 
825 


Val 


Glu 


Phe 











(2) IIIFORMATIOII FOR SEQ ID l^!O:320 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 67 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Hii) HYPOTHETICAL: YES 



50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAIIE/KEY: misc_feature 
55 (B) LOCATION 1...67 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320 



60 



Pro 


He 


Pro 


Phe 


Leu 


Leu 


Asn 


Met 


Tyr 


Ser 


Gly 


His 


His 


Lys 


He 


His 


1 








5 










10 










15 




Tyr 


Pro 


Phe 


Leu 


He 


Leu 


Leu 


Val 


Cys 


Leu 


Ala 


Phe 


Ala 


Ala 


cys 


Lys 








20 










25 










30 






Ser 


Val 


Lys 


Leu 


Lys 


Asp 


Ala 


Glu 


Lys 


Ala 


His 


Asp 


Arg 


Gin 


Glu 


Tyr 






35 










■30 










45 








Thr 


Lys 


Ala 


Ala 


Asp 


Met 


Tyr 


Asn 


Thr 


Leu 


Tyr 


Arg 


Arg 


Thr 


Arg 


Arg 




50 










55 










60 










Lys 


Gin 


Val 


Glu 


Met 


Lys 


Ala 


Tyr 


Thr 


Ala 


Phe 


Arg 


Ser 


Gly 


Glu 


Asn 


65 










70 










75 










80 


Tyr 


Arg 


Ala 


Ala 


Gly 


Arg 


Gin 


Ala 


Lys 


Ala 


Leu 


Arg 


Gly 


Tyr 


Leu 


Asn 










85 










90 










95 




Ala 


Arg 


Arg 


Tyr 


Gly 


Tyr 


Pro 


Asp 


Ser 


Val 


Val 


Leu 


Leu 


Arg 


Leu 


Ala 








100 










105 










110 






Gin 


Thr 


Tyr 


Gin 


Gin 


Gly Gly 


Asn 


Tyr 


Lys 


Glu 


Ala 


Glu 


Val 


Leu 


Phe 






115 










120 










125 








Arg 


Gly Tyr 


Leu 


Glu 


Ala 


Tyr 


Pro 


Lys 


Ser 


Tyr 


Phe 


Ala 


Ala 


He 


Gly 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_I_> 
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242/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 





1 30 










135 










140 










Leu 


Glu 


Giy 




Leu 


Plie 


Al a 


Arg 


Gin 


Gin 


Lys 


Glu 


Tyr 


Pro 


Thr 


Arg 












1 50 










155 










160 


Tyr 


Arg 


lie 


Arg 


Arg 


Ala 


Ai a 


Glu 


Trp 


Asn 


Ser 


Ala 


Arg 


Gly 


Asp 


Phe 










165 










17 0 










175 




Gly 


Fro 


Al a 


Tyr 


Al a 


Pro 


Asp 


Ala 


Ser 


Ala 


Leu 


Tyr 


Phe 


Thr 


Ser 


Ser 








I 8 










185 










190 






Arg 


Ser 


Lys 


Asp 


Asp 


Gly 


Leu 


Asp 


Asn 


Ser 


Ser 


He 


Thr 


Gly 


Leu 


Lys 






1 95 










200 










205 




Fro 


Asn 


Asp 


He 


Tyr 


He 


He 


Lys 


Arg 


Asp 


Ala 


Gin 


Gly 


Arg 


Trp 


Gly 




2 1 0 










215 










22 0 








Arg 


Pro 


Asp 


Ser 


Val 


Ser 


Gly 


Gly 


He 


Asn 


Thr 


Pro 


Trp 


Asp 


Glu 


Gly 


225 










230 










235 






240 


Vai 


Pro 


Thr 


He 


Thr 


Pro 


Asp 


Gl y 


Ser 


Thr 


He 


Tyr 


Tyr 


Thr 


Leu 


Ala 










245 










250 










255 




Gl n 


Gin 


Gly 


Ala 


Asp 


Tyr 


Asp 


Arg 


Thr 


Val 


Gin 


He 


Tyr 


Ser 


Ala 


Ala 








260 










265 








270 






Arg 


Ser 


Gly 


Glu 


Gly 


Gly 


Trp 


Ser 


Asn 


Gly 


Ser 


Leu 


Val 


Asp 


He 


Met 






275 










280 










285 








Arg 


Asp 


Ser 


Leu 


Arg 


Met 


Ala 


Ala 


His 


Pro 


Ser 


I let 


Ser 


Ala 


Ser 


Gly 




290 










295 










300 








Asp 


Tyr 


Leu 


Tyr 


Phe 


Val 


Ser 


Asn 


He 


Gly 


Gly 


Ser 


Tyr 


Gly 


Gly 


Lys 


305 










310 










315 






320 


Asp 


lie 


Tyr 


Arg 


Val 


Lys 


Val 


Ser 


Asp 


Arg 


Ser 


Tyr 


Gly 


Ser 


Pro 


Glu 










325 










330 








335 




Asii 


Leu 


Gly 


Pro 


Asp 


He 


As n 


Thr 


Pro 


Gly 


Asp 


Glu 


Met 


Phe 


Pro 


Phe 








34 0 










345 










350 






lie 


Asp 


Gly 


Asp 


Ser 


Thr 


Leu 


Phe 


Phe 


Ala 


Ser 


Asp 


Gly 


His 


Ala 


Gly 






355 










360 










365 






Leu 


Gly 


Gly 


Leu 


Asp 


He 


Phe 


Lys 


Ala 


Thr 


Leu 


Asp 


Ser 


Thr 


Gly 


Gin 




37 0 










375 










380 








Trp 


His 


Val 


Val 


Asn 


Met 


Gly 


Gin 


Pro 


Val 


Asn 


Ser 


Ser 


Ala 


Asp 


Asp 


3 8 5 










390 










395 








400 


Phe 


Gly 


Leu 


Ala 


Val 


Glu 


Pro 


Lys 


Gly 


Lys 


Asn 


Lys 


Glu 


Glu 


Ala 


Leu 










405 










410 








415 




Pro 


Asp 


Asn 


Gly 


Val 


Lys 


Gly 


Val 


Phe 


Cys 


Ser 


Asn 


Arg 


Gly 


Asp 


Ala 








420 










425 










430 




Arg 


Gl y 


Trp 


Pro 


His 


Leu 


Phe 


His 


Phe 


Glu 


Leu 


Pro 


Ala 


He 


Tyr 


Thr 






435 










440 










445 






Gl u 


lie 


Gin 


Gly 


Tyr 


Val 


Met 


Asp 


Arg 


Glu 


Glu 


Asn 


Pro 


He 


Ala 


Gly 




4 50 










455 










460 










Al a 


Thr 


Val 


Arg 


He 


Val 


Gly 


Glu 


Arg 


Gly 


Pro 


Val 


Gly 


Gin 


Gly 


Phe 


4 65 










470 










475 






480 


Val 


Thr 


Thr 


Arg 


Asp 


Asp 


Gly 


Ser 


Tyr 


Lys 


Met 


Ser 


Val 


Gin 


Gly 


Asp 










485 










490 










4 95 




Thr 


Arg 


Tyr 


Val 


Met 


Leu 


Ala 


Gly 


Ala 


Ser 


Gly 


Tyr 


Leu 


Asn 


Gin 


Tyr 








500 










505 










510 




Val 


Glu 


Leu 


Lys 


Thr 


Asp 


Thr 


Ala 


Lys 


Gin 


Ser 


Glu 


Thr 


Tyr 


Tyr 


Val 






51 5 










520 










525 




Asp 


Phe 


Phe 


Leu 


Ala 


Ser 


Arg 


Glu 


Lys 


Ala 


Glu 


Gly 


Leu 


Gin 


Asn 


He 




530 










535 










540 










Phe 


Tyr 


Asp 


Phe 


Asp 


Lys 


Ala 


Thr 


Leu 


Arg 


Pro 


Glu 


Ser 


Met 


Lys 


Ser 


54 5 










550 










555 








560 


Leu 


Asp 


Glu 


Leu 


He 


Arg 


He 


Leu 


Thr 


Asp 


Asn 


Pro 


Asp 


He 


Arg 


He 










565 










570 








575 




Glu 


Leu 


Gly 


Ser 


His 


Ala 


Asp 


Arg 


Lys 


Gly 


Pro 


Asp 


Ala 


Tyr 


Asn 


Leu 








580 










585 










590 






Gl V 


Leu 


Se r 




Arg 


Arg 


Ml a 


T \ re 


Ser 


Val 


Val 


Asp 


Tyr 


Leu 


Thr 


Ser 






595 










600 










605 








Arg 


Gly 


He 


Ala 


Ala 


Asp 


Arg 


Leu 


Thr 


Trp 


Lys 


Gly 


Tyr 


Gly 


Lys 


Ser 




610 










615 










620 




Val 


Pro 


Lys 


Thr 


Val 


Thr 


Ala 


Lys 


He 


Ala 


Glu 


Arg 


His 


Asp 


Phe 


Leu 


625 










630 










635 








640 


Lys 


Glu 


Gl y 


Asp 


Val 


Leu 


Thr 


Glu 


Glu 


Phe 


Val 


Ala 


Pro 


Leu 


Thr 


Glu 










645 










650 










655 




Glu 


Gin 


Gln 


Ser 


Val 


Cys 


Asp 


Gin 


Leu 


Asn 


Arg 


Arg 


Thr 


Glu 


Phe 


Arg 








660 










665 










670 




Val 


lie 


Glu 


Glu 


Glu 


Leu 


Arg 
























€75 


























(2) 


III FORI lATIOM 


FOR 


SEQ 


ID NO: 321 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 772 amino acids 

(B) TYPE: amino acid 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I_> 



243/490 



(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIIJAL SOURCE: 

(A) ORGAUISII: Pcrphyromona s gingival is 

{ix) FEATURE: 

(A) tJAJIE/KEY: mi £c_f eatur e 
{ B) LOCATIOII 1 . . .77 2 



(Mi) SEQUEMCE DESCRI PTI Oi I : SEQ ID riO:321 



Pro 


Ala 


Gin 


Pro 


Lys 


Thr 


Tyr 


Cys 


He 


Arg 


T-yr 


Phe 


Arg 


Arg 


Glu 


Val 


1 








5 










10 










15 




Ser 


Pro 


Arg 


Arg 


Gin 


Thr 


Glu 


Arg 


Thr 


Leu 


Tyr 


Ala 


Asp 


Lys 


Met 


Arg 








20^ 










25 










30 






Arg 


His 


I]e 


Val 


Lys 


Cys 


Cys 


Pro 


Ser 


Ala 


Ser 


He 


Leu 


Leu 


Gin 


Glu 






35 










40 










45 








Tyr 


Ser 


Thr 


Thr 


Glu 


Leu 


Phe 


He 


His 


His 


Phe 


Asn 


He 


Ser 


He 


He 




50 










55 










GO 










net 


Lys 


Lys 


Phe 


Phe 


Phe 


Ala 


Leu 


Leu 


Ser 


He 


Gly 


He 


Ser 


Ala 


Gin 


65 










70 










75 










80 


Al a 


Phe 


Ala 


Lys 


Thr 


Asp 


Asn 


Val 


Pro 


Thr 


Asp 


Ser 


Leu 


Arg 


Val 


His 










85 










90 










95 




A^n 


Leu 


Gin 


Thr 


Val 


Thr 


Val 


Tyr 


Ser 


Thr 


Arg 


Thr 


Ala 


Val 


Pro 


Leu 








100 










105 










110 






Lys 


Lys 


He 


Pro 


Ala 


Lys 


Met 


Glu 


Leu 


He 


Ser 


Ser 


Ara 


Asn 


He 


Lys 






115 










120 










125 








Gin 


Ser 


Gly 


Phe 


Asn 


Asn 


Met 


Thr 


Asp 


He 


Leu 


Lys 


Thr 


Gin 


Ser 


Ser 




130 










135 










140 










Leu 


A^p 


Val 


He 


Gin 


Tyr 


Pro 


Gly 


Phe 


Ser 


Ser 


Asn 


He 


Gly 


He 


Arg 


145 










150 










155 










160 


Gly 


Phe 


Lys 


Pro 


Ser 


Gly 


Lys 


Tyr 


Val 


Thr 


Val 


Leu 


Val 


Asn 


Gly 


He 










165 










170 










175 




Pro 


Ala 


Gly 


Thr 


Asp 


Asn 


He 


Ser 


Thr 


Leu 


Asn 


Thr 


Ser 


Asn 


He 


Glu 








180 










185 










190 






Gin 


He 


Glu 


He 


Leu 


Lys 


Gly 


Pro 


Phe 


Ser 


Ser 


He 


Tyr 


Gly 


Thr 


Asn 






195 










200 










205 








Ala 


Met 


Gly 


Gly 


Val 


Val 


Asn 


He 


He 


Thr 


His 


Lys 


Ser 


Lys 


Asp 


Lys 




210 










215 










220 










He 


His 


Gly 


Asn 


Val 


Ser 


Leu 


Phe 


Gly 


Gly 


Ser 


Tyr 


Gin 


Thr 


Met 


Ala 


225 










230 










235 










240 


Gly 


Ser 


Phe 


Asn 


Leu 


Gly 


Gly 


Arg 


Phe 


Glu 


Asp 


He 


Phe 


Ser 


Phe 


Asp 










245 










250 










255 




Leu 


Ser 


Leu 


Gly 


Leu 


Asp 


Lys 


Gin 


As n 


Lys 


Asp 


Tyr 


Lys 


Thr 


Gly 


Ser 








260 










265 










270 






Asn 


Asn 


Phe 


Leu 


Ser 


Leu 


Ser 


Lys 


Leu 


Glu 


Glu 


Ala 


He 


Val 


Asp 


Val 






275 










280 










285 








Afp 


Ala 


Thr 


Lys 


Asn 


Lys 


Lys 


Met 


Lys 


Gly 


Ser 


Asp 


Tyr 


Thr 


Val 


Ala 




290 










295 










300 










Thr 


Gly 


Arg 


Leu 


Arg 


Phe 


Gly 


He 


Asp 


Phe 


Thr 


Pro 


Glu 


Trp 


Ser 


Leu 


305 










310 










315 










320 


Asn 


Leu 


Tyr 


Gin 


Asn 


Val 


Phe 


Leu 


Gly Asp 


Ala 


He 


Pro 


Val 


Gly 


Gly 










325 










330 










335 




Ser 


He 


Trp 


Gly 


Val 


Tyr 


Gly 


Glu 


Ser 


Lys 


Lys 


Asn 


Leu 


Asn 


Arg 


Ser 








340 










345 










350 






Ser 


Thr 


Ser 


Phe 


Glu 


Leu 


Leu 


Gl y 


Lys 


His 


Gly 


Cys 


His 


Thr 


Leu 


Gin 






355 










360 










365 








Phe 


Ser 


Pro 


Tyr 


Phe 


Asn 


He 


Glu 


Lys 


Ser 


Glu 


Asn 


Tyr 


Asn 


Asn 


Ala 




370 










375 










380 










Asp 


Pro 


Thr 


Gly 


Phe 


He 


Asn 


Tyr 


Lys 


Ser 


Asp 


Tyr 


Tyr 


Thr 


Tyr 


Gly 


385 










390 










395 










4 00 


Ala 


Leu 


Leu 


Gin 


Asp 


Lys 


He 


Ser 


Phe 


Gly 


Gly 


Gin 


Asn 


He 


Val 


Leu 










405 










410 










415 




Gly 


Val 


Asp 


Ser 


Arg 


Asn 


Met 


Thr 


Met 


Glu 


Ser 


Glu 


Arg 


Phe 


Glu 


Gin 








420 










425 










430 






Ala 


Gly 


Val 


Asn 


Thr 


Lys 


Pro 


Tyr 


Asn 


Pro 


Gly 


Tyr 


Ala 


Thr 


Asn 


Asn 






-135 










4 40 










445 








He 


Gly 


Leu 


Phe 


Gly 


Gin 


Ala 


Asn 


Phe 


Tyr 


Leu 


Leu 


Asn 


Asp 


Ala 


Leu 




450 










455 










4 60 










Ser 


He 


Ser 


Ala 


Gly 


Ala 


Arg 


Ala 


Asp 


Phe 


Met 


Phe 


Phe 


Asp 


Leu 


Lys 


465 










470 










475 










480 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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Ala 


Asn 


Glu 


Tyr 


Leu 
^85 


As n 


Asn 


Glu 


Ala 


Lys 
4 90 


Gin 


Glu 


Thr 


His 


Asn 
4 95 


Val 


lie 


Asn 


Pro 


Asn 
500 


Val 


Gly 


He 


Lys 


Tyr 
505 


Glu 


Phe 


Val 


Lys 


Gly 
510 


Leu 


Thr 


Ala 


His 


Gly 
515 


Thr 


Phe 


Gly 


Ser 


Ala 
520 


Phe 


Ser 


Ala 


Pro 


Asp 
525 


Ala 


Phe 


Gin 


hys 


Al a 


Gly 


Gin 


Tyr 


Val 


Gly 


Pro 


Phe 


•^iy 


Thr 


Thr 


He 


Gly 


Asn 


Pro 




530 










535 










54 0 








A-p 


Leu 


Lys 


Pro 


Gi u 


Lys 


Ser 


Met 


Thr 


Trp 


Asp 


Phe 


Gly 


He 


Gly 


Tyr 












550 










555 








560 


Ser 


Asn 


Ala 


Arg 


565 


Gly 


He 


Gin 


Ala 


Asp 
57 0 


Val 


Thr 


Leu 


Thr 


Tyr 
575 


Phe 


Hi- 


Thr 


Asp 


His 
580 


Lys 


Asp 


Leu 


He 


Leu 
585 


Ser 


Ser 


Pro 


Asp 


Tyr 
590 


Ala 


Asn 


Ac n 


lie 


Thr 


Thr 


Tyr 


He 


Asn 


Ala 


Asp 


Lys 


Ala 


Arg 


Met 


Ser 


Gly 


He 






5 95 










600 










605 






Gl u 


Ala 


Leu 


Leu 


Ser 


Tyr 


Asp 


Phe 


Gly 


Ser 


Leu 


Phe 


Ala 


Asn 


Lys 


Phe 




610 










615 










620 








Ser 


Leu 


Arg 


Ala 


Phe 


Ala 


Asn 


Ala 


Thr 


He 


Met 


Leu 


Asn 


Ser 


Glu 


Met 


625 










630 










635 










64 0 




Lys 


Ser 


Gin 


Thr 
6'15 


Asp 


Ala 


Pro 


Trp 


Ser 
650 


Glu 


Met 


Tyr 


Tyr 


Val 
655 


Arg 


Lys 


Gin 


Asn 


He 
660 


Thr 


Phe 


Gly 


He 


Glu 
665 


Tyr 


Arg 


Gly 


Lys 


Glu 
670 


Gly 


Leu 


Glu 


Va 1 


Met 
67 5 


Leu 


Asn 


Gly 


Arg 


Phe 
680 


Met 


Gly 


Arg 


Arg 


He 
685 


Glu 


Gin 


Asn 


Trp 


Tyr 
690 


Ala 


Tyr 


Tyr 


Pro 


Glu 
6S5 


Val 


Arg 


Pro 


Glu 


Leu 
700 


Gin 


Gin 


Leu 


Leu 


Ala 


Ala 


Glu 


Glu 


Pro 


Glu 


Leu 


Ala 


Ala 


Gin 


Gly 


Leu 


Leu 


Arg 


His 


Pro 


7 05 










710 










715 










720 


Gin 


Ala 


Met 


Val 


Phe 


Asn 


Ala 


Ser 


Ala 


Tyr 


Tyr 


His 


Met 


Asn 


Lys 


Tyr 










725 










730 










735 


Leu 


Thr 


Phe 


Gly 


Val 


Asn 


Leu 


Asn 


Asn 


He 


Leu 


Asp 


Glu 


Leu 


Tyr 


Thr 








740 










745 










750 




Glu 


Lys 


Asp 


Gly 


Tyr 


His 


Met 


Pro 


Gly Arg 


Asn 


He 


Met 


Gly 


Lys 


Val 






755 










760 










765 








Met 


Val 
770 


Asn 


Phe 



























(2) II I FORMAT I ON TOR SEQ ID HO: 322 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 484 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIKAL SOURCE: 

(A) ORGAIJISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) riAf-lE/KEY: misc_feature 

(B) LOCATION 1 ... 484 



(xi ) SEQUENCE DESCRIPTION: SEQ ID HO:322 



Gly 


Arg 


Ala 


Ser 


Ser 


Pro 


Tyr 


Arg 


Gin 


Met 


Asp 


Gly 


He 


Leu 


Asn 


Asp 


1 








5 










10 








15 


Glu 


Tyr 


Arg 


Gin 


Ala 


Ser 


Met 


Asn 


Arg 


Phe 


Ser 


Asn 


His 


Trp 


Pro 


Cys 








20 










25 










30 




He 


Leu 


Val 


Gly 


Phe 


Val 


Leu 


Trp 


Phe 


Val 


Ser 


Ala 


Ser 


Arg 


Thr 


Val 






35 










40 










45 






Ala 


Gl n 


Asn 


Ala 


Ser 


Glu 


Thr 


Thr 


Val 


Ser 


Tyr 


Asp 


Thr 


Asp 


Thr 


Ala 




50 










55 








60 








Val 


Leu 


Ser 


Glu 


Ala 


Asp 


Vol 


Leu 


Arg 


He 


Ala 


Leu 


Ser 


Glu 


Asn 


Ala 


65 










70 








75 










80 


Thr 


Val 


Lys 


Val 


Ala 
85 


Asp 


Met 


Asp 


Val 


Arg 
90 


Lys 


Gin 


Glu 


Tyr 


Ala 
95 


Arg 


Arg 


Ala 


Ala 


Arg 


Ala 


Asp 


Leu 


Phe 


Pro 


Lys 


Val 


Asp 


Leu 


Asn 


Gly 


Val 








100 










105 










110 




Tyr 


Ser 


His 


Thr 


Leu 


Lys 


Lys 


Gin 


Val 


Leu 


Tyr 


He 


Asp 


Met 


Pro 


Gly 






115 










120 








125 









SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 

75 



The Ser Ser Ser Giu Giy lie Glu Met Gly Arg Thr His Asn Thr Gin 
130 135 140 



Gly 


Gly 


Vh 1 


Asn 


Val 


Ser 


I let 


Pre 


Leu 


Val 


Ser 


Ala 


Gin 


Leu 


Trp 


Lys 


1-15 










150 










155 










160 


Ser 


He 


Ala 


Met 


Tlir 


Giy 


Glu 


Gl II 


Leu 


Asp 


Leu 


Ala 


Leu 


Glu 


Lys 


Ala 










165 










170 










175 




Arg 


Ser 


Ser 


Arg 


He 


Asp 


Leu 


Val 


Al a 


Glu 


Va 1 


Lys 


Lys 


Ala 


Tyr 


Leu 








180 










185 










190 






Ser 


Val 


Leu 


Leu 


Ala 


Glu 


Asp 


Ser 






Va 1 


Phe 


Lys 


Arg 


Ser 


Tyr 






195 










200 










205 








Asp 


Am 


Ala 


Leu 


Ala 


Asn 


Tyr 


Lys 


Asn 


lie 


Ser 


Asp 


Lys 


Phe 


Asp 


Arg 




210 










215 










220 










Gly 


Leu 


Val 


Ala 


Glu 


Tyr 


Asp 


Lys 


He 


Arg 


Ala 


Asn 


Val 


Gin 


Val 


Arg 


225 










230 










235 










24 0 


Asii 


He 


Glu 


Pro 


Asn 


Leu 


Leu 


Gin 


Ala 


Gin 


Asn 


Ser 


Val 


Ala 


Leu 


Ala 










24 5 










250 










255 




Leu 


Trp 


Gin 


Leu 


Lys 


Val 


Leu 


net 


Ser 


Met 


Glu 


Val 


Glu 


Thr 


Pro 


He 








260 










265 










270 






Arg 


Leu 


Ser 


Gly 


Ser 


Leu 


Ser 


Asp 


Tyr 


Lys 


Glu 


Gin 


Val 


Tyr 


Thr 


Gly 






275 










280 










285 








Tyr 


Phe 


Ala 


Ala 


Asp 


Thr 


Leu 


He 


Ser 


As n 


Asn 


Ser 


Ser 


Leu 


Arg 


Gin 




2 90 










295 










300 










Leu 


Asp 


He 


Gin 


Arg 


Arg 


Leu 


Ala 


Val 


Ser 


Ala 


Asp 


Lys 


Leu 


Asn 


Lys 


305 










310 










315 










32 0 


Tyr 


Ser 


Phe 


Leu 


Pro 


Thr 


Leu 


Asn 


Leu 


Gly 


Gly 


Gin 


Tyr 


Thr 


Tyr 


Ser 










325 










330 










335 




Leu 


Asn 


Ser 


Asn 


Asp 


He 


Lys 


Phe 


Trp 


Gly 


Gl u 


Gly 


Gin 


Arg 


Trp 


Thr 








340 










345 










350 






Pro 


Phe 


Ser 


Thr 


He 


Ser 


Leu 


Ser 


Leu 


Tyr 


He 


Pro 


He 


Phe 


Asn 


Gly 






355 










360 










365 








Gly 


Lys 


Arg 


Leu 


Tyr 


Asn 


Val 


Lys 


Gin 


Ser 


Ala 


Leu 


Ser 


He 


Arg 


Gin 




370 










375 










380 










lie 


Asp 


Leu 


Gin 


Arg 


Arg 


His 


He 


Glu 


Gin 


Ser 


He 


Arg 


Met 


Gly 


He 


385 










390 










395 










400 


Lys 


Asn 


Gin 


Asn 


Asp 


Arg 


Leu 


Arg 


Thr 


Cys 


Met 


Gin 


Arg 


Phe 


Val 


Ala 










405 










410 










415 




Ser 


Glu 


Glu 


Ala 


Val 


Arg 


Ser 


Ala 


Glu 


Lys 


Gly 


Tyr 


Gin 


He 


Ala 


Glu 








420 










425 










430 






Lys 


Arg 


Tyr 


Gin 


Thr 


Gly 


Glu 


Gly 


Thr 


Leu 


Val 


Glu 


Leu 


Asn 


Asp 


Ala 






4 35 










440 










445 








Asp 


Val 


Ala 


Leu 


Leu 


Gin 


Ala 


Arg 


Leu 


Asn 


Tyr 


Asn 


Gin 


Ala 


He 


Phe 




450 










455 










460 










Asp 


Phe 


Met 


Thr 


Ala 


Lys 


Ala 


Glu 


Leu 


Asp 


Lys 


Met 


Asn 


Gly 


Met 


Giy 


46b 










470 










475 










480 


lie 


Pro 


Glu 


Gin 



























(2) IIJFORMATION FOR SEQ ID HO: 323 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEI4GTH: 540 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAMISM: Porphyromonas gingivaiis 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1 ... 540 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323 

Phe His Asn Phe Asp Phe Leu Asn Gly He Lys Leu Phe Ser Met Ala 

15 10 15 

Asn Asn Thr Leu Leu Ala Lys Thr Arg Arg Tyr Val Cys Leu Val Gly 

20 25 30 

Phe Cys Trp Leu Met Ala Met Met His Leu Ser Gly Gin Glu Val Thr 

35 4 0 4 5 

Met Trp Gly Asp Ser His Gly Val Ala Pro Asn Gin Val Arg Arg Thr 
50 55 60 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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Leu Val Ly^ Val Ala Leu Ser Glu Ser Leu Pro Fro Gly Ala Lys Gin 

70 75 80 

He Arg He Giy Phe Ser Leu Pro Lys Glu Thr Glu Glu Lys Val Thr 

85 90 95 

Ala Leu Tyr Leu Leu Val Ser Asp Ser Leu Ala Val Arg Asp Leu Pro 

100 105 110 

Asp Tyr Lys Gly Arg Val Ser Tyr Asp Ser Phe Pro He Ser Lys Glu 

lis 120 12 5 

Asp Arg Thr Thr Ala Leu Ser Ala Asp Ser Val Ala Gly Arg Arg Phe 

130 135 i.jr, 

Fhe Tyr Leu Ala Ala Asp He Gly Pro Val Ala Ser Phe Ser Arg Ser 
1*^^ 150 155 160 

Asp Thr Leu Thr Ala Arg Val Glu Glu Val Ala Val Asp Gly Arg Pro 

165 170 175 

Leu Pro Leu Lys Glu Leu Ser Pro Ala Ser Arg Arg Leu Tyr Arg Gly 

19 0 185 190 

Tyr Glu Ala Leu Phe Val Pro Gly Asp Gly Gly Ser Arg Asn Tyr Arg 

195 200 205 

He Pro Ala He Leu Lys Thr Ala Asn Gly Thr Leu He Ala Met Ala 

210 215 220 

Asp Arg Arg Lys Tyr Asn Gin Thr Asp Leu Pro Glu Asp He Asp II- 
225 230 235 240 

Val Met Arg Arg Ser Thr Asp Gly Gly Lys Ser Trp Ser Asp Pro Arg 

245 250 255 

He He Val Gin Gly Glu Gly Arg Asn His Gly Phe Gly Asp Val Ala 

260 265 270 

Leu Val Gin Thr Gin Ala Gly Lys Leu Leu Met He Phe Val Gly Gl*- 

275 280 285 

Val Gly Leu Trp Gin Ser Thr Pro Asp Arg Pro Gin Arg Thr Tv^r H*- 

290 295 300 

Ser Glu Ser Arg Asp Glu Gly Leu Thr Trp Ser Pro Pro Arg Asp 11^ 
310 315 320 

Tlir His Phe He Phe Gly Lys Asp Cys Ala Asp Pro Gly Arg Ser Arg 

325 330 335 

Trp Leu Ala Ser Phe Cys Ala Ser Gly Gin Gly Leu Val Leu Pro Ser 

340 34S 350 

Gly Arg He Thr Phe Val Ala Ala He Arg Glu Ser Gly Gin Glu Tyr 

355 360 365 

Val Leu Asn Asn Tyr Val Leu Tyr Ser Asp Asp Glu Gly Asp Thr Trp 

3'?0 375 380 

Gin Leu Ser Asp Cys Ala Tyr Arg Arg Gly Asp Glu Ala Lys Leu Ser 
385 390 

Leu Met Pro Asp Gly Arg Val Leu Met Ser He Arg Asn Gin Gly Arg 

-505 410 ^15 

Gin Glu Ser Arg Gin Arg Phe Phe Ala Leu Ser Ser Asp Asp Gly Leu 

420 425 430 

Thr Trp Glu Arg Ala Lys Gin Phe Glu Gly He His Asp Pro Gly Cys 

^^35 440 
Asn Gly Ala Met Leu Gin Val Lys Arg Asn Gly Arg Asp Gin Val Leu 

^SO 455 4£;q 

His Ser Leu Pro Leu Gly Pro Asp Gly Arg Arg Asp Giy Ala Val Tyr 
465 470 475 480 

Leu Phe Asp His Val Ser Gly Arg Trp Ser Ala Pro Val Val Val Asn 

485 490 495 

Ser Gly Ser Ser Ala Tyr Ser Asp Met Thr Leu Leu Ala Asp Gly Thr 

500 505 510 

He Gly Tyr Phe Val Glu Glu Gly Asp Glu He Ser Leu Val Ph« He 

515 520 525 

Arg Phe Val Leu Asp Asp Leu Phe Asp Val Arg Gin 
530 535 540 

(2) INFORMATION FOR SEQ ID NO: 32 4 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

{A) ORGAfUSM: Porphyromonas gingivalis 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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10 



15 



20 



25 



30 



35 



40 



45 



65 



(i:-. ) FEATURE: 

(A) MAIIE/KEY: mi sc_f eature 
( B) LOCATIOIl 1 . . .293 

(.-:i) SEQL'EIICE DESCRIPTION: SEQ ID 110:32') 





Ser 


Pro 


Ser 


Aep 


Ser 


Ala 


Leu 


Pro 


Ser 


Val 


Trp 


Ser 


Al a 


Asn 


Thr 


i 


















1 0 










1 5 




He 


Ser 


Gly 


Leu 


Val 


Gly 


Gl y 


Lys 


Arg 


He 


Thr 


Leu 


Leu 


He 


Leu 


Tyr 








2 0 










2 5 










30 






Ser 


hlet 


Ala 


He 


Arg 


Ser 


Ser 


Asn 


lie 


Glu 


Ser 


He 


Gin 


Cys 


Phe 


Val 






35 










40 










45 








Met 


Lys 


Lys 


Glu 


Lys 


Leu 


Trp 


He 


Ala 


He 


Val 


Ala 


Gly 


Leu 


Ala 


Phe 




50 










55 










60 










Val 


Leu 


G3y 


Leu 


Tyr 


Ala 


Leu 


Gly 


Arg 


Ser 


Val 


Ala 


Gin 


Leu 


Arg 


Arg 


65 










7 0 










75 










80 


Ser 


Gl n 


Pro 


Ser 


Val 


Thr 


Val 


Thr 


Gly 


Met 


Ala 


Glu 


Arg 


Asn 


Phe 


Lys 










85 










90 










95 




Ser 


Asp 


Leu 


He 


Val 


Trp 


Thr 


Ala 


Ser 


Tyr 


Gin 


Leu 


Gin 


Met 


Met 


Asp 








1 DO 










105 










110 






Leu 


Glu 


Ser 


Ala 


Tyr 


Lys 


Ala 


Leu 


Lys 


Glu 


Lys 


Gin 


He 


Leu 


Val 


Ala 






115 










120 










125 








Asp 


Tyr 


Leu 


Lys 


Asn 


Lys 


Gin 


Leu 


Pro 


Asp 


Ser 


Ser 


Tyr 


He 


Phe 


Ser 




130 










135 










140 










Ser 


Val 


Ala 


He 


Ser 


Lys 


Glu 


Tyr 


Asn 


Tyr 


Tyr 


Tyr 


Asp 


Pro 


Arg 


Gin 


146 










150 










156 










160 


Glu 


Gin 


Asm 


Val 


Arg 


Thr 


Phe 


Ala 


Gly 


Tyr 


Leu 


Leu 


Ser 


Gl n 


Thr 


Val 










166 










170 










175 




Thr 


Val 


Thr 


Ser 


Gin 


Asp 


He 


Glu 


His 


Val 


Glu 


Lys 


He 


Ser 


Arg 


Asp 








180 










185 










190 






11- 


Thr 


Glu 


Leu 


He 


Asn 


Gin 


Gly 


Val 


Glu 


He 


Thr 


Ser 


Asp 


Arg 


Pro 






195 










200 










205 








Ala 


Tyr 


Tyr 


Tyr 


Thr 


Lys 


Leu 


Asn 


Asp 


Leu 


Lys 


Val 


Glu 


Met 


Leu 


Arg 




210 










215 










220 










Asn 


Ala 


Ser 


Glu 


Asp 


Ala 


Phe 


Asn 


Arg 


Ala 


Ser 


Val 


He 


Ala 


Glu 


Gly 


225 










230 










235 










240 


Ser 


Gly 


Ser 


Ser 


Val 


Gly 


Lys 


Met 


Leu 


Ser 


Ser 


Ser 


net 


Gly 


Val 


Phe 










245 










250 










255 




Gin 


He 


Val 


Gly 


Leu 


Asn 


Ser 


Asn 


Glu 


Asp 


Tyr 


Ser 


Trp 


Gly 


Gly 


Ser 








260 










265 










270 






Phe 


Asn 


Thr 


Ser 


Ser 


Lys 


Met 


Lys 


Thr 


Ala 


Ser 


He 


Thr 


Val 


Lys 


Ala 






275 










280 










285 








Ser 


Phe 


Ala 


Leu 


Lys 


























2 90 































(2) HJFORMATIOn FOR SEQ ID MO: 325 



(i) SEQUEHCE CHARACTERISTICS: 

(A) LEtlGTH: 280 amino acids 
50 (B) TYPE: amino acid 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

55 (iii) HYPOTHETICAL: YES 

ivi) ORIGINAL SOURCE: 

(A) ORGAI^IISH: Porphyromonas gingivalis 

60 (ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 ... 280 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325 



Gly Lys Leu Gin Met Lys Lys Thr He Ala He He Ala Ser Ala Leu 
15 10 15 

Leu Ala Leu Gly Ala Val Gly Cys Lys Lys Asn Ala Asp Thr Thr Ala 
20 25 30 

70 Val Ser Glu Lys Asp Ser He Ala Leu Ser Met Gly He Leu Tyr Gly 
35 40 45 

Gin Asp Phe Ala Asn Gin Phe Glu Met Ser Arg Leu Gin Gly Gin Pro 

50 55 60 

He Asp Ser Val Ala Phe Leu Asp Gly Phe Lys Tyr Gly He Asp Thr 
75 65 70 75 80 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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9? 95 
A-p A^p Lys Phe Tyn Ala Ala 

105 110 
Thr Val S-r He Ala Met Ly^ Pro 
125 

Arg lie Gin Ala Lys Ly^ Gin Arg 
I'lO 

Gin Asn He Glu Lys Gly Asn Giu 
155 IGO 
Asp Gl-y Val Thr Val Thr Thr Thr 

17 0 17 5 

Glu Giy Thr Gly Ala Thr Pro Ser 
185 190 
Tyr '.'al Gly Thr Leu Val Asp Giy 
205 

Gly lie Glu Phe Ala Val Thr Gly 

220 

Leu Gin Leu Met Lys Val Gly Gin 
235 240 
Glu Leu Ala Tyr Gly Glu Thr Gly 

250 255 
Thr Leu Thr Phe Glu Met Giu Leu 
2 65 2 7 0 



10 



15 



20 



25 



40 



45 



50 



55 



60 



65 



70 



75 



Thr 


Arg 


Phe 


Ser 


Tyr 
8 5 


Asn 


Leu 


Gly 


Arg 


Gin 


Leu 


Al a 


Mis 


Asp 


Ser 


He 








1 00 










1 le t 


Arg 


Al a 


Al a 


Leu 


Leu 


Lys 


Asp 






115 










120 


H 1 - 


Asp 


Al a 


Gin 


Ala 


Phe 


Met 


Gin 




130 










135 




tjl u 


Asn 


Asn 


Met 


L-ys 


Gin 


Phe 


Gly 


14 5 










150 






Tyr 


I J e 


Asp 


Thr 


Phe 


Lys 


Lys 


Glu 










1 65 








Gl y 


Leu 


Al a 


Tyr 


Lys 


Thr 


Leu 


Gin 








1 fi n 

X O V 










Leu 


Al a 


- 

Asp 


Thr 


\/n 1 


Arg 


Va 1 


Lys 






195 










200 


hys 


Gl u 


Phe 


Asp 


Lys 


Asn 


Glu 


Glu 




210 










215 




Val 


lie 


Lys 


Gly 


Trp 


Thr 


Glu 


Met 


225 










230 






L-ys 


Val 


Arg 


Val 


Val 


He 


Pro 


Gin 










245 








Asn 


Tyr 


Thr 


He 


Glu 


Pro 


Phe 


Ser 








260 










He 


Gly 


He 


Lys 


Pro 


Gly 


Lys 


Lys 






275 










280 



(2) INFORMATION FOR SEQ ID NO: 326 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: protein 

(iiij HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGAiJISM: Porphyromonas gingivali; 

Hx) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 ... 803 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326 



Cys 
1 


Arg 


Lys 


Gin 


Lys 
5 


Leu 


Ser 


Asn 


Pro 


Lj s 


Arg 


Ser 


Arg 


Arg 


Arg 


He 


Ser 


Phe 


Arg 


Gin 


Ser 


Asp 


IC 
Lys 


Asn 


Pro 


Met 








20 










25 










Arg 


Gin 


Val 


Phe 


Leu 


Pro 


He 


Leu 


Phe 


Val 


Leu 


Leu 


Thr 


Ser 


Thr 


35 










40 










45 


Thr 


Lys 


Asn 


Leu 


Pro 


Glu 


Gly 


Glu 


Gin 


Leu 


Tyr 




50 










55 










60 


Gly 


Lys 


Thr 


Gin 


He 


Leu 


Arg 


Gin 


Asp 


Lys 


Ser 


His 


Ala 


65 










70 






75 






Ala 


Leu 


Thr 


Glu 


Val 


Glu 


Ser 


Thr 


Leu 


Lys 


Val 


Thr 


Pro 


He 


Phe 






85 










90 








Gly 


Ser 


Ala 


Ser 


Ala 


Ser 


Leu 


Pro 


Lys 


He 


Pro 








100 










105 








Trp 


Leu 


Tyr 


Asn 


Ser 


Phe 


Val 


Gly 


Asp 


Ser 


Thr 


Val 


He 


He 




115 










120 










125 


Phe 


Asp 


Lys 


Phe 


Ala 


Ala 


Lys 


Pro 


Val 


Phe 


He 


Ser 




130 










135 








140 




Ser 


Asp 


Ser 


Arg 


Ala 


Lys 


Val 


Ala 


Thr 


Asn 


He 


Leu 


Arg 


145 


Phe 








150 










155 




Tyr 


Asp 


Ala 


Lys 


Val 


Lys 


Ser 


Ser 


Val 


Thr 


Thr 


Leu 










165 










170 








Ser 


Leu 


Lys 


Ala 


Lys 


He 


Ser 


Tyr 


Thr 


Va 1 


Asp 


Met 


Ala 








180 










185 










His 


Tyr 


Asp 


Ser 


He 


He 


Pro 


Leu 


Pro 


He 


Ser 


Thr 


Phe 






195 










200 










205 


He 


Leu 


Ala 


Tyr 


Arg 


Gin 


Thr 


Pro 


Ser 


Leu 


He 


Arg 


Lys 




210 










215 










220 



30 



80 

Asn Gly Als 

95 

Phe Gly Let 
110 



160 

Lys Lys Asp 

175 
Ser Pro Tyr 
190 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



Pfie nsti Leu Ala 
^ S 

L^iu Arg A^p hsn 

5 Glu Al A^p Thr 

2 60 

Leu Ser Glu Afp 
275 

h-jS Arg Thr Ala 
10 2 90 

Ser Leu Glu Val 
305 

Val Arg Pro hys 

15 Leu Tyr Arg Gin 

3 4 0 

Leu Gly Ala Phe 

355 

He Ser Gly Leu 
20 370 

Trp Asp Ala Ser 
385 

He Gly Fro Gly 

25 Gly Gly Glu A^n 
420 

Thr Gly Asn Arg 
435 

Ser Tyr Asii Met 
30 450 

Plie Pro Gly Leu 
4 65 

Phe Gin Ala Ser 

35 Tyr Ser Phe Gly 
500 

His Arg ills Ala 
515 

His Gin Thr Glu 
40 530 

Leu Leu Ser Leu 
545 

Thr Phe Asn Lys 

45 Gin Phe Gly Leu 

580 

Ala Ala Gly Lys 
595 

Phe Ser Gin Phe 

50 610 

He Afp Arg Asn 

625 

Tyr Ser Tyr Gly 

55 Val Gly Gly Ala 
660 

Pro Gly Arg Phe 
67 5 

Gin Val Gly Glu 
60 690 

Leu Phe Gly Asp 
705 

Trp Leu Leu Arg 

65 Val Gly Ser Val 
740 

Val Gly Leu Arg 
755 

Gly Phe Gly Leu 
70 770 

Asn He Pro Arg 
785 

Tyr Pro Phe 

75 



Lys 


Leu 


His 


Glu 


Gl u 


Ara 




230 










i y 


Tyr 


Tyr 


Tyr 


Phe 


Arg 


24 5 










2 50 


Leu 


Leu 


Val 


Arg 


i^iy 


Ala 










265 




Thr 


Pro 


Pro 


Gin 


Ala 


Met 








280 






Val 


Leu 


Leu 


Gly 


Met 


Asn 






2 95 








Glu 


Asp 


t let 


Lys 


Val 


Leu 




310 










He 


Leu 


Al a 


Lys 


Arg 


Phe 


32 5 










330 


Lys 


Asp 


Asp 


Gl u 


Thr 


Thr 










34 5 




Ser 


Val 


He 


Asp 


Leu 


Asn 








360 






Leu 


Asp 


Val 


Arg 


Leu 


Leu 






J / J 








Leu 


Glu 


Th r 


Leu 


Phe 


Thr 




3 90 










Leu 


Asn 


Phe 


Ala 


Leu 


Ala 


405 










410 


Leu 


Ser 


Trp 


Asn 


He 


Gly 










4 25 




Pro 


Glu 


Asn 


Ser 


Ser 


Asn 








4 4 0 






Asn 


Thr 


Ala 


Val 


Asn 


Leu 






455 








Leu 


Asp 


Lys 


Tyr 


Tyr 


Tyr 




470 










Ala 


Thr 


Ala 


Leu 


Asn 


Arg 


485 










4 90 


Phe 


Ser 


Thr 


Thr 


Tyr 


Glu 










505 




He 


Phe 


Pro 


Leu 


Lys 


Leu 








520 






Thr 


Phe 


Gin 


Ala 


He 


Thr 






535 








Gin 


Ser 


Gin 


Phe 


Leu 


Ala 




550 










Ser 


Val 


Ser 


Glu 


Lys 


Ser 


565 










570 


Ser 


Glu 


Ala 


Gly 


Asn 


Leu 










585 




Lys 


Tyr 


Ser 


Asp 


Thr 


Lys 








600 






He 


Lys 


Al a 


Thr 


Gly 


Glu 






615 








Gin 


Ser 


Leu 


Ala 


Thr 


Arg 




630 










Asn 


Met 


Arg 


Val 


Ala 


Pro 


64 5 










650 


Asn 


Ser 


He 


Arg 


Ala 


Phe 










665 




Asn 


Pro 


Asp 


Ser 


Asp 


Asn 








680 






Phe 


Lys 


Leu 


Glu 


Ala 


Asn 






6 95 








Leu 


His 


Ala 


Ala 


Val 


Phe 




710 










Glu 


Asp 


Ser 


Ser 


Arg 


Pro 


725 










730 


Ser 


Asn 


Phe 


Leu 


Asn 


Ser 










745 




Tyr 


Asp 


Leu 


Ala 


Phe 


Leu 








760 






His 


Leu 


Pro 


Tyr 


Asn 


Thr 






775 








Phe 


Lys 


Asp 


Ala 


He 


Gly 



790 



SUBSTITUTE 



249/ 490 



G 1 i; 


Thr 


He 


Ser 


Ala 


Leu 


2 35 










24 0 


Pro 


Gin 


Asp 


He 


He 


Tyr 










255 




Val 


Cys 


Leu 


Arc 


Ala 


Lys 








27 0 






Arg 


Pro 


Trp 


Arg 


He 


Gly 






285 








Gly 


Glu 


Ser 


Pro 


Thr 


Asp 




300 










Tyr 


Tyr 


Arg 


Lys 


I let 


Pro 


315 










320 


Arg 


Phe 


Phe 


Ser 


Gly 


Asn 










335 




Arg 


Lys 


Ser 


Leu 


Al a 


Arg 








350 






Phe 


Leu 


Gin 


Arg 


Asp 


Ser 






365 








Thr 


Thr 


Leu 


Asp 


Lys 


Pro 




380 










Ser 


Lys 


Ser 


Asn 


Asp 


Phe 


395 










4 00 


Arg 


Arg 


Asn 


Val 


Phe 


Gly 










415 




Gly 


Ser 


Tyr 


Glu 


Trp 


Glu 








4 30 






Arg 


Leu 


He 


Asp 


He 


Asn 






4 4 5 








Ser 


Phe 


Pro 


Ser 


He 


Val 




4 60 










Tyr 


Pro 


Thr 


Thr 


Thr 


Thr 


475 










480 


Ala 


His 


Tyr 


Phe 


Ser 


Met 










4 95 




Phe 


Gin 


Pro 


Ser 


Lys 


Glu 








510 






Asn 


Tyr 


Asn 


Leu 


Leu 


Gly 






525 








Ala 


Asn 


Asn 


Pro 


Pro 


Leu 




54 0 










Gin 


Het 


Gly 


Tyr 


He 


Tyr 


555 










560 


Pro 


His 


His 


Leu 


Trp 


Met 










57 5 




Leu 


Asn 


Leu 


He 


Tyr 


Leu 








5 90 






Asn 


Phe 


Val 


Gly 


Val 


Pro 






605 








Leu 


Arg 


Tyr 


Ser 


Tyr 


Thr 




620 










Phe 


Gly 


Thr 


Gly 


Val 


He 


635 










640 


Tyr 


Ser 


Glu 


Gin 


Phe 


Tyr 










655 




Thr 


Val 


Arg 


Ser 


He 


Gly 








67 0 






Gin 


Tyr 


Ser 


Tyr 


Leu 


Asp 






685 








Val 


Glu 


Tyr 


Arg 


Gly 


Lys 




700 










Leu 


Asp 


Ala 


Gly 


Asn 


Val 


715 










720 


Gly 


Gly 


Ala 


Leu 


Ser 


Glu 










735 




He 


Ala 


Leu 


Gly 


Thr 


Gly 








750 






Val 


Val 


Arg 


Val 


Asp 


Val 






765 








Gly 


Lys 


Lys 


Gly 


Tyr 


Tyr 




780 










Phe 


His 


Leu 


Ala 


Val 


Gly 


7 95 










800 



SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO_9929870A1J_> 




wo 99/29870 PCT/AU98/01023 

250/490 

(C) IllFOKriATlOr; for SEQ id tjO:327 

(i) SEQUEIJCE CHARACTERISTICS: 

(A) LEMGTH: 783 amino acid:e 
5 (E) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

10 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAI JISM : Porphyromonas gingi valis- 
lS (ix} FEATURE: 

(A) NAIIE/KEY: misc_f eature 

(B) LOCATION 1 . . .783 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(::i) SEQUENCE DESCRIPTION: SEQ ID 110:327 



Ser 


Leu 


Ser 


Pro 


Tyr 


He 


Arg 


Phe 


Pro 


Met 


Ser 


Ser 


His 


Ser 


Val 


Arg 


1 








5 










10 










15 


Tyr 


Leu 


He 


Gly 
20 


He 


Ala 


Gly 


Cys 


Leu 

25 


Leu 


Leu 


Met 


Leu 


Ala 
30 


Ser 


Ser 


Cys 


Ser 


Val 


Thr 


Arg 


Tyr 


Val 


Pro 


Asp 


Gly 


Ser 


Arg 


Leu 


Leu 


Asp 


Arg 






35 










40 










45 




Val 


Thr 

50 


He 


Ala 


Ser 


Glu 


Thr 
55 


Gly 


Ser 


He 


Ala 


Leu 
60 


Pro 


Glu 


Asp 


He 


Arg 


Asp 


Tyr 


Thr 


Leu 


Gin 


Gin 


Pro 


Asn 


Tyr 


Arg 


Leu 


Phe 


Gly 


Met 


Thr 


65 










70 










75 








80 


Arg 


Trp 


Leu 


Leu 


Arg 
85 


Val 


Tyr 


Ser 


Ser 


Ser 
90 


Asn 


Pro 


Asn 


Ser 


Asn 
95 


Ser 


Trp 


Trp 


Asn 


Arg 


Ser 


Leu 


Arg 


Lys 


Met 


Gly 


Glu 


Pro 


Pro 


Val 


Leu 


He 








100 










105 








110 






Asp 


Ser 


Val 


Leu 


Thr 


Asp 


Arg 


Thr 


Ala 


Asn 


Arg 


Leu 


Ala 


Lys 


Ala 


Met 






115 










120 










125 






Ala 


Gly 


Asp 


Gly 


Phe 


Leu 


Asp 


Ala 


Thr 


Ala 


Arg 


Ala 


Val 


Val 


Asp 


Thr 




130 










135 








140 










Gly 


Leu 


Tyr 


Lys 


Lys 


Ala 


Arg 


He 


Thr 


Tyr 


Leu 


He 


Gin 


Pro 


Gly 


Ser 


145 










150 










155 








160 


Arg 


Tyr 


Tyr 


He 


Arg 


Asn 


Met 


Ala 


Leu 


Asp 


Val 


Lys 


Asn 


Pro 


Leu 


Leu 










165 










170 








175 




Pro 


Pro 


Val 


Ala 


Leu 


Gly 


Asn 


Ser 


Leu 


Pro 


Ser 


Ala 


Tyr 


Lys 


Val 


Gly 








1 Rn 
i. o u 










185 










1 90 




lie 


Ser 


Glu 


Gly 


Ser 


Pro 


Leu 


Ser 


Pro 


He 


Val 


Leu 


Asp 


Glu 


Glu 


Arg 






195 










200 










205 






Lys 


Ala 


He 


Ala 


Arg 


His 


Met 


Arg 


Asn 


Asn 


Gly 


Phe 


Trp 


Lys 


Phe 


Ser 




210 










215 










220 






Ala 


Glu 


Asp 


Val 


Tyr 


Tyr 


Glu 


Ala 


Asp 


Thr 


Thr 


Val 


Ser 


Gly 


Gly 


Ser 


225 










230 










235 






240 


Gly 


Thr 


Lys 


Ser 


Ala 


Asp 


Leu 


Lys 


Leu 


Val 


Val 


Asn 


Gly 


He 


Gly 


Arg 










245 










250 










255 


Tyr 


Pro 


Tyr 


Arg 


He 


Gly 


Arg 


Val 


Phe 


Phe 


His 


Ala 


Asp 


Tyr 


Asp 


Pro 








260 










265 










270 




Leu 


Glu 


Ser 


Asp 


Phe 


Arg 


Val 


Gin 


Glu 


Leu 


Pro 


Arg 


He 


Asp 


Ser 


He 






275 










280 










285 






Ser 


Arg 


Gly 


Asp 


Tyr 


Thr 


Val 


Tyr 


Tyr 


Gly 


Ser 


Arg 


Gly 


Arg 


Tyr 


He 




290 










295 










300 






Arg 


Ala 


Ser 


Ala 


Leu 


Thr 


Arg 


Ser 


Val 


Ser 


Val 


Thr 


Pro 


Gly 


Ala 


Phe 


305 










310 










315 








320 


Phe 


Cys 


Glu 


Asp 


Asp 


Val 


Glu 


Arg 


Ser 


Tyr 


He 


Lys 


Leu 


Asn 


Ala 


Leu 










325 










330 








335 




Pro 


He 


Val 


Arg 


Asn 


Val 


Asn 


He 


Arg 


Phe 


Val 


Glu 


His 


Asn 


Gly 


Lys 








340 










345 










350 


A^p 


Glu 


He 


Ala 


Leu 


Ala 


Asp 


Ser 


Ser 


Arg 


Leu 


Val 


Asp 


Cys 


Tyr 


He 






355 










360 










365 






Leu 


Thr 


Val 


Pro 


Ala 


Lys 


Ser 


Lys 


Ser 


Phe 


Glu 


Al a 


Glu 


Val 


Leu 


Gly 




37 0 










375 










380 








Thr 


Asn 


Ser 


Ala 


Gly 


Asp 


Phe 


Gly 


Ala 


Ala 


Leu 


Ser 


Leu 


Gly 


Phe 


Thr 


385 










390 










395 








400 


A-p 


Arg 


Asn 


Leu 


Phe 


Arg 


Gly 


Ala 


Glu 


Met 


Phe 


Asn 


He 


Lys 


Leu 


Lys 










4 05 










410 








415 


Gly 


Ala 


Tyr 


Glu 


Ala 


He 


Arg 


Lys 


Gly 


Ser 


His 


Ser 


Phe 


Met 


Glu 


Tyr 


Gly 






420 










425 










430 




Val 


Glu 


Ser 


Ser 


Leu 


Arg 


Phe 


Pro 


Arg 


Leu 


Leu 


Phe 


Pro 


Phe 


He 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID; <WO 9929870A1_ 




wo 99/29870 PCT/AU98/01023 

251/490 



55 



65 



70 



75 







435 










440 










4^5 








Ser 


Asp 


Glu 


Thr 


Arg 


Arg 


Arg 


Leu 


Arg 


Ala 


Ser 


Thr 


Gi u 


Trp 




He 




4 50 










455 










460 










Gly 


Tyr 


Asn 


Tyr 


Gl n 


Thr 


Arg 


Pro 


Glu 


Phe 


Asp 


Arg 


Val 


He 


Leu 


Ser 












4 70 










475 










480 


Ala 


Gin 


Leu 


Asn 


Tyr 


Ser 


Trp 


Gin 


Thr 


Tyr 


Leu 


His 


Asn 


Arg 


Leu 


Arg 










485 










4 90 










495 




His 


Thr 


He 


Arg 


Leu 


Leu 


Asp 


Val 


Asp 


Tyr 


Leu 


His 


Leu 


Pro 


Tyr 


He 








500 










505 










510 






Asp 


Pro 


Asp 


Phe 


Ala 


Gin 


Ser 


Leu 


Pro 


Pro 


Thr 


Thr 


Ala 


Leu 


Tyr 


Asn 






515 










520 










525 








Tyr 


Thr 


Glu 


Gin 


Phe 


He 


Leu 


Gly 


Ser 


Ala 


Tyr 


He 


Leu 


Asn 


Tyr 


Thr 




530 










535 










540 










Thr 


Ala 


Ser 


Ser 


Met 


Glu 


Arg 


Thr 


Val 


Ser 


Am 


Pro 


Phe 


Thr 


Ala 


Arg 


545 










550 










555 










560 


Phe 


Ser 


He 


Gin 


Thr 


Ala 


Gly 


Asn 


Leu 


Leu 


Gin 


Al a 


He 


Ser 


Tyr 


Leu 










565 










570 










575 




Thr 


Asp 


Ser 


Pro 


Lys 


Asp 


Glu 


Mis 


Gly 


Leu 


Tyr 


hys 


Met 


Phe 


Gly 


Leu 








580 










585 










590 






His 


Tyr 


Ala 


Gl n 


Phe 


Val 


Lys 


Leu 


Asp 


Leu 


Asp 


Leu 


Ala 


Lys 


Thr 


Val 






595 










600 










605 








Leu 


Leu 


Glu 


hys 


Asp 


Asn 


Thr 


Leu 


Ala 


Leu 


His 


Leu 


Gly 


Phe 


Gly 


Leu 




610 










615 










620 










Ala 


Phe 


Pro 


Tyr 


Gly 


Asn 


Ala 


Arg 


His 


He 


Pro 


Phe 


Glu 


Leu 


Arg 


Tyr 


625 










630 










635 










640 


Phe 


Ala 


Gly 


Gly 


Ser 


Asn 


Ser 


Val 


Arg 


Gly 


Trp 


Ser 


Val 


Arg 


Thr 


Leu 










64 5 










650 










655 




Gly 


Pro 


Gly 


Ser 


Met 


Lys 


Met 


Thr 


Pro 


Asp 


Lys 


Thr 


Phe 


Phe 


Asp 


Gin 








660 










665 










670 






Met 


Gly 


Asp 


He 


Arg 


Leu 


Asp 


Leu 


Asn 


Val 


Glu 


Tyr 


Arg 


Thr 


Lys 


Leu 






675 










680 










685 








Phe 


'Pro 


L j's 


Phe 


Ar 


Ala 


Al a 


Al a 


Phe 


Va 1 




Ala 


Gly 


Asn 


Va 1 


Trp 




690 










695 










700 










Thr 


He 


Lys 


Glu 


Tyr 


Gl u 


Asn 


Gin 


Glu 


Asp 


Gly 


Leu 


Phe 


Arg 


Phe 


Asp 


706 










710 










715 










720 


Arg 


Phe 


Tyr 


Lys 


Glu 


He 


Ala 


Leu 


Ala 


Tyr 


Gly 


Leu 


Gly 


Leu 


Arg 


Leu 










725 










730 










735 




Asp 


Phe 


Asp 


Tyr 


Phe 


Leu 


Val 


Arg 


Leu 


Asp 


Ala 


Gly 


Leu 


Lys 


Ala 


Tyr 








740 










745 










7 50 






Asp 


Pro 


Gin 


Gin 


Thr 


Gly 


Arg 


Tyr 


Lys 


Trp 


Ala 


He 


Thr 


Arg 


Pro 


Asn 






755 










760 










765 








Leu 


Ser 


Ser 


Asn 


Phe 


Ala 


Trp 


His 


He 


Ala 


Val 


Gly 


Tyr 


Pro 


Phe 






770 










775 










780 










(2) 


It 1 FORMATION 


FOR 


SEQ 


ID NO: 328 
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15 
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45 

(i) SEQUEtJCE CHARACTERISTICS: 

(A) LENGTH: 875 amino acids 

(B) TYPE: amino acid 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 
60 (A) NAI^IE/KEY: mi sc_f eature 

(B) LOCATION 1 ... 875 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 8 



Val 


Glu 


Ser 


Lys 


Leu 


Leu 


Cys 


Leu 


Met 


Arg 


Lys 


Arg 


He 


Leu 


Gin 


Leu 


1 


















10 










15 




Phe 


Leu 


Thr 


Ala 

20 


Leu 


Leu 


Leu 


Ala 


Leu 

25 


Gly 


Ser 


Ser 


Leu 


Ala 
30 


He 


Ala 


Gin 


Thr 


Val 
35 


Val 


Thr 


Gly 


Lys 


Val 
40 


He 


Asp 


Ser 


G]u 


Thr 

45 


Ser 


Glu 


Pro 


Leu 


He 
50 


Gly 


Val 


Ser 


Val 


Ser 
55 


Thr 


Gly 


Gin 


Gly 


Ala 
60 


Ser 


Leu 


Arg 


Gly 


Val 


Thr 


Thr 


Asp 


Met 


Asp 


Gly 


Gly 


Phe 


Arg 


Phe 


Glu 


Val 


Pro 


Ala 


Lys 


65 










70 










75 










80 


Ser 


Val 


Leu 


Thr 


Phe 


Arg 


Cys 


Val 


Gly 


Tyr 


Ala 


Thr 


Val 


Thr 


Arg 


Ser 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 











85 










90 










95 




lie 


Gly 


Arg 


Gly 


Ser 


Gin 


Glu 


Asp 


Leu 


Gly 


Thr 


He 


Leu 


Leu 


Asp 


Pro 








1 00 










105 










HO 






Gin 


Ala 


lie 


Gly 


Leu 


Asp 


Glu 


He 


Gin 


\' a i 


He 


Ala 


Ser 


Val 


Val 


Pro 






115 










120 










125 








L/C 


Asp 


Arg 


IK't 


Thr 


Pro 


Val 


Pro 


Val 


Ser 


Asn 


He 


Arg 


Val 


Ala 


Asp 




130 










135 










14 0 










lie 


Gin 


Aiu 


Ala 


Ser 


Leu 


Asn 


Val 


Glu 


Fhe- 


Pro 


Gl u 


Leu 


Val 


Lys 


Ser 


1 J5 










150 










155 










160 


Thr 


Pro 


Ser 


Thr 


Tyr 


Thr 


Thr 


Lys 


Gly 


Ser 


Gly 


Gly 


Phe 


Gly 


Asp 


Gly 










165 










170 










176 




Aig 


Thr 


Asn 


Va 1 


Arg 


Gly 


Phe 


Asp 


Thr 


Tyr 


Asn 


Phe 


Gly 


Val 


Leu 


He 








180 










185 










190 






Pisn 


Giy 


Val 


Pro 


Val 


Asn 


Gly 


1-1 et 


Glu 


Asp 


Gly 


Lys 


Val 


Tyr 


Trp 


Ser 






195 










200 










205 








hsn 


Trp 


Ser 


Gly 


Leu 


Met 


Asn 


Gin 


Ala 


Ser 


Thr 


He 


Gl n 


He 


Gin 


Arg 




210 










215 










220 










Giy 


Leu 


Gly 


Ala 


Ser 


Lys 


Leu 


Gly 


He 


Ser 


Ser 


Val 


Gly 


Gly 


Thr 


Met 


225 










230 










235 










240 


A-n 


lie 


He 


Thr 


Lys 


Thr 


Thr 


Asp 


Ala 


Asn 


Thr 


Gly 


Gly 


Ser 


Ala 


Tyr 










245 










2 50 










255 




Val 


Gly 


net 


Gly 


Asn 


Asp 


Gly 


Leu 


His 


Lys 


Gl u 


Ser 


Phe 


Ser 


He 


Ser 








260 










265 










270 






Thr 


Gly 


I let 


Asn 


Asp 


Gly 


Trp 


Ala 


He 


Thr 


He 


Ala 


Gly 


Ser 


His 


Met 






27 5 










280 










285 








Thr 


Giy 


Leu 


Gly 


Tyr 


Val 


Lys 


Gly 


Leu 


Lys 


Gly 


Arg 


Ala 


Phe 


Ser 


Tyr 




2 90 










295 










3 00 










Phe 


Phe 


Asn 


Va 1 


Ser 


Lys 


Lys 


Phe 


Asn 


Glu 


Arg 


His 


Thr 


Leu 


Ser 


Leu 


305 










310 










315 










320 


Thr 


Gly 


Phe 


Gl y 


Ala 


Pro 


Gin 


Trp 


His 


Asn 


Gin 


Arg 


Ser 


Ser 


Lys 


Tyr 










325 










330 










335 




Ser 


Val 


Ala 


Asp 


Tyr 


Asp 


Lys 


Tyr 


Gly 


He 


Arg 


His 


Asn 


Gin 


Ser 


Phe 








34 0. 










345 










350 






Gly 


Tyr 


Leu 


Arg 


Gly 


Glu 


Leu 


Thr 


Pro 


Thr 


Ala 


Tyr 


Ala 


Tyr 


Asn 


Thr 






355 










360 










365 








Tyr 


His 


Lys 


Pro 


Gin 


Phe 


Ser 


Leu 


Asn 


His 


Phe 


Trp 


Lys 


Met 


Asp 


Glu 




370 










375 










380 










Asn 


Thr 


Ser 


Leu 


Tyr 


Thr 


Ala 


Unk 


Tyr 


Ala 


Ser 


Leu 


Ala 


Thr 


Gly 


Gly 


385 










390 










395 










400 


Gly 


Arg 


Arg 


Ala 


Tyr 


Gly 


Lys 


Asn 


Ser 


Lys 


Trp 


Val 


Leu 


He 


Asn 


Tyr 










405 










410 










415 




Asn 


Thr 


Gly 


Gin 


Pro 


Tyr 


Glu 


Gin 


Thr 


Lys 


Val 


Thr 


Pro 


Asp 


Gly 


Leu 








420 










425 










430 






lie 


Asp 


Tyr 


Asp 


Ala 


Val 


Leu 


Ala 


Ala 


Asn 


Ala 


Ala 


Ala 


Ser 


Asn 


Gly 






435 










440 










445 






Ser 


Gi u 


Ala 


He 


Phe 


Ala 


Leu 


Gly 


Ser 


Asn 


Ser 


His 


Lys 


Trp 


Phe 


Gly 




450 










455 










460 










Leu 


Leu 


Ser 


Ser 


Phe 


Lys 


Lys 


Lys 


Leu 


Asn 


Ser 


Ser 


Leu 


Thr 


Leu 


Thr 


465 










470 










4 75 










480 


Ala 


Gly 


Tyr 


Asp 


Gly 


Arg 


Tyr 


Tyr 


Arg 


Gly 


Asp 


His 


Tyr 


Asp 


Lys 


He 










485 










4 90 










4 95 




Thr 


Asp 


Leu 


Leu 


Gly 


Gly 


Ser 


Tyr 


Tyr 


He 


Glu 


Asp 


Pro 


Lys 


Thr 


Lys 








500 










505 










510 






Leu 


Ala 


Tyr 


His 


Ala 


Glu 


Gly 


Gin 


Gin 


Leu 


Lys 


Val 


Gly 


Asp 


He 


Val 






5X5 










520 








525 








Asn 


Arg 


Asp 


Tyr 


Thr 


Gly 


Glu 


He 


Met 


Trp 


His 


Gly 


Leu 


Phe 


Ala 


Gin 




530 










535 










540 










Met 


Glu 


His 


Ser 


Ser 


Glu 


Trp 


He 


Asp 


Ala 


Phe 


Val 


Ser 


Gly 


Ser 


He 


545 










550 










555 










560 


Asn 


Tyr 


Glu 


Leu 


Tyr 


Arg 


Asn 


His 


Asn 


Tyr 


Gly 


Gly 


Ser 


Lys 


Ser 


Thr 










565 










570 










575 




Gly 


Tyr 


Leu 


Pro 


Gly 


Val 


Ser 


Pro 


Trp 


Lys 


Ser 


Phe 


Leu 


Pro 


Trp 


Ser 








580 










585 










590 






Gly 


Lys 


Ala 


Gly 


Leu 


Ser 


Tyr 


Lys 


Phe 


Ala 


Gin 


Gly 


His 


Asn 


Val 


Phe 






595 










600 










605 








Ala 


Asn 


Gly 


Gly 


Phe 


Phe 


Thr 


Arg 


Ala 


Pro 


Leu 


Phe 


Gly 


Asn 


He 


Tyr 




610 










615 










620 






Ala 


Ala 


Gly 


Ala 


He 


He 


Pro 


Asn 


Asp 


Lys 


Ala 


Asn 


Met 


Glu 


Lys 


Val 


625 










630 










635 








640 


Leu 


Thr 


Gly 


Glu 


Val 


Gly 


Tyr 


Gly 


Phe 


Thr 


Asn 


His 


Lys 


Asn 


Phe 


Glu 










645 










650 










655 




Phe 


Asn 


He 


Asn 


Gly 


Tyr 


Tyr 


Thr 


Lys 


Trp 


Met 


Asp 


Arg 


Val 


Thr 


Ser 








660 










665 










670 






Lys 


Arg 


He 


Gly 


Asn 


Glu 


Tyr 


Val 


Tyr 


Leu 


Asn 


Gly 


Val 


Asp 


Ala 


Val 






675 










680 










685 









SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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253/490 



10 



15 



20 



25 



45 



50 



55 



60 



65 



70 



75 



His 




Gly 


Val 


Gl u 


Ai a 


Glu 


Val 


Ser 


Tyr 


Arg 


Pro 


lie 


Arg 


Gin 


He 




6 90 




















7 00 












Leu 


Arg 


Gly 


Met 


Phe 


Se r 


Leu 


Gly 


Asp 


Trp 


Thr 


Trp 


Gin 


Asn 


Asn 


7 05 










710 










715 










72 0 


Vol 


Sei- 


Tyr 


Thr 


Ser 


Tyr 


Asp 


Glu 


Ala 


Gly 


Asn 


Glu 


Thr 


Gly 


Gin 


Asp 










725 










730 










735 




lie 


Thr 


Tyr 


Tie 


Lys 


Gly 


Leu 


Hi s 


Val 


Gly 


Asp 


Ai a 


Ala 


Gin 


Met 


Thr 








74 0 










74 5 










750 






hlh 


Ala 


Val 


Ser 


Ala 


Asp 


lie 


Glu 


Leu 


Fhe 


Lys 


Gly 


Phe 


His 


Val 


He 






755 










7 60 










7 65 








Gly 


Ly^ 


Tyr 


A^n 


Phe 


Leu 


Gly 


Lys 


Asn 


Tyr 


Ala 


Gly 


Phe 


Asn 


Pro 


AI a 




770 










775 










7 80 










Thr 


Arg 


Asn 


Ala 


Gin 


Gl n 


Tyr 


Gl u 


Ala 


Asp 


Gly 


Lys 


Gl u 


lie 


Val 


Gl u 


7 85 










7 90 










795 










800 


Ser 


Trp 


Lys 


Leu 


Pro 


Asp 


Val 


Gly 


Leu 


Phe 


Asp 


Leu 


Ser 


Ala 


Ser 


Tyr 










005 










810 










815 




Asn 


Phe 


Lys 


Leu 


Gly 


Ser 


Leu 


Ser 


Thr 


Thr 


Phe 


Tyr 


Phe 


Asn 


Met 


Asp 








820 










825 










830 






Asn 


V3l 


Ala 


Asp 


Lys 


Arg 


Tyr 


Va 1 


Ser 


Asp 


Ala 


Asp 


Asp 


Asn 


He 


He 






835 










840 










845 








Gly 


Ly^ 


Lys 


His 


Asp 


Glu 


Ala 


Ser 


Ala 


Leu 


Val 


Trp 


Tyr 


Gly 


Phe 


Gly 




850 










855 










860 










Arg 


Thr 


Trp 


Ser 


Thr 


Gly 


lie 


Arg 


Val 


Asn 


Phe 












865 










87 0 










875 













(2) IIir0RI!ATlOn FOR SCQ ID tiO:329 



(i) SEQUEIJCE CHARACTERISTICS: 

(A) LEriGTH: 460 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

35 (iii) HYPOTHETICAL: YES 

( vi ) ORIGINAL SOURCE : 

(A) ORGAtJISM: Porphyromonas gingivali^ 

40 (ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 ... 460 



(Mi) SEQUENCE DESCRIPTIOII: SEQ ID NO: 329 



Asn 


Arg 


Cys 


Asn 


Pro 


Ser 


Arg 


Gin 


Tyr 


Phe 


His 


Leu 


Ser 


Gin 


Asn 


Asp 


1 








5 










10 










15 




Val 


Gly 


Arg 


Ser 


Leu 


Arg 


Ala 


Phe 


Leu 


He 


His 


Leu 


Ser 


Met 


Lys 


Phe 








20 










25 










30 






Ser 


He 


Arg 


Leu 


Phe 


Leu 


Cys 


He 


He 


Phe 


Leu 


Leu 


Ser 


Ala 


Phe 


He 






35 










40 










45 








Leu 


Pro 


Ala 


Leu 


Gly 


Gin 


Lys 


Ser 


Lys 


Gin 


Val 


Gin 


Arg 


Leu 


Glu 


Lys 




50 










55 










60 










Gin 


Arg 


Lys 


Glu 


Ala 


Leu 


Lys 


Ala 


He 


Glu 


Lys 


Thr 


Asp 


A.rg 


Glu 


Leu 


65 










70 










75 










80 


Arg 


Asn 


Thr 


Lys 


Lys 


Asp 


Lys 


Gin 


Asp 


Lys 


Gin 


Lys 


His 


Leu 


Asn 


Leu 










65 










90 










95 




Leu 


Asn 


hys 


Gin 


Val 


Ala 


Gin 


Arg 


Lys 


Gin 


Met 


Val 


Gin 


Leu 


Leu 


Asp 








100 










105 










110 






Asn 


Glu 


Val 


Lys 


Glu 


Leu 


Gin 


Ser 


Asp 


He 


Asp 


Ser 


Met 


Thr 


Gly 


Val 






115 










120 










125 








Cys 


His 


Gin 


Leu 


Ser 


Val 


Glu 


Glu 


Lys 


Ala 


Arg 


Ser 


Asp 


Glu 


Tyr 


Ala 




130 










135 










140 










Gl n 


Ala 


Leu 


Gin 


Ser 


Met 


Gin 


Lys 


Arg 


Lys 


Arg 


Ser 


Leu 


Asp 


Arg 


He 


145 










150 










155 










160 


Leu 


Phe 


He 


Ser 


Ser 


Ala 


Lys 


Ser 


Phe 


Asp 


Glu 


Gly 


Met 


Arg 


Arg 


Met 










165 










170 










175 




Arg 


Phe 


Leu 


Glu 


Gin 


Tyr 


Ala 


Ser 


Ala 


Tyr 


Lys 


Leu 


Ala 


Ser 


Val 


Arg 








180 










185 










190 






Leu 


Arg 


Asp 


Thr 


Arg 


Ser 


Lys 


Leu 


Glu 


Thr 


Glu 


Arg 


Ala 


Thr 


Val 


Glu 






195 










200 










205 








Asp 


Ala 


Lys 


Lys 


Glu 


Lys 


Gly 


His 


Leu 


Leu 


Val 


He 


Arg 


Glu 


Glu 


Glu 




210 










215 










220 










Lys 


Lys 


Lys 


Leu 


Glu 


Gly 


Gin 


Gin 


Ala 


Glu 


Gin 


Arg 


Arg 


Gin 


Val 


Gin 


225 










230 










235 










240 
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10 



15 



20 



25 



30 



40 



45 



50 



55 



60 



65 



70 



75 



Al a 


Leu 


Gly 


Ala 


Lys 
24 5 


Gin 


Lys 


Asp 


Leu 


Glu 
2 50 


Al a 


Gin 


Leu 


Arg 


Lys 
255 


Gl n 


Ly.-: 


Ly3 


Gin 


Ala 
2 60 


Glu 


Al a 


Leu 


Asn 


Arg 
2 65 


Lys 


lie 


Giu 


Lys 


Gin 
270 


He 


Ala 


Lys 


Gl u 


He 


Glu 


Al a 


Ala 


Glu 


Arg 


Arg 


Ala 


Arg 


Giu 


Glu 


Arg 


Glu 


Arg 






27 5 










280 










285 






Leu 


Al a 


Arg 


Glu 


Ala 


Lys 


Ala 


Lys 




Lys 


Pro 


Val 


Pro 


Al a 


Giu 


Pro 




2 90 










2 95 








300 










Glu 


Arg 


Lys 


Al a 


Glu 


Thr 


Lys 


Gly 


Gly 


Tyr 


Al a 


net 


Asp 


Ai a 


Ser 


Glu 


305 










310 








315 










320 


Ai'g 


Al a 


Leu 


Ser 


Gly 


Ser 


Phe 


Ala 


Gin 


Asn 


Lys 


Gly 


Arg 


Leu 


Pro 


Gly 










325 










330 










335 


Pro 


Va 1 


Arg 


Gly 


Arg 


Tyr 


Arg 


He 


Val 


Ser 


Asp 


Phe 


Gly 


Val 


His 


Gin 








34 0 










345 








350 






His* 


Ser 


Giu 
355 


Leu 


Lys 


Lys 


Val 


Gin 

360 


Val 


Asn 


Asn 


Gly 


Gly 
365 


He 


Asp 


He 


Ala 


Va 1 


Al a 


Thr 


Gly 


Ser 


Asp 


Ala 


Thr 


Ser 


Val 


Phe 


Asp 


Gly 


Val 


Val 




370 










375 










380 








Ser 


Ser 


Val 


Phe 


Val 


He 


Pro 


Gly 


Tyr 


Asn 


Ser 


Ala 


Vai 


[iet 


Val 


Arg 


305 










390 










395 










400 


His 


Gly 


Asn 


Tyr 


He 
405 


Thr 


Val 


Tyr 


Ala 


Asn 
410 


Leu 


Ser 


Lys 


Val 


Tyr 
415 


Val 


Asii 


Ser 


Gly 


Thr 


Arg 


Val 


Lys 


Thr 


Gly 


Gin 


Ala 


Leu 


Gly 


Arg 


Ala 


Tyr 








420 










425 










430 




Thi- 


A-p 


Pro 


Ser 


Asn 


Asn 


Gin 


Thr 


He 


lie 


His 


Phe 


Glu 


He 


Trp 


Lys 






435 










440 










445 




Gl u 


Arg 
450 


Ser 


Ly^ 


Gin 


Asn 


Pro 
455 


Arg 


Leu 


Trp 


Leu 


Arg 
4 60 











(2) Il-JfORMATIOH FOR SEQ ID UO:330 



(i) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 342 amino acids 

(B) TYPE: amino acid 
35 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANIS!1: Porphyromonas gingivalis 

{ix) FEATURE: 

(A) NAI4E/KEY: misc_feature 

(B) LOCATION 1 . . .34 2 

{>:i ) SEQUENCE DESCRIPTION: SEQ ID NO:330 



Ser 


Phe 


Tyr 


Gin 


Glu 


He 


Asp 


Arg 


Leu 


Met 


Lys 


Lys 


Tyr 


Leu 


Leu 


Tyr 


1 








5 










10 










15 


Ala 


Ser 


Leu 


Leu 

20 


Thr 


Ser 


Val 


Leu 


Leu 
25 


Phe 


Ser 


Cys 


Ser 


Lys 
30 


Asn 


Asn 


Pro 


Asn 


Glu 


Pro 


Val 


Glu 


Asp 


Arg 


Ser 


He 


Glu 


He 


Ser 


He 


Arg 


Val 






35 










40 










45 






Asp 


Asp 


Phe 


Thr 


Lys 


Thr 


Gly 


Glu 


Ala 


Val 


Arg 


Tyr 


Glu 


Arg 


Asn 


Gin 




50 










55 










60 








Gly 


Ser 


Ala 


Ala 


Glu 


Arg 


Leu 


He 


Thr 


Asn 


Leu 


Tyr 


Leu 


Leu 


Leu 


Phe 


65 










70 










75 








80 


Asp 


Gin 


Ser 


Gly 


Ala 


Asn 


Pro 


Ala 


Lys 


Tyr 


Tyr 


He 


Thr 


Gly 


Asn 


Thr 










85 










90 








95 




Phe 


Thr 


Gly 


Gly 
100 


Thr 


Trp 


Leu 


Pro 


Asp 
105 


Asp 


Met 


Lys 


Val 


Lys 
110 


Leu 


Asp 


net 


Thr 


Gin 
115 


Ser 


Glu 


Ala 


Gly 


Glu 
120 


Arg 


Lys 


Val 


Tyr 


Val 
125 


Val 


Ala 


Asn 


Val 


Asp 


Asn 


Ala 


Val 


Lys 


Thr 


Ala 


Leu 


Asp 


Ala 


Val 


Ala 


Asn 


Glu 


Ser 




130 










135 








140 










Asp 


Leu 


Gin 


Thr 


Val 


Lys 


Arg 


Thr 


Thr 


Ala 


Met 


Pro 


Trp 


Ser 


Thr 


Asp 


145 










150 










155 








160 


He 


Ala 


Ser 


Pro 


Phe 


Leu 


Met 


Ser 


Gly 


Asn 


Lys 


Thr 


His 


Asp 


Phe 


Leu 


Ala 








165 










170 








175 




Asn 


Arg 


Leu 


Leu 


Asp 


Asn 


Val 


Pro 


Leu 


Val 


Arg 


Ala 


lie 


Ala 


Lys 








180 










185 








190 




Val 


Glu 


Leu 


Asn 


He 


Ser 


Leu 


Ser 


Glu 


Lys 


Phe 


Gin 


He 


Val 


Pro 


He 






195 










200 








205 
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BNSDOCID: <WO 9929S70A1_t_> 
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255/490 

12% Val A^n Giy Ser Leu Ser Gl u Phe Ly^ Phe Arg Tyr Val Asn Ph^ 

210 215 2 20 

Asp Lys Glu Thr Tyr Val Vol hys Fro Thr Th.r Ly^ Pro Asp Asn Leu 
-25 230 235 240 

5 He Ser Ser Ala Aen Gly Val Trp Pro Gin He Thr Asp Trp Thr Val 
24S 250 255 

Trp Gly Ala Ser Leu Asn Thr Ser Pro Ala Pro Asp Ala Gly Thr Gly 

260 265 270 

Tyr Thr Leu Asp Ala Asn Gly Lys Val Thr Ala Leu Arg He Val Thr 
10 275 280 285 

Tyr Leu Asn Glu Arg Asp Ser Lys Gly Ala Thr Val Glu Val Ala Leu 

290 295 300 

Pro Arg Val Asp Asp Gly Thr Leu Pro Pro Pro Glu Phe Gly Pro Glu 
305 310 315 320 

15 Leu Tyr Arg Leu Pro Leu Pro Asp Lys He Leu Arg Asn His Trp Tyr 

325 330 335 

Lys Tyr Glu Val Glu He 
340 

20 (2) lUFORMATION FOR SEQ ID MO: 331 

(i) SEQUEIJCE CHARACTERISTICS; 

(A) LEIIGTH: 329 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



30 



40 



45 



50 



55 



60 



65 



70 



75 



{vi) ORIGIIIAL SOURCE: 

(A) ORGAMISM: Porphyromonas gingivalis 



(ix) FEATURE: 
35 (A) rJAI4E/KEY: misc_feature 

(B) LOCATION 1 ... 329 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331 



Asn 


Leu 


Arg 


Thr 


Gin 


Pro 


Phe 


Phe 


Leu 


Val 


Asp 


Leu 


Gin 


Ser 


Met 


He 


1 








5 










10 










15 




Arg 


Thr 


He 


Leu 


Ser 


Arg 


Tyr 


Val 


Ser 


Ser 


Asn 


Phe 


Trp 


Ser 


Arg 


Gly 








20 










25 










30 


Ala 


Thr 


Phe 
35 


Phe 


Phe 


Thr 


He 


Phe 
40 


Pro 


Ala 


Phe 


He 


Leu 
4b 


Ala 


Ala 


Thr 


Ala 


Leu 


Pro 


Ala 


Cys 


Gly 


Gly 


Gly 


Thr 


Ala 


Ser 


Gly 


Ser 


Asp 


Arg 


Thr 




50 










55 










60 






Leu 


Ala 


Val 


Thr 


He 


Glu 


Pro 


Gin 


Ly^ 


Tyr 


Phe 


He 


Glu 


Ser 


He 


Ala 


65 










70 






75 










80 


Asp 


hys 


Ser 


Val 


Gin 
85 


Val 


Val 


Ala 


Leu 


Val 
90 


Pro 


Ala 


Gly 


Ser 


Asn 
95 


Pro 


Glu 


Glu 


Tyr 


Asp 


Pro 


Ser 


Pro 


Thr 


Val 


Met 


Lys 


Arg 


Leu 


Ser 


Glu 


Ala 








100 










105 






110 






Asp 


Ala 


Tyr 


Phe 


Tyr 


He 


Gly 


Gly 


Leu 


Gly 


Phe 


Glu 


Gin 


Arg 


Asn 


Leu 






115 










120 










125 






Ala 


Ala 


He 


Arg 


Asp 


Asn 


Asn 


Pro 


Lys 


Leu 


Pro 


Leu 


Phe 


Gl u 


Met 


Gly 




130 










135 










X40 








Lys 


Ala 


Leu 


Ala 


Asp 


Ala 


Gly 


Ser 


Ala 


Asp 


Leu 


His 


Gly Ser 


Cys 


Thr 


145 










150 










155 








1 60 


Asp 


His 


Ser 


His 


Thr 
165 


Asp 


Leu 


His 


Ala 


His 
170 


Asp 


Pro 


His 


Tyr 


Trp 
175 


Ser 


Ser 


Val 


Val 


Gly 
180 


Ala 


Lys 


Ala 


Leu 


Ser 
185 


Arg 


Ala 


Ala 


Tyr 


Asp 
190 


Ala 


Leu 


Val 


Glu 


Leu 
195 


Tyr 


Pro 


Asn 


Glu 


Lys 
200 


Asp 


Lys 


Trp 


Asp 


Lys 
205 


Gly 


His 


Asp 


Arg 


Leu 


Asn 


Gly 


Arg 


He 


Asp 


Ser 


Val 


Lys 


Arg 


Leu 


Val 


Asp 


Thr 


Met 




210 










215 










220 








Phe 


Ala 


Asn 


Gly 


Lys 


Ala 


Asp 


Lys 


Ala 


Phe 


Val 


He 


Tyr 


His 


Pro 


Ser 


225 










230 










235 








240 


Leu 


Ser 


Phe 


Phe 


Ala 
245 


Gin 


Glu 


Phe 


Gly 


Leu 

250 


Arg 


Gin 


He 


Val 


He 
255 


Glu 


Glu 


Asp 


Gly 


Lys 


Glu 


Pro 


Thr 


Ala 


Ala 


His 


Leu 


Arg 


Arg 


Val 


He 


Asp 








260 










265 






270 




Gin 


Ala 


Arg 
275 


Ala 


Asp 


Gly 


Val 


Arg 
280 


He 


Val 


Phe 


He 


Gin 
2B5 


Pro 


Glu 


Phe 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCIO: <WO 9929870A1_I_> 



wo 99/29870 PCT/AU98/01023 

256/ 490 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Glu 


Thr Arn Gin 


Al a 


Glu 


Asp He Ala Arg 


'ji u 


He 


Gly 


Ala 


Arg 


Pro 




2 90 






295 




300 










Vol 


Arg He A^n 


Fro 


Leu 


Arg Ser Ser Trp 


Glu 


Glu 


Giu 


lie 


Leu 


His 


305 






31 0 




315 










320 


He 


Ala Arg Ala 


Leu 
325 


Ala 


His Glu Arg 














{2) 


II I FORI lATIOII 


FOR 


SEQ 


ID HO: 332 















10 (i) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 878 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 Hi) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
20 (A) ORGAIIISM: Porphyromonas gingival is 

(ir.) FEATURE: 

(A) MAIIE/KEY: misc_f eature 

(B) LOCATION 1 ... 878 



(Xi) SEQUEIJCE DESCRIPTION: SEQ ID NO: 332 



Ala 


Asp 


Ser 


He 


Arg 


Tyr 


Pro 


Leu 


Tyr 


Phe 


Phe 


Gly 


Arg 


Asn 


Arg 


Lys 


1 








5 










10 










15 




Lys 


Cys 


Phe 


Arg 


Glu 


Pro 


He 


Pro 


Thr 


Leu 


Tyr 


Asn 


Lys 


Asn 


Met 


He 








20 










25 










30 






Gly 


Lys 


Lys 


He 


Phe 


Phe 


He 


Leu 


Leu 


Ala 


Leu 


He 


Ala 


Phe 


Ser 


Gly 






35 










40 










45 








Leu 


A-n 


Ala 


Ala 


Thr 


Asp 


Thr 


Glu 


Phe 


Lys 


Tyr 


Pro 


Thr 


Asp 


Ala 


Asn 




50 










55 










60 










He 


He 


Gly 


His 


Val 


Lys 


Asp 


Ser 


Lys 


Thr 


Gly 


Glu 


His 


Leu 


Val 


Gly 


65 










70 










75 










80 


He 


Thr 


He 


Ala 


He 


Lys 


Gly 


Thr 


Thr 


Phe 


Gly 


Thr 


Ser 


Thr 


Asp 


Ala 










85 










90 










95 




Thr 


Gly 


His 


Tyr 


Tyr 


Leu 


Arg 


Asn 


Leu 


Arg 


Pro 


Gly 


Glu 


He 


Thr 


Leu 








100 










105 










110 






He 


Met 


Arg 


Gly 


Met 


Gly Tyr 


Lys 


Ser 


Gin 


Glu 


Arg 


Val 


Val 


Arg 


Val 






115 










120 










125 








Glu 


Lys 


Asp 


Lys 


Thr 


He 


Glu 


Val 


Asn 


Phe 


Giu 


Ala 


Glu 


Glu 


Asp 


Ala 




130 










135 










140 










He 


Asn 


Leu 


Asp 


Glu 


Val 


Val 


He 


Ser 


Ala 


Asn 


Arg 


Glu 


Leu 


Thr 


Leu 


145 










150 










155 










160 


Arg 


Arg 


Leu 


Ala 


Pro 


Thr 


Leu 


Val 


Asn 


Val 


Leu 


Asn 


Glu 


Lys 


Val 


Phe 










165 










17 0 










175 




Ser 


Gin 


Val 


Asn 


Ala 


Ser 


Asn 


Leu 


Ala 


Gin 


Gly 


Leu 


Ser 


Phe 


Gin 


Pro 








180 










185 










190 






Gly 


Val 


Arg 


Val 


Glu 


Asn 


Asn 


Cys 


Gin 


Asn 


Cys 


Gly 


Phe 


Asn 


Gin 


Val 






195 










200 










205 








Arg 


He 


Asn 


Gly 


Leu 


Asp 


Gly 


Arg 


Tyr 


Ala 


Gin 


He 


Leu 


He 


Asp 


Ser 




210 










215 










220 










Arg 


Pro 


He 


Met 


Ser 


Ala 


Leu 


Ala 


Gly 


Val 


Tyr 


Gly 


Leu 


Glu 


Gin 


He 


225 










230 










235 










240 


Pro 


Ala 


Asn 


Met 


He 


Glu 


Arg 


Val 


Glu 


Val 


Val 


Arg 


Gly 


Gly 


Gly 


Ser 










245 










250 










255 




Ala 


Leu 


Tyr 


Gly 


Ser 


Ser 


Ala 


He 


Ala 


Gly 


Val 


Val 


Asn 


He 


He 


Thr 








260 










265 










270 






Lys 


Giu 


Pro 


Ser 


His 


Asn 


Ser 


Phe 


Thr 


Phe 


Asn 


Glu 


Ser 


Leu 


Ser 


Phe 






275 










280 










285 








Thr 


Gly 


Phe 


Ser 


Lys 


Leu 


Asp 


Asn 


Asn 


Thr 


Asn 


Phe 


Asn 


Ala 


Ser 


He 




290 










295 










300 










Val 


Ser 


Asp 


Asp 


Asn 


Arg 


Ala 


Gly 


Ala 


Met 


Val 


Phe 


Gly 


Gin 


Ala 


Arg 


305 










310 










315 










320 


Tyr 


Arg 


Asn 


His 


Trp 


Asp 


Ala 


Asn 


Asn 


Asp 


Gly 


Tyr 


Ser 


Glu 


Leu 


Gly 










325 










330 










335 




Lys 


He 


Asp 


Ala 


Arg 


Ser 


Leu 


Gly 


Ala 


His 


Ser 


Tyr 


Leu 


Arg 


Leu 


Ser 








340 










345 










350 






Asp 


Tyr 


Ser 


Lys 


Leu 


Thr 


Gly 


Glu 


Phe 


His 


Thr 


He 


Ser 


Glu 


Phe 


Arg 






355 










360 










365 






Arg 


Gly Gly 


Asp 


Arg 


He 


Asp 


Leu 


Pro 


Pro 


His 


Val 


Val 


Gly 


Val 


Ala 




370 










375 










380 









SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCtD: <WO 9929e70A1_L> 



wo 99/29870 



PCT/AU98/01023 



257/490 



Glu Gin Thr Asp His Ser VaJ Phe Ser Gly Asn Leu Lys Tyr A^-p Leu 
385 390 395 400 

Phe Ser Ser Asn Tyr Lys His His Phe Gin Ala Tyr Thr Ser Gly Gin 
405 410 415 

5 He Val Asn Arg Lys Ser Tyr Tyr Gly Gly He Gly Glu He Asp Val 
420 425 430 

Asn Gly His Pro Gly Gly Thr Glu Gly Tyr Pro He Pro Gin Asp Gin 

435 440 445 

Tyr Gly Asn Asn Tyr Gly Val Thr Lys Gly Lys Thr Tyr Met Gly Gly 
10 450 455 460 

He Gin Tyr Ser Tyr Asp Leu Asp Lys Phe Leu Leu Met Pro Ser Gin 
465 470 475 480 

Leu Leu Phe Gly Ala Glu Tyr Thr Arg Asp Glu Leu Asn Asp Val Met 
485 490 495 

15 Pro He Leu Ser Trp Gin Thr Gly Glu Asp Ala Asn Gly Asn Thr He 
500 505 510 

Pro Leu Tyr Pro Glu Leu Asp Gin Asn lie Asn Asn Tyr Ser Leu Phe 

515 520 525 

Gly Gin Asn Glu Trp Lys Asn Asp Arg Trp Ser He Leu Val Gly Ala 
20 530 535 540 

Arg Leu Asp Lys His Ser Glu Val Lys Asp Met He Leu Ser Pro Arg 
545 550 555 560 

Thr Thr Leu Arg Phe Asn Val Asn Pro Asp He Asn Leu Arg Ala Thr 
565 570 575 

25 Tyr Ala Lys Gly Phe Arg Ala Pro Gin Val Phe Asp Glu Asp Leu His 
580 585 590 

Val Gly Val Val Gly Gly Glu Ala Gin Lys Val Phe Asn Asp Pro Asn 

595 600 605 

Leu Lys Pro Glu He Ser His Ala Phe Ser Leu Ser Ala Asp Met Tyr 
30 610 615 620 

His Arg Phe Gly Asn Val Gin Thr Asn Phe Leu Val Glu Gly Phe Tyr 
625 630 635 640 

Thr Arg Leu Leu Asp Val Phe Thr Asn Glu Glu Gin Pro Asp Gin His 
645 650 655 

35 Asp Gly lie Lys Arg Tyr Thr Arg lie Asn Gly Ser Gly Ala Lys Val 
660 665 670 

Phe Gly Leu Asn Leu Glu Gly Lys Val Ala Tyr Lys Ser Phe Gin Leu 

675 680 685 

Gin Ala Gly Leu Thr Leu Ala Ser Asn Lys Tyr Asp Glu Ala Gin Glu 
40 690 695 700 

Trp Gly Leu Asn Thr Val Lys Asp Thr Asn Gly Ala Phe Val Thr Glu 
705 710 715 720 

Ala Asn Ala Asn Gly Gin Gin Glu Tyr Lys Asn Glu Ser Met Thr Asp 
725 730 735 

45 Thr Gin He Thr Arg Thr Pro Ser Val Tyr Gly Tyr Phe Thr Leu Ala 
740 745 750 

Tyr Asn Pro Ala His Ser Trp Asn He Ala Leu Thr Gly Ala Tyr Thr 

755 760 765 

Gly Gin Met Tyr Val Pro His Ala He Glu Tyr Gly Val Lys Ser Ala 
50 770 775 780 

Glu Leu Asp He Met Gin Asn Asn Pro Glu He Thr Asp Glu Thr Gly 
785 790 795 800 

Lys Ala Pro Arg He Asp Glu Leu Lys Lys Thr Pro Ala Phe Phe Asp 
805 810 815 

55 Leu Gly Leu Lys Val Gly Tyr Asp Phe His Val Phe Gin Ala Thr Glu 
820 825 830 

Val Gin Leu Tyr Val Gly Met Asn Asn He Phe Asn Ser Phe Gin Lys 

835 840 845 

Asp Phe Asp Arg Gly Ala Ala Arg Asp Ser Gly Tyr He Tyr Gly Pro 
60 850 855 860 

Thr Gin Pro Arg Thr Gly Tyr Met Gly Leu Val Val Lys Phe 
865 870 875 



65 



70 



(2) IMFORHATIOH FOR SEQ ID NO: 333 

(i) SEQUEIICE CHARACTERISTICS: 

(A) LEHGTH: 206 amino acids 
IB) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 
75 (vi) ORIGIMAL SOURCE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <V^O 9929870A1_L> 




wo 99/29870 ' PCTJAV9mm3 

258/ 490 



10 



15 



20 



25 



30 



35 



55 



60 



65 



70 



75 



(R) ORGRMISM: Porphycoroonas gingivalis 

(i;-:) FEATURE: 

(A) MAME/KEY: misc_f eature 

(B) LOCATION 1. . .206 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333 



Lys 


Gin 


lie 


V a ± 


Met 


Thr 


Val 


Lys 


Arg 


t\L a 


V d i. 


Arg 


L re 




Leu 


Lieu 


1 








c 
D 










1 0 










J- o 




Thr 


Leu 


lie 


Gl y 


He 


Leu 


Phe 


Ser 


Ser 


Pro 


Ser 


Leu 


Va 1 


Arg 


Al a 


Gin 








z u 


























Ser 


Leu 


Phe 


Ser 


Thr 


Gl u 


His 


Val 


Leu 


Gin 


Leu 


Tyr 


Asn 


Lys 


He 


Leu 






3 5 










4 0 


















Tyr 


Gly 


Glu 


Ser 


Al a 


Al a 


Asp 


Thr 


Val 


Al a 


Glu 


Lys 


Thr 


Al a 


Gly 


Glu 




50 










55 










60 










Ser 


Ala 


Phe 


Pro 


Phe 


He 


Asp 


Lys 


Leu 


He 


Asn 


Leu 


Gly 


Arg 


Thr 


Phe 


65 










70 










75 










80 


Leu 


Gly 


Lys 


Pro 


Tyr 


Arg 


Tyr 


Arg 


Gly 


Pro 


Ser 


Pro 


Trp 


Pro 


Met 


Asp 










85 










90 










95 




Cys 


Ser 


Gly 


Tyr 


Val 


Ser 


Tyr 


Leu 


Tyr 


Ser 


Lys 


Phe 


Asp 


He 


Lys 


Leu 








100 










105 










110 






Pro 


Arg 


Gly 


Ala 


Ala 


Ala 


Gin 


Ser 


Gin 


Tyr 


Thr 


Asn 


Pro 


He 


Glu 


Arg 






115 










120 










125 








Glu 


Asp 


Val 


Arg 


Pro 


Gly 


Asp 


Leu 


Leu 


Phe 


Phe 


Lys 


Gly 


Arg 


Asn 


Ala 




130 










135 










140 










Arg 


Ser 


Asn 


Arg 


He 


Gly 


His 


Val 


Ala 


Leu 


Val 


Val 


Ser 


Val 


Asp 


Glu 


14b 










150 










155 










160 


Asp 


Asp 


He 


Thr 


Met 


Met 


His 


Ser 


Arg 


Asn 


Ser 


Arg 


Gly 


He 


Val 


He 










165 










170 










175 




Glu 


Lys 


Leu 


Asn 


Arg 


Ser 


Ala 


Tyr 


Phe 


Ser 


Arg 


Arg 


Leu 


Val 


Ser 


Tyr 








180 










185 










1 90 






Gly 


Arg 


Val 


Pro 


Gly 


Ala 


Lys 


Arg 


Val 


He 


Pro 


Arg 


Lys 


Ser 










195 










200 










205 









(2) 111 FORMAT I ON FOR SEQ ID NO: 334 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 67 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

Hi) MOLECULE TYPE; protein 

45 ( iii ) HYPOTHETICAL: YES 

( vi ) ORIGINAL SOURCE : 

(A) ORGA^^ISM: Porphyromonas gingivalis 

50 (ix) FEATURE: 

(A) WAT'IE/KEY: misc_f eature 
iB) LOCATION 1 . . .467 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334 



Lys 


Gly 


Thr 


Trp 


Asn 


Arg 


Lys 


Asn 


Arg 


Arg 


Glu 


Glu 


Met 


Lys 


Arg 


Thr 


1 








5 










10 










15 




He 


Leu 


Leu 


Thr 


Ala 


Leu 


Thr 


Val 


Leu 


Ser 


Ser 


Leu 


Ser 


Leu 


Leu 


Arg 








20 










25 










30 






Ala 


Gin 


Asn 


Glu 


Ser 


Glu 


Ala 


Ser 


Thr 


Asn 


Pro 


Met 


Ser 


Gly 


Leu 


Ser 






35 










40 










45 








Leu 


Glu 


Asp 


Cys 


He 


Arg 


He 


Ala 


Lys 


Glu 


Arg 


Asn 


Leu 


Asn 


Leu 


Arg 




50 










55 










60 










Arg 


Gin 


Glu 


He 


Glu 


Gin 


Glu 


Asn 


Arg 


He 


He 


Ser 


Leu 


Asp 


Ala 


Ala 


65 










70 










75 










80 


Arg 


His 


Ser 


Phe 


Leu 


Pro 


Ser 


Val 


Asn 


Ala 


Gly 


He 


Gly 


His 


Asn 


Tyr 










05 










90 










95 




Ser 


Phe 


Gly 


Arg 


Ser 


Lys 


Asp 


Lys 


Thr 


Gly 


Val 


Thr 


Val 


Asp 


Arg 


Ser 








100 










105 










110 






Ser 


Met 


Asn 


Thr 


Asn 


Leu 


Ser 


He 


Gly 


Ala 


Ser 


Val 


Glu 


Val 


Phe 


Ser 






115 










120 










125 








Gly 


Thr 


Arg 


Arg 


Leu 


His 


Asp 


Leu 


Lys 


Gin 


Gin 


Lys 


Tyr 


Asn 


Val 


Glu 




130 










135 










140 










Asp 


Gly 


He 


Ala 


Arg 


Leu 


Gin 


Lys 


Ala 


Arg 


Glu 


Asp 


Leu 


Ser 


Leu 


Gin 


145 










150 










155 










160 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO ^9929870A1_L> 
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259/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



He 


Al a 


Al a 


Leu 




He 


Asn 


Leu 


Toil 


tr ne 


Arg 


tji n 


Gl u 


Met 


Thr 


Arg 










1 65 










inn 










175 


Thr 


Al a 


Gl u 


Thr 


Gin 


I ^ 
eu 


Al a 


Leu 


± J. e 


Arg 


("'lit 

LjI u 


Lai n 


Arg 


Asn 


Arg 


Th r 








180 










X O D 










190 






Ala 


Glu 


Met 


Val 




Val 


vjx y 


Lys 


Trp 


Ala 


(jIU 


Gly 


Lys 


Leu 


Leu 


Asp 






1 95 




















one. 








He 


A*=n 




1 rt 

'jX n 




a 


Lys 


Asp 


Glu 


Gin 


Leu 


Leu 


Va 1 


Gin 


Tyr 


Arg 




210 










^ ± O 










220 






Ser 


Glu 


Glu 


Glu 


eu 


Al a 


Arg 


Leu 


Asp 


Leu 


Gly 


Gin 


Ala 


Leu 


Glu 


Leu 


225 










230 










o "a c;. 

^ JO 










240 


Gl u 


His 


Pro 


Glu 


Ser 


1 1 


M J. a 


V a J. 


Lys 


Ml a 


Pro 


Asp 


Thr 


Asp 


Va 1 


Leu 










^1 










250 








255 




Va 1 


Ala 


mil 


Arg 


Leu 


Gly 


Ser 


Leu 


Leu 


Ala 


Pro 


Glu 


Glu 


He 


Tyr 


Arg 








260 










265 










270 


Thr 


/Mo 


Leu 


L3 J. y 


Leu 


Lys 


Pro 


Ala 


Leu 


His 


Ser 


Ser 


Glu 


Leu 


Gl n 


He 






27 5 










280 










285 








Al a 


3*=" r 


rtx a 


Mr g 


o 1 U 


Gly 


Leu 


Al a 


Ser 


Ala 


Arg 


Ala 


Ala 


Tyr 


Phe 


Pro 




2 90 










9 Q 

1^. r o 










300 








Thr 


Leu 


Se r 


L'^u 


oe r 


MX a 


vjJ- y 


Tyr 


Ser 


Asn 


Gly 


Tyr 


Phe 


Arg 


Asp 


Leu 


305 










310 










315 








320 




Lys 


Glu 




Ala 


Al a 


r ± e 


Asn 


Pro 


Ser 


Phe 


Se r 


Gl u 


Gin 


T rp 


Lys 










325 




















3 35 


As n 


Asn 


Gl 


Ser 


1 y £ 


S^r 


H »=■ 


Gly 


Leu 


Ser 


Leu 


Asn 


He 


Pro 


He 


Phe 








34 0 










34 5 
















Ser 


Ala 


Met 


Gin 


Thr 


Gin 


A<=-n 


Arg 


Va 1 




S*^ r 


S* r 


Arg 


Leu 


oi n 


He 






355 










3 60 


















Ar g 


Ser 


Ser 


u u 


Leu 


Arg 


T /i-i 1 

ijeu 


~ 1 
Val 


Glu 


Glu 


Lys 


Lys 


Al a 


Leu 


Tyr 


Lys 




37 0 










37 5 










380 








Gl u 


He 


Arg 


Gin 


Ala 


Tyr 


Ser 


Asn 


Ala 


Val 


Ala 


Ala 


Asp 


Lys 


Ala 


He 


385 










390 










395 








4 00 


Al a 


Ala 


Ala 


Glu 




Ser 


Lys 


Ala 


Ala 


1 n r 


Leu 


Lys 


Ml ct 


Tyr 


Glu 


Tyr 










4 05 










410 








415 


Ala 


Arg 


Asp 


Ser 


Phe 


Glu 


Ala 


Gly 


Arg 


Leu 


Ser 


Ala 


Tyr 


Glu 


Tyr 


Ala 








420 










425 










430 






Glu 


Ala 


Lys 


Thr 


Lys 


Tyr 


Ala 


Leu 


Ser 


Gin 


Val 


Glu 


Glu 


Leu 


Arg 


Ala 






^35 










440 










445 






Lys 


Tyr 


Asp 


Phe 


He 


Tyr 


Lys 


Ala 


Lys 


Val 


Leu 


Asp 


Phe 


Tyr 


Gin 


Gly 




4 50 










455 










460 






Lys 


Asp 


Phe 




























465 
































(2) 


I M FORMAT I ON 


FOR 


SEQ 


ID NO: 335 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 amino acids 
iB) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii} MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

{A) ORGTM^IISH: Porphyromona s gingivalif 

(ix) FEATURE: 

(A) NAIIE/KEY: misc_feature 

( B) LOCATION 1 ... 451 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 335 



Lys 


Thr 


Ser 


Tyr 


Arg 


Asn 


Asn 


Met 


Arg 


Phe 


Gin 


His 


Tyr 


Leu 


He 


Cys 


1 








5 










10 










15 


Thr 


Ala 


Ala 


Val 
20 


Ala 


Ala 


Leu 


Ala 


Ala 
25 


Asn 


Pro 


Leu 


Thr 


Gly 
30 


Gin 


Ser 


Asn 


Met 


Thr 
35 


Leu 


Glu 


Glu 


Cys 


He 
40 


Asp 


Tyr 


Ala 


Arg 


Arg 
45 


His 


Ser 


Ser 


Ala 


Val 


Ala 


Leu 


Ser 


Ala 


Ala 


Glu 


Leu 


Glu 


Gin 


Ser 


Lys 


Ala 


Asp 


Tyr 




50 










55 










60 




Leu 


Gin 


Ala 


Val 


Gly 


Asn 


Phe 


Leu 


Pro 


Arg 


Val 


Ser 


Ala 


Gly 


Thr 


Gly 


65 










70 










75 








80 


Ala 


Ser 


Trp 


Asn 


Phe 


Gly 


Arg 


Gly 


Leu 


Asp 


Ala 


Glu 


Thr 


Asn 


Thr 


Tyr 










85 










90 










95 


Thr 


Asp 


He 


Asn 
100 


Ser 


Phe 


Asn 


Asn 


Ser 
105 


Tyr 


Ser 


He 


His 


Ala 
110 


Thr 


Met 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDCX;iO: <WO_9929B70A1_I_> 




wo 99/29870 

260/490 



PCT/AU98/01023 



65 



1 nr 


Leu 


t^ne 


Asp 


vtX y 


Xieu 


•jX n 


Ser 


Va 1 


Tyr 


Arg 


Le u 


Arg 


Met 


MX a 


Ml S 






lit; 
X J. J) 










ion 
X z u 










125 








MX a 


Arg 


Arg 


Gl u 


MX a 


Ser 


Arg 


Leu 


Ser 


vax 


Arg 


(jX u 


(jx n 


ox n 


oX U 


x<e u 




X Ov 










X J o 










14 0 










Ml a 


MX a 


Leu 


GX':, 


1 nr 


1 n r 


uX u 


Mia 


Tyr 


Tyr 


Asp 


Leu 


vax 


Tyr 


MX cj 


Arg 


1 /1 

±H o 










150 










±oo 










160 


o± n 


iMex, 


Gin 


^ U 


L'=u 


Al a 


M*=* t 


ijx n 


Lys 


Tyr 


Glu 


LiXU 


Ser 


Ser 


Arg 


Leu 










165 










X / u 










17 5 




His 


rg 


Gin 


Thr 


Ala 


i^g 


Me t 


Glu 


Glu 


Leu 


foxy 


Met 




Ser 


Arg 


Pro 








X o u 










lot: 
X H O 










X 






Asp 


vax 


XiSU 


uX u 




ox n 


oer 


Arg 


Met 


MX a 


Gly 


TV 

Asp 


Arg 


Leu 


MX a 


Leu 






IOC 




















one; 








Thr 


Gin 


Al a 


Asp 


Asn 


LjX n 


Cys 


He 


He 


Al a 


Leu 


lie 


Arg 


Leu 


Lys 


LjX u 




210 










Z X o 










o o n 










Lys 


t'le t 


Asn 


irne 


IT ro 


X X e 


Asp 


Asp 


Gl u 


Leu 


Val 


vax 


Asp 


Asp 


Met 


Pro 


22 6 










O A 










235 










2 4 0 


M J- u 


Asp 


S^^r 


eu 


S«r 


Al a 


Asp 


ne L. 


MX a 


oX u 


Ser 


Asp 


Ser 


oe r 


Al a 


^jx y 










/i. ^ D 










250 










c: c 




V a X 


tr ne 


Al a 


Arg 


Al a 


Al a 


MX S 


MX E 


nxs 


rro 


vax 


Leu 


Toil 


Arg 


MX c* 


Lys 








2 60 










*> c; p> 










9 T A 






Leu 


Asp' 


Gl U 


Gin 


MX a 


Al a 


1 n r 


Asp 


Arg 


Leu 


Arg 


MX d 


MX a 


Arg 


Gl y 


MX a 






27 5 










2 80 










2 8 5 








Phe 


L&u 


Pro 


Ser 


Val 


Ser 


Va 1 


Ser 


IjXy 


Gly 


i rp 


Asn 


Thr 


Gl y 


Phe 


Ser 




2 90 










2 95 










300 










Arq 


Phe 


Leu 




Gl 


r 


Asp 


T\'r 


Thr 


ro 


Phe 


S'^r 


Glu 


"1 
G n 


Ph^ 


Arg 


3 05 










310 










315 










320 






rg 


Gl y 


Glu 


Tyr 


Va 1 


S*=r 


L'^'U 


A-^n 


L^u 


S*=' r 


He 


Pro 


11'=' 


Ph^ 










325 




















J O D 




Ser 




Phe 


Ser 




Val 


Ser 


His 


Leu 


Arg 


Gin 


Al a 


rg 


Al a 


Glu 


rg 








34 0 










34 5 










350 






rg 


Ala 


Al a 


J 2 c 


Val 


Ar 
rg 


rg 


^jX y 


*jX u 


MX a 


LjX u 


Arg 


Arg 


Xieu 


Tyr 


oe r 






355 










360 










365 








Glu 


He 


Ala 


Gin 


Ala 


Met 


Ala 


Asp 


Arg 


Asp 


Ala 


Ala 


Leu 


Ala 


Ser 


Tyr 




370 










375 




















Arg 


Gin 


Ala 


Lys 


Glu 


His 


Thr 


Asp 


Ala 


Met 


Gin 


Thr 


Ala 


Tyr 


Glu 


Ala 


385 










390 










395 










400 


Val 


Leu 


Gin 


Arg 


Tyr 


Glu 


Glu 


Gly 


Leu 


Asn 


Thr 


Ala 


He 


Asp 


Leu 


Thr 










4 05 










410 










415 




Thr 


Gin 


Ala 


Asn 


Arg 


Leu 


Leu 


Asp 


Ala 


Arg 


Val 


Gin 


Arg 


Leu 


Arg 


Ala 








420 










425 










430 






Ala 


Met 


Thr 


Tyr 


Arg 


Leu 


Lys 


Cys 


Lys 


Leu 


He 


Ala 


Tyr 


Tyr 


Gly 


Cys 






435 










440 










445 








Leu 


Ser 


Asp 






























450 






























(2) 


irjrORIIATION 


FOR 


SEQ 


ID MO: 336 

















10 



15 



20 



25 



30 



35 



40 



45 



(i) SEQUEDCE CHARACTERISTICS: 

(A) LENGTH: 962 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

55 (iii) HYPOTHETICAL: YES 

(vi) ORIGIHAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

60 (ix) FEATURE: 

(A) NAJ-IE/KEY: mlsc_feature 

(B) LOCATIOM 1. . . 962 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336 



Ala He Phe Val Val Ser Leu Gin He Glu Lys He Thr Glu Asn Lys 
15 10 15 

Tyr Asn Ser Asp Lys Ser Met Asn Lys Phe Tyr Lys Ser Leu Leu Gin 
20 25 30 

70 Ser Gly Leu Ala Ala Phe Val Ser Met Ala Thr Ala Leu Thr Ala Ser 
35 40 45 

Ala Gin He Ser Phe Gly Gly Glu Pro Leu Ser Phe Ser Ser Arg Ser 

50 55 60 

Ala Gly Thr His Ser Phe Asp Asp Ala Met Thr He Arg Leu Thr Pro 
75 65 70 75 80 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 PCT/AU98/01023 

261/490 

Asp Phe Asn Pro Glu Asp Leu lie Ala Gin Ser Arg Trp Gin Ser Gin 

85 90 95 

Arg Asp Gly Arg Pro Val Arg He Gly Gin Val Tie Pro Val Asp Val 
100 105 110 

5 Asp Phe Ala Ser Lys Ala Ser His He Ser Ser He Gly Asp Val Asp 
115 120 125 

Val Tyr Arg Leu Gin Phe Lys Leu Glu Gly Ala Lys Ala He Thr Leu 

130 135 140 

Tyr Tyr Asp Ala Phe Asn He Pro Glu Gly Gly Arg Leu Tyr He Tyr 
10 1^5 150 155 160 

Thr Pro Asp His Glu He Val Leu Gly Ala Tyr Thr Asn Ala Thr His 

165 170 175 

Arg Arg Asn Gly Ala Phe Ala Thr Glu Pro Val Pro Gly Ser Glu Leu 
180 185 190 

15 He net Asp Tyr Glu Val Ser Arg Gly Gly Thr Leu Pro Asp He Lys 
195 200 205 

He Ser Gly Ala Gly Tyr He Phe Asp Lys Val Gly Gly Arg Pro Val 

210 215 220 

Thr Asp Asn His Tyr Gly He Gly Glu Asp Asp Ser Asp Ser Asp Cys 
^0 225 230 235 240 

Glu He Asn He Asn Cys Pro Glu Gly Ala Asp Trp Gin Ala Glu Lys 

2^5 250 255 

Asn Gly Val Val Gin Met He Met Val Lys Gly Gin Tyr He Ser Met 
260 265 270 

ZO Cys Ser Gly Asn Leu Leu Asn Asn Thr Lys Gly Asp Phe Thr Pro Leu 
275 280 205 

I]e He Ser Ala Gly His Cys Ala Ser He Thr Thr Asn Phe Gly Val 

2?0 295 300 

Thr Gin Ser Glu Leu Asp Lys Trp He Phe Thr Phe His Tyr Glu Lys 
CJU 305 310 315 320 

Arg Oiy Cys Ser Asn Gly Thr Leu Ala He Phe Arg Gly Asn Ser He 

325 330 335 

He Gly Ala Ser Met Lys Ala Phe Leu Pro He Lys Gly Lys Ser Asp 
oc- ^"^^ 345 350 

OO Gly Leu Leu Leu Gin Leu Asn Asp Glu Val Pro Leu Arg Tyr Arg Val 
355 360 365 

Tyr Tyi: Asn Gly Trp Asp Ser Thr Pro Asp He Pro Ser Ser Gly Ala 

370 375 380 

Gly He His His Pro Ala Gly Asp Ala Met Lys He Ser He Leu Lys 
40 385 390 395 400 

Lys Thr Pro Ala Leu Asn Thr Trp He Ser Ser Ser Gly Ser Gly Gly 

AOS 410 415 

Thr Asp Asp His Phe Tyr Phe Lys Tyr Asp Gin Gly Gly Thr Glu Gly 
,c "^^^ 425 430 

Gly Ser Ser Gly Ser Ser Leu Phe Asn Gin Asn Lys His Val Val Gly 

435 440 445 

Thr Leu Thr Gly Gly Ala Gly Asn Cys Gly Gly Thr Glu Phe Tyr Gly 

^50 455 460 

Arg Leu Asn Ser His Trp Asn Glu Tyr Ala Ser Asp Gly Asn Thr Ser 
OU 465 470 475 480 

Arg Met Asp He Tyr Leu Asp Pro Gin Asn Asn Gly Gin Thr Thr He 

485 490 495 

Leu Asn Gly Thr Tyr Arg Asp Gly Tyr Lys Pro Leu Pre Ser Val Pro 

500 505 510 

Arg Leu Leu Leu Gin Ser Thr Gly Asp Gin Val Glu Leu Asn Trp Thr 

515 520 525 

Ala Val Pro Ala Asp Gin Tyr Pro Ser Ser Tyr Gin Val Glu Tyr His 

530 535 540 

He Phe Arg Asn Gly Lys Glu He Ala Thr Thr Lys Glu Leu Ser T^-r 
^0 550 555 560 

Ser Asp Ala He Asp Glu Ser He He Gly Ser Gly He He Arc Tyr 

565 570 575 

Glu Val Ser Ala Arg Phe He Tyr Pro Ser Pro Leu Asp Gly Val Glu 

580 585 590 

Ser Tyr Lys Asp Thr Asp Lys Thr Ser Ala Asp Leu Ala He Gly Asp 

595 600 605 

He Gin Thr Lys Leu Lys Pro Asp Val Thr Pro Leu Pro Gly Gly Gly 

eiO 615 620 

Val Ser Leu Ser Trp Lys Val Pro Phe Leu Ser Gin Leu Val Ser Arg 
70 625 630 635 640 

Phe Gly Glu Ser Pro Asn Pro Val Phe Lys Thr Phe Glu Val Pro Tyr 

645 650 655 

Val Ser Ala Ala Ala Ala Gin Thr Pro Asn Pro Pro Val Gly Val Val 

660 665 670 

He Ala Asp Lys Phe Met Ala Gly Thr Tyr Pro Glu Lys Ala Ala II- 



55 



65 



75 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 




PCT/AU98/01023 



262/490 



675 



680 



685 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



Ala 


Ala 


Val 


Tyr 


Val 


Het 


Pro 


Ser 


Ala 


Pro 


Asp 


Ser 


Thr 


Phe 


His 


Leu 




690 










695 










700 










Phe 


Leu 


Lys 


Ser 


Asn 


Thr 


Asn 


Arg 


Arg 


Leu 


Gin 


Lys 


Val 


Thr 


Thr 


Pro 


705 










710 










715 










720 


Ser 


Asp 


Trp 


Gin 


Ala 


Gly 


Thr 


Trp 


Leu 


Arg 


He 


Asn 


Leu 


Asp 


Lys 


Pro 










725 










730 










735 




Phe 


Pro 


Val 


Asn 


Asn 


Asp 


His 


Met 


Leu 


Phe 


Ala 


Gly 


He 


Arg 


Het 


Pro 








740 










745 










750 






A^n 


Lys 


Tyr 


Lys 


Leu 


Asn 


Arg 


Ala 


He 


Arg 


Tyr 


Val 


Arg 


Asn 


Pro 


Asp 






755 










760 










765 








Asn 


Leu 


Phe 


Ser 


lie 


Thr 


Gly 


Lys 


Lys 


He 


Ser 


Tyr 


Asn 


Asn 


Gly 


Val 




770 










775 










780 










Ser 


Phe 


Glu 


Gly 


Tyr 


Gly 


lie 


Pro 


Ser 


Leu 


Leu 


Gly 


Tyr 


Het 


Ala 


He 


785 










7 90 










795 










800 


Lys 


Tyr 


Leu 


Val 


Val 


Asn 


Thr 


Asp 


Ala 


Fro 


Lys 


He 


Asp 


Met 


Ser 


Leu 










805 










810 










815 




Val 


Gin 


Glu 


Pro 


Tyr 


Ala 


Lys 


Gly 


Thr 


Asn 


Val 


Ala 


Pro 


Phe 


Pro 


Glu 








820 










825 










830 






Leu 


Val 


Gly 


lie 


Tyr 


Val 


Tyr 


Lys 


Asn 


Gly 


Thr 


Phe 


He 


Gly 


Thr 


Gin 






835 










84 0 










845 








Asp 


Pro 


Ser 


Val 


Thr 


Thr 


Tyr 


Ser 


Val 


Ser 


Asp 


Gly 


Thr 


Glu 


Ser 


Asp 




850 










855 










860 










Glu 


Tyr 


Glu 


lie 


Lys 


Leu 


Val 


Tyr 


Lys 


Gly 


Ser 


Gly 


He 


Ser 


Asn 


Gly 


865 










870 










875 










880 


Val 


Ala 


Gin 


lie 


Glu 


Asn 


Asn 


Asn 


Ala 


Val 


Val 


Ala 


Tyr 


Pro 


Ser 


Val 










885 










890 










895 




Val 


Thr 


Asp 


Arg 


Phe 


Ser 


lie 


Lys 


Asn 


Ala 


His 


Met 


Val 


His 


Ala 


Ala 








900 










905 










910 






Ala 


Leu 


Tyr 


Ser 


Leu 


Asp 


Gly 


Lys 


Gin 


Val 


Arg 


Ser 


Trp 


Asn 


Asn 


Leu 






915 










920 










925 








Arg 


Asn 


Gly 


Val 


Thr 


Phe 


Ser 


Val 


Gin 


Gly 


Leu 


Thr 


Ala 


Gly 


Thr 


Tyr 




930 










935 










940 










Met 


Leu 


Val 


Met 


Gin 


Thr 


Ala 


Asn 


Gly 


Pro 


Val 


Ser 


Gin 


Lys 


He 


Val 


945 










950 










955 










960 


Lys 


Gin 































(2) IMFORHATION FOR SEQ ID MO: 337 

ti) SEQUEMCE CHARACTERISTICS: 

{A) LENGTH: 702 amino acids 
(B) TYPE: amino acid 
( D) TOPOLOGY: linear 

(ii) HOLECULE TYPE: protein 

( ill ) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGAMISH: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAIIE/KEY: misc_feature 

(B) LOCATION 1...702 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337 



Trp 


Lys 


Lys 


Ser 


Asn 


Pro 


Gin 


Tyr 


Arg 


Gly 


Arg 


Thr 


Ala 


Asn 


Tyr 


Arg 


1 








5 










10 










15 




Asp 


His 


Tyr 


Asn 


Thr 


Glu 


Ser 


Tyr 


Leu 


Ser 


Leu 


Arg 


Arg 


Asn 


Gin 


Thr 








20 










25 










30 






Met 


Lys 


Tyr 


Leu 


He 


Arg 


Leu 


Phe 


Leu 


Ser 


Leu 


Het 


Leu 


Leu 


Ser 


Leu 






35 










40 










45 








Trp 


Thr 


Gly 


Cys 


Thr 


His 


Glu 


Glu 


Leu 


Ser 


He 


cys 


Asp 


Gly 


Glu 


Asn 




50 










55 










60 










Thr 


Leu 


Val 


Leu 


Arg 


Val 


Glu 


Thr 


Gly 


Lys 


Ala 


Pro 


As n 


Ala 


Arg 


Ala 


65 










70 










75 










80 


Thr 


Gl u 


Pro 


Gly 


Gin 


Gly 


He 


Tyr 


Asn 


Glu 


Asn 


Lys 


Val 


Gly 


Ser 


He 










85 










90 










96 




Ser 


Val 


Leu 


Phe 


Tyr 


Leu 


Glu 


Gly 


Gin 


Leu 


Arg 


Trp 


Gin 


Val 


Lys 


Ser 








100 










105 










110 






Thr 


Asp 


Tyr 


Gin 


He 


His 


Glu 


Gly 


Ala 


Tyr 


He 


He 


Pro 


Val 


Lys 


Glu 






115 










120 










125 








Gin 


1-iet 


Arg 


Pro 


Leu 


Phe 


Asn 


Gly 


Asn 


Asn 


Asn 


Phe 


Ser 


He 


Tyr 


Val 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <W0 9929870A1_L> 



wo 99/29870 

263/490 



% 



PCT/AU98/01023 





130 










135 










140 










Val 


Ala 


Asn 


Leu 


Asp 


Fhe 


Asn 


Ala 


Pro 


Ala 


Thr 


Glu 


Ala 


Ala 


Leu 


Ser 


145 










150 










155 










160 


Gin 


Phe 


Val 


Val 


Glu 


Lys 


Ser 


He 


Glu 


Val 


Ser 


Ser 


Thr 


Thr 


Ala 


Pro 










165 










170 










175 




Ala 


Asp 


Phe 


Val 


t'let 


Leu 


Ala 


His 


Gly 


Asn 


Lys 


Gin 


He 


Asn 


Met 


Ala 








180 










185 










190 






Thr 


Thr 


Glu 


Gly 


Lys 


Leu 


Leu 


Gly 


Asp 


Tyr 


Lys 


Leu 


Lys 


Arg 


Val 


Ala 






195 










200 










205 








Ala 


Lys 


He 


Arg 


Met 


He 


Lys 


Pro 


Thr 


He 


Asn 


Val 


Gin 


Gly 


Tyr 


Glu 




210 










215 










220 










Val 


Val 


Gly Asn 


He 


Gin 


Ala 


Lys 


Phe 


Arg 


Asn 


Ser 


Val 


Thr 


Lys 


Gly 


225 










230 










235 










240 


Phe 


Leu 


Thr 


Thr 


Glu 


Ala 


Gin 


Glu 


He 


Pro 


Ala 


Ala 


Ala 


Ser 


Tyr 


Lys 










245 










250 










255 




Thr 


Ser 


Glu 


Tyr 


Leu 


Asp 


He 


Ala 


Glu 


Ser 


Ala 


Pro 


Ala 


Asn 


Ser 


He 








260 










265 










270 






His 


Phe 


Tyr 


Ser 


Tyr 


Tyr 


Asn 


Lys 


Trp 


Thr 


Leu 


Ser 


Thr 


Pro 


Glu 


Lys 






275 










280 










285 








Arg 


Pro 


Glu 


Phe 


Phe 


He 


Met 


Val 


Lys 


Phe 


Lys 


Lys 


Thr 


Gly 


Gin 


Pro 




2 90 










295 










300 










Asp 


Asn 


Thr 


Ala 


Lys 


Pro 


Tyr 


Tyr 


Tyr 


Arg 


Val 


Pro 


Leu 


Glu 


Ser 


Gin 


305 










310 










315 










320 


Asp 


Asn 


Gin 


Val 


Lys 


Ser 


Asn 


Val 


Leu 


Tyr 


Asn 


Leu 


Asn 


Val 


Lys 


He 










325 










330 










335 




Glu 


He 


Leu 


Gly 


Ser 


Leu 


Gin 


Glu 


Pro 


Gl u 


Ala 


Val 


Ser 


Val 


Asn 


Gly 








340 










345 










350 






Thi- 


Leu 


Ala 


He 


Glu 


Glu 


Trp 


He 


Leu 


His 


Gin 


Asp 


Ala 


Phe 


Asn 


Leu 






355 










360 










365 








Pro 


Ala 


Thr 


Asn 


Tyr 


Leu 


He 


Val 


Glu 


Gin 


His 


Gl u 


He 


Phe 


Met 


Asn 




37 0 










37 5 










380 










Asn 


Val 


Asn 


Thr 


Tyr 


Ser 


Val 


Lys 


Tyr 


Gin 


Thr 


Ser 


Gin 


Lys 


Pro 


He 


385 










390 










395 










4 00 


Ser 


He 


Ser 


He 


Gin 


Ser 


Val 


Thr 


Phe 


Ser 


Tyr 


Val 


Ser 


Ser 


Asp 


Gly 










405 










410 










415 




Thr 


Gin 


His 


Asn 


Asp 


Leu 


Val 


Ala 


Ser 


Ser 


Ser 


Asp 


Gin 


Tyr 


Pro 


Thr 








420 










425 










430 






lie 


Thr 


Ser 


Asp 


Asn 


Thr 


Ser 


He 


He 


He 


Thr 


Ser 


Lys 


He 


Pro 


Val 






435 










440 










445 








Asn 


Asn 


Val 


Pro 


Lys 


Lys 


He 


Val 


Phe 


Glu 


Val 


Thr 


Asn 


Gly 


Val 


Ala 




4 50 










455 










460 










Giy 


Leu 


Lys 


Glu 


Thr 


Val 


Thr 


Val 


Leu 


Gin 


Tyr 


Pro 


Ala 


Gin 


Phe 


He 


465 










470 










475 










480 


Val 


Asn 


Thr 


Leu 


Gly 


Thr 


Ala 


Ser 


Ala 


Trp 


Arg 


Pro 


Asp 


Gly 


Ser 


Leu 










485 










490 
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Ala 


Pro 


Gly 


Leu 


Asn 


Asn 


Lys 


Ala 


He 


Tyr 


His 


Val 


Val 


Val 


Leu 


Val 








600 










505 










510 






Pro 


Pro 


Glu 


Asn 


Leu 


Phe 


Glu 


Asp 


Gly 


Thr 


Gin 


Thr 


He 


He 


Gly 


Tyr 






515 










520 










525 








Pro 


Pro 


Thr 


Glu 


Thr 


He 


Ser 


Phe 


His 


Lys 


Lys 


Glu 


Asn 


Asn 


Thr 


Tyr 




530 










535 










540 










Pro 


He 


Val 


Trp 


Ser 


Asp 


Thr 


Asn 


Thr 


Thr 


Lys 


Gin 


Asp 


Leu 


Glu 


Thr 


545 










550 










555 










560 


Ser 


Arg 


Met 


He 


Ser 


Pro 


Ser 


Phe 


Glu 


Leu 


Ala 


Ser 


Gin 


Leu 


Gly 


Ala 










565 










570 










575 




Thr 


Leu 


Pro 


Met 


Pro 


Tyr 


Leu 


Glu 


Tyr 


Trp 


Pro 


Gly 


Thr 


Ser 


Tyr 


Leu 








580 










585 










590 






Leu 


Asp 


Tyr 


Ser 


Gly 


Asn 


Tyr 


Asn 


Asn 


Lys 


Arg 


Tyr 


Ala 


Leu 


Phe 


Asn 






595 










600 










605 








Cys 


Ala 


Phe 


Tyr 


Trp 


Glu 


Lys 


Arg 


Lys 


Val 


Asn 


Asn 


Glu 


Glu 


He 


Lys 




610 










615 










620 










Phe 


Asp 


Asp 


Trp 


Arg 


Leu 


Pro 


Thr 


Glu 


Ala 


Glu 


He 


Lys 


Leu 


He 


Asp 


625 










630 










635 










640 


Lys 


Leu 


Gin 


His 


Asn 


Glu 


Gin 


Ser 


Ala 


Val 


Gin 


Ala 


He 


Met 


Thr 


Gly 










645 










650 










655 




Asn 


Tyr 


Tyr 


Trp 


Asp 


Ser 


Tyr 


Ser 


Ala 


Asn 


Gly 


Ser 


Tyr 


Lys 


Met 


Gin 








660 










665 










670 






Gly 


Gly 


Gly 


Gly 


Gin 


Gly Asn 


Ser 


Ser 


Lys 


Ala 


Tyr 


Val 


Arg 


Cys 


Val 






675 










680 










685 








Arg 


Asp 


Val 


Lys 


Lys 


Pro 


He 


Arg 


Asp 


Lys 


Ly^ 


Ser 


Gly 


Lys 








690 










695 










700 
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50 
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60 



65 
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(2) INFORMATION FOR SEQ ID 1J0:338 
75 (i) SEQUENCE CHARACTERISTICS: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCIO: <W0 ^9929870A1_L> 



4 



WO 99/29870 ^ PCT/AU98/01023 

264/ 490 



(A) LENGTH: 1312 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (ii) HOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGItlAL SOURCE: 
10 (A) ORGAIIISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) N7UIE/KEY: misc_feature 

(B) LOCATION 1 . , . 1312 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338 



Lys 


Arg 


Val 


Asp 


Ser 


Tyr 


Glu 


Cys 


Arg 


Gin 


Lys 


Ala 


Cys 


Lys 


Cys 


Ala 


1 








5 










10 










15 




He 


Cys 


Val 


He 


Gin 


Lys 


Phe 


Thr 


Asn 


Val 


Lys 


Leu 


Asn 


Asp 


Met 


Arg 








20 










25 










30 




Lys 


He 


Leu 


Ser 


Phe 


Leu 


Met 


Met 


Cys 


Ser 


Leu 


His 


Leu 


Gly 


Leu 


Gin 






35 










40 










45 








Ser 


Gin 


Thr 


Trp 


His 


Gly 


Asp 


Pro 


Asp 


Ser 


Val 


Ala 


Ala 


Leu 


Pro 


Ser 




50 










55 










60 










li- 


Gly 


He 


Gin 


Glu 


Ser 


Ser 


Cys 


Thr 


Arg 


He 


Thr 


Phe 


Glu 


Val 


Val 


es 










70 










75 










80 


Phe 


Pro 


Gly 


Phe 


Tyr 


Ser 


Val 


Glu 


Lys 


Arg 


Glu 


Gly 


Asn 


Gin 


Val 


Phe 










85 










90 










95 




Gin 


Arg 


He 


Ser 


Met 


Pro 


Gly 


Cys 


Gly 


Ser 


Phe 


Gly 


Asn 


Leu 


Gly 


Glu 








100 










105 










110 






Al a 


GIu 


Leu 


Pro 


Val 


Leu 


Ly^ 


Lys 


Met 


He 


Ala 


Val 


Pro 


Glu 


Phe 


Ser 






115 










120 










125 








Thr 


Ala 


Asn 


Val 


Ala 


Val 


Lys 


He 


Lys 


Glu 


Thr 


Glu 


Thr 


Phe 


Asp 


Asn 




130 










135 










140 








Tyr 


Asn 


He 


Tyr 


Pro 


Asn 


Pro 


Thr 


Tyr 


Val 


Val 


Glu 


Glu 


Leu 


Pro 


Glu 












150 










155 










160 


Gly 


Gly 


Thr 


Tyr 


Leu 


Val 


Glu 


Ala 


Phe 


Ala 


He 


Asn 


Asn 


Asp 


Tyr 


Tyr 










165 










170 










175 


Ser 


Gin 


Asn 


Val 


Ser 


Leu 


Pro 


Ser 


Thr 


His 


Tyr 


Val 


Tyr 


Ser 


Gin 


Asp 








180 










185 










190 




Gly 


Tyr 


Phe 


Arg 


Ser 


Gin 


Arg 


Phe 


He 


Glu 


Val 


Thr 


Leu 


Tyr 


Pro 


Phe 






195 










200 










205 








Arg 


Tyr 


Asn 


Pro 


Val 


Arg 


Gin 


Glu 


He 


Leu 


Phe 


Ala 


Lys 


Lys 


He 


Glu 




210 










215 










220 










Val 


Thr 


He 


Thr 


Phe 


Asp 


Asn 


Pro 


Gin 


Pro 


Pro 


Leu 


Gin 


Lys 


Asn 


Thr 


225 










230 










235 










240 


Gly 


He 


Phe 


Asn 


Lys 


Val 


Ala 


Ser 


Ser 


Ala 


Phe 


He 


Asn 


Tyr 


Glu 


Ala 










245 










250 








255 




Asp 


Gly 


Lys 


Ser 


Ala 


He 


Glu 


Asn 


Asp 


Met 


Val 


Phe 


Ser 


Arg 


Gly 


Thr 








260 










265 










270 






Thr 


Thr 


Tyr 


He 


Ser 


Gly 


Asn 


Val 


Ala 


Ser 


Asn 


Leu 


Pro 


Gin 


Asn 


Cys 






275 










280 










285 






Asp 


Tyr 


Leu 


Val 


He 


Tyr 


Asp 


Asp 


Met 


Phe 


Asn 


Val 


Asn 


Gin 


Gin 


Pro 




2 90 










295 










300 










His 


Asp 


Glu 


He 


Lys 


Arg 


Leu 


Cys 


Glu 


His 


Arg 


Ala 


Phe 


Tyr 


Asn 


Gly 


305 










310 










315 








320 


Phe 


Asp 


Val 


Ala 


Ala 


Val 


Ser 


He 


Lys 


Asp 


Val 


Leu 


Asn 


Ser 


Phe 


Pro 










325 










330 










335 




Ser 


Asn 


Ala 


Thr 


Ser 


Tyr 


He 


Asn 


Glu 


Thr 


Lys 


Leu 


Lys 


Asn 


Phe 


He 








340 










345 










350 






Arg 


Ser 


Val 


Tyr 


Asn 


Gin 


Ser 


Asn 


Ala 


Lys 


Arg 


Thr 


Leu 


Asp 


Gly 


Lys 






355 










360 










365 




Leu 


Gly 


Tyr 


Val 


Leu 


Leu 


He 


Gly 


Lys 


Pro 


Leu 


Ser 


Lys 


Tyr 


Leu 


Ala 




370 










375 










380 








Asp 


Thr 


Asp 


Asn 


Thr 


Lys 


Val 


Pro 


Thr 


Ser 


Phe 


He 


His 


Asn 


Val 


Ser 


385 










390 










395 










4 00 


Leu 


He 


Pro 


Ser 


His 


Pro 


Thr 


Phe 


Gly 


Ser 


He 


Cys 


Ala 


Ser 


Asp 


Tyr 










405 










410 










415 




Phe 


Phe 


Ser 


Cys 


Val 


Ser 


Pro 


Leu 


Asp 


Thr 


Val 


Gly 


Asp 


Leu 


Phe 


He 








420 










425 










430 






Gly 


Arq 


Phe 


Ser 


Val 


Thr 


Asn 


Ala 


His 


Glu 


Leu 


His 


Asn 


Leu 


He 


Glu 






435 










440 










445 








Lys 


Thr 


He 


Asn 


Lys 


Glu 


He 


Ser 


Tyr 


Asn 


Pro 


He 


Ala 


His 


Lys 


Asn 




450 










455 










460 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



He 


Leu 


Tyr 


Ala 


Glu 


Gly 


Lys 


Gly 


Cys 


Asp 


Ala 


Pro 


He 


Leu 


Arg 


Leu 












470 










475 










480 


Phe 


Leu 


Lys 


Glu 


He 


Ala 


Ser 


Gly 


Tyr 


Thr 


Val 


Asn 


Ser 


He 


Leu 


Lys 










485 










490 










4 95 




Ser 


Asn 


GJ n 


Val 


Ser 


Ala 


He 


Asp 


Ser 


He 


Phe 


Asp 


Cys 


Leu 


Asn 


Asn 








500 










505 








510 






Gly 


Ser 


His 


His 


Phe 


Tyr 


Phe 


Asn 


Thr 


His 


Gly 


Met 


Pro 


Thr 


Val 


Trp 






515 










520 










525 








Gly 


He 


Gly 


Gin 


Gly 


Leu 


Asp 


Val 


Asn 


Thr 


Leu 


Thr 


Ala 


Arg 


Leu 


Asn 




530 










535 










540 










Asn 


Thr 


Ser 


Ser 


Gin 


Gly 


Leu 


Cys 


Thr 


Ser 


Leu 


Ser 


Cys 


Ser 


Ser 


Ala 


545 










550 










555 










560 


Val 


Ala 


As'p 


Ser 


Thr 


He 


Arg 


Ser 


Leu 


Gly 


Glu 


Val 


Leu 


Thr 


Thr 


Tyr 










565 










570 










575 




Ala 


Pro 


Asn 


Lys 


Gly 


Phe 


Ser 


Ala 


Phe 


Leu 


Gly 


Gly 


Ser 


Arg 


Ala 


Thr 








580 










585 










590 






Gin 


Tyr 


Ala 


Val 


Tyr 


Leu 


Glu 


Gly 


Pro 


Cys 


Pro 


Pro 


Ser 


Glu 


Phe 


Tyr 






595 










600 










605 








Glu 


Tyr 


Leu 


Pro 


Tyr 


Ser 


Leu 


Tyr 


His 


Asn 


Leu 


Ser 


Thr 


Val 


Val 


Gly 




610 










615 










620 










Gl u 


Met 


Leu 


Leu 


Ser 


Ser 


He 


He 


Asn 


Thr 


Asn 


Ser 


Val 


Asp 


Thr 


Tyr 


625 










630 










635 










640 


Ser 


Lys 


Phe 


Asn 


Phe 


Asn 


Leu 


Leu 


Gly 


Asp 


Pro 


Ala 


Leu 


Asn 


He 


Met 










645 










650 










655 




Ala 


His 


Gly 


Met 


Gl u 


Val 


Ser 


Asn 


Cys 


He 


Thr 


Leu 


Pro 


Asn 


Asn 


Thr 








660 










665 










670 






lie 


He 


Ser 


Ser 


Pro 


He 


Thr 


He 


Lys 


Asn 


Gly 


Gly 


Cys 


Leu 


Lys 


He 






67 5 










680 










685 








Pro 


Glu 


Lys 


Gly 


Val 


Leu 


His 


Phe 


Thr 


Asn 


Asn 


Gly 


Ser 


He 


Gin 


Val 




690 










695 










700 










Met 


Ser 


Gly 


Gly 


Thr 


Leu 


Glu 


He 


Gly 


Asn 


Gin 


Ala 


Lys 


He 


Ser 


Gly 


705 










710 










715 










720 


Glu 


Thr 


Gl y 


Ala 


Asn 


Pro 


Thr 


Phe 


He 


Thr 


Val 


Tyr 


Gly 


Asp 


Gly 


Leu 










725 










730 










735 




Ala 


He 


Asn 


Lys 


Gin 


Val 


Glu 


He 


Asp 


Asn 


He 


Asp 


Arg 


Leu 


Asn 


Leu 








740 










745 










750 






Phe 


Ser 


Thr 


His 


Ser 


Val 


Met 


Pro 


Lys 


Phe 


His 


Phe 


Asp 


Ser 


Val 


Lys 






755 










760 










765 








Phe 


Asn 


Ser 


Ala 


Pro 


Leu 


Tyr 


Thr 


Thr 


Asn 


Cys 


He 


Val 


Glu 


He 


Ser 




770 










775 










730 










Asn 


Cys 


Glu 


Phe 


Thr 


Asn 


Arg 


Ser 


Asp 


He 


He 


Ser 


Lys 


Asn 


Cys 


Asp 


785 










790 










795 










800 


Leu 


Ser 


Val 


Glu 


Asn 


Ser 


Met 


Phe 


Ser 


Ser 


Ser 


Gly 


He 


Thr 


Val 


Phe 










805 










810 








815 




Lys 


Pro 


Met 


Ala 


Thr 


Ser 


Ser 


He 


Thr 


Gly 


Leu 


Ser 


Thr 


Lys 


Ala 


Lys 








820 










825 










830 






lie 


Thr 


Asp 


Asn 


Thr 


Phe 


Phe 


Ala 


Thr 


Gly 


Asn 


Phe 


Ala 


Tyr 


His 


He 






835 










840 










845 








Thr 


Asn 


Thr 


Pro 


Gly 


Leu 


Thr 


Ala 


Thr 


Ser 


Asn 


Ala 


Ala 


He 


Lys 


Leu 




850 










855 










860 










A^p 


Asn 


He 


Pro 


Glu 


Tyr 


Tyr 


He 


Ser 


Gly 


Asn 


Lys 


He 


Val 


Asn 


Cys 


865 










870 










875 










880 


Asp 


Glu 


Ala 


Leu 


Val 


Leu 


Asn 


Asn 


Ser 


Gly Asn 


Arg 


Thr 


Asn 


Arg 


Leu 










885 










890 










895 




His 


Asn 


He 


Thr 


Arg 


Asn 


Val 


He 


Lys 


Asn 


Cys 


Arg 


He 


Gly 


Ser 


Thr 








900 










905 










910 






Leu 


Tyr 


Asn 


Ssr 


Tyr 


Gly 


He 


Tyr 


Asn 


Arg 


Asn 


Lys 


He 


Ser 


Asn 


Asn 






915 










920 










925 








Hie 


He 


Gly 


Val 


Arg 


Leu 


Leu 


Asn 


Asn 


Ser 


Cys 


Phe 


Tyr 


Phe 


Asp 


Asn 




930 










935 










940 










Ala 


Pro 


Val 


He 


Asn 


Glu 


Gl u 


Asp 


Lys 


Gin 


Thr 


Phe 


He 


Ser 


Asn 


Arg 


945 










950 










955 










960 


Thr 


Trp 


Gin 


Leu 


Tyr 


Ser 


Ser 


Asn 


Gly Thr 


Phe 


Pro 


Leu 


Asn 


Phe 


His 










965 










970 










975 




Tyr 


Asn 


Ser 


Leu 


Gin 


Gly 


Gly 


Asp 


Thr 


Asp 


Thr 


Trp 


lie 


Tyr 


Asn 


Asp 








980 










985 










990 






Thr 


Tyr 


Thr 


Asn 


Arg 


Tyr 


He 


Asp 


Val 


Ser 


Asn 


Asn 


His 


Trp 


'3iy 


Asn 






995 










1000 








1005 






Asn 


Asp 


Leu 


Phe 


Asp 


Pro 


Asn 


Gl n 


Val 


Phe 


Asn 


Thr 


Pro 


Asp 


Leu 


Phe 




1010 








1015 








1020 






lie 


Trp 


He 


Pro 


Phe 


Trp Asp 


Gly 


Leu 


Pro 


Asn 


Gly Arg 


Ser 


Gly 


Asn 


1025 








1030 








1035 








3 


Ser 


Ser 


Ala 


Glu 


Ala 


Val 


Glu 


Phe 


Gin 


Thr 


Ala 


Leu 


Asp 


Cys 


He 


Gly 










1045 








1050 






1055 


Asn 


Ser 


Asp 


Tyr 


Leu 


Ser 


Ala 


Lys 


Val 


Ala 


Leu 


Lys 


Met 


Met 


Val 


Glu 
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• 
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1060 1065 1070 

Thr Tyr Pro Glu Ser Asp Phe Ala lie Ala Ala Leu hys Glu Leu Phe 

1075 1080 1085 

Arg lie Glu hys Met Ser Gly Asn Asp Tyr Glu Gly Leu Lys Asp Tyr 
5 1090 1095 1100 

Phe Arg Ser Asn Pro Thr lie lie Ser Ser Gin Asn Leu Phe Pro Thr 
1105 1110 1115 1120 

Ala Asp Phe Leu Ser Ala Arg Cys Asp lie Val Cys Glu Asn Tyr Gin 
1125 1130 1135 

10 Ser Ala lie Asp Trp Tyr Glu Asn Arg Leu Asn Ser Glu lie Ser Tyr 
1140 1145 1150 

Gin Asp Ser Val Phe Ala Val lie Asp Leu Gly Asp lie Tyr Trp Asn 

1155 1160 1165 

Met Gin Leu Asp Ser Leu Arg Gly Thr Gly lie Asp Leu Asn He Leu 
15 1170 1175 1180 

Ser Cys Glu Gin Arg Lys Ser Leu Glu Ser His Gin Asn Val Lys Asn 
1185 1190 1195 1200 

Tyr Leu Leu Ser Thr Leu Pro Glu Ser Thr Gly Thr Leu Leu Pro Pro 
1205 1210 1215 

20 Leu Glu Cys Asn Lys Ser Ser Leu Asp Lys Ser Lys He He Ser He 
1220 1225 1230 

Ser Pro Asn Pro Ala Lys Ala Val Val Thr He He Tyr Tyr Thr Asp 

1235 1240 1245 

Asn Pro Ser Cys Ser Val He Lys He Tyr Gly He Asn Gly Ala Ser 
25 1250 1255 1260 

Ala Asp He Thr Gly Leu Pro Lys His Leu Ser Glu Gly Tyr Tyr Ser 
1265 1270 1275 1280 

He Gin Phe Asn Thr Ser Asn Phe Asp Pro Gly Phe Tyr Leu Val Thr 
1285 1290 1295 

30 Leu Asn Val Asp Gin Lys He He Asp Thr Glu Lys Leu Arg He Lys 
1300 1305 1310 

(2) INFORMATION FOR SEQ ID NO: 33 9 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 938 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



45 



50 



(Yi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAI-IE/KEY: misc__f eature 

(B) LOCATION 1 ... 938 

(xi) SEQUENCE DESCRIPTION: SEQ ID 110:33 9 



55 



60 



65 



70 



75 



Ser 


Glu 


Asn 


Tyr 


Arg 


Tyr 


Gly 


Lys 


He 


Thr 


Asn 


Gin 


He 


Met 


Ala 


He 


1 








5 










10 










15 




Met 


Met 


Lys 


Ser 


He 


Val 


Phe 


Arg 


Ala 


Phe 


Leu 


Thr 


He 


Leu 


Leu 


Ser 








20 










25 










30 






Trp 


Ala 


Ala 


He 


Thr 


Asn 


Pro 


Thr 


Ala 


Gin 


Glu 


He 


Ser 


Gly 


Met 


Asn 






35 










40 










45 








Ala 


Ser 


Cys 


Leu 


Ala 


Ala 


Pro 


Ala 


Gin 


Pro 


Asp 


Thr 


He 


Leu 


Tyr 


Glu 




50 










55 










60 








Ser 


Phe 


Glu 


Asn 


Gly 


Pro 


Val 


Pro 


Asn 


Gly 


Trp 


Leu 


Glu 


He 


Asp 


Ala 


65 










70 










75 








80 


Asp 


Ala 


Asp 


Gly 


Ala 


Thr 


Trp 


Gly 


Ser 


Pro 


Ser 


Gly 


Ser 


Phe 


Ser 


Val 










85 










90 










95 




Pro 


Tyr 


Gly 


His 


Asn 


Gly 


Leu 


Cys 


Thr 


Tyr 


Ser 


His 


He 


Arg 


Ser 


Gly 








100 










105 










110 






He 


Ser 


Thr 


Ala 


Gly 


Asn 


Tyr 


Leu 


He 


Thr 


Pro 


Asn 


He 


Glu 


Gly 


Ala 






115 










120 










125 








Lys 


Arg 


Val 


Lys 


Tyr 


Trp 


Val 


Cys 


Asn 


Gin 


Tyr 


Ser 


Thr 


Asn 


Pro 


Glu 




130 










135 










140 










His 


Tyr 


Ala 


Val 


Met 


Val 


Ser 


Thr 


Thr 


Gly 


Thr 


Ala 


He 


Gl u 


Asp 


Phe 


145 










150 










155 








160 


Val 


Leu 


Leu 


Phe 


Asp 


Asp 


Ser 


He 


Thr 


Gly 


Lys 


Pro 


Thr 


Pro 


Leu 


Val 










165 










170 










175 




Trp 


Arg 


Arg 


Arg 


He 


Val 


Asp 


Leu 


Pro 


Glu 


Gly 


Thr 


Lys 


Tyr 


He 


Ala 
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180 185 190 

Trp Arg His Tyr Lys Val Thr Asp Ser His Thr Glu Phe Leu Lys Leu 

195 200 205 

Asp Asp Val Thr Val Tyr Arg Ser He Glu Gly Pro Glu Pro Ala Thr 
5 210 215 220 

Asp Phe Thr Val He Asn He Gly Gin Asn Val Gly Arg Leu Thr Trp 
225 230 235 240 

Asn Tyr Pro Glu Asp Tyr Gin Pro Glu Gly Lys Gly Asn Glu Glu Leu 
245 250 255 

10 Gin Leu Ser Gly Tyr Asn He Tyr Ala Asn Gly Thr Leu Leu Ala Gin 
260 265 270 

He Lys Asp Val Ser He Leu Glu Tyr Val Asp Ser Thr Tyr Ser Leu 

275 280 285 

Arg Asp Asn Pro Leu Gin Val Glu Tyr Cys Val Thr Ala Val Tyr Asp 
15 290 295 300 

Glu Ser He Glu Ser Ser Thr Val Cys Gly Thr Leu His Tyr Ala Thr 
305 310 315 320 

Asp Ala He Leu Tyr Glu Asn Phe Glu Asn Gly Pro Val Pro Asn Gly 
325 330 335 

20 Trp Leu Val He Asp Ala Asp Gly Asp Gly Phe Ser Trp Gly His Tyr 
340 345 350 

Leu Asn Ala Tyr Asp Ala Phe Pro Gly His Asn Gly Gly His Cys Ser 

355 360 365 

Leu Ser Ala Ser Tyr Val Pro Gly He Gly Pro Val Thr Pro Asp Asn 
25 370 375 380 

Tyr Leu He Thr Pro Lys Val Glu Gly Ala Lys Arg Val Lys Tyr Trp 
385 390 395 400 

Val Ser Thr Gin Asp Ala Asn Trp Ala Ala Glu His Tyr Ala Val 1 let 
405 410 415 

30 Ala Ser Thr Thr Gly Thr Ala Val Gly Asp Phe Val He Leu Phe Glu 
420 425 430 

Glu Thr Met Thr Ala Lys Pro Thr Gly Ala Trp Tyr Glu Arg Thr He 

435 440 445 

Asn Leu Pro Glu Gly Thr Lys Tyr He Ala Trp Arg His Tyr Asn Cys 
35 450 455 460 

Thr Asp He Tyr Phe Leu Lys Leu Asp Asp He Thr Val Phe Gly Thr 
465 470 475 480 

Pro Ala Ser Glu Pro Glu Pro Val Thr Asp Phe Val Val Ser Leu He 
485 490 495 

40 Glu Asn Asn Lys Gly Arg Leu Lys Trp Asn Tyr Pro Asn Gly Tyr Glu 
500 505 510 

Pro Asp Lys Thr Asp Asp Lys Asp Pro Leu Gin Leu Ala Gly Tyr Asn 

515 520 525 

He Tyr Ala Asn Gly Ser Leu Leu Val His He Gin Asp Pro Thr Val 
45 530 ' 535 540 

Leu Giu Tyr He Asp Glu Thr Tyr Ser Ser Arg Asp Asp Gin Val Glu 
545 550 555 560 

Val Glu Tyr Cys Val Thr Ala Val Tyr Asn Asp Asn He Glu Ser Gin 
565 570 575 

50 Ser Val Cys Asp Lys Leu He Tyr Asp Ser Gin Ser Asp He He Leu 
580 585 590 

Tyr Glu Gly Phe Glu Ala Gly Ser He Pro Glu Gly Trp Leu Leu He 

595 600 605 

Asp Ala Asp Gly Asp Asn Val Asn Trp Asp Tyr Tyr Pro Trp Thr Met 
55 610 615 620 

Tyr Gly His Asp Ser Glu Lys Cys He Ala Ser Pro Ser Tyr Leu Pro 
625 630 635 640 

Met He Gly Val Leu Thr Pro Asp Asn Tyr Leu Val Thr Pro Arg Leu 

645 650 655 

Glu Gly Ala Lys Leu Val Lys Tyr Trp Val Ser Ala Gin Asp Ala Val 

660 665 670 

Tyr Ser Ala Glu His Tyr Ala Val Met Val Ser Thr Thr Gly Thr Ala 

675 680 685 

Val Glu Asp Phe Val Leu Leu Phe Glu Glu Thr Met Thr Ala Lys Ala 
o5 690 695 700 

Asn Gly Ala Trp Tyr Glu Arg Thr He Thr Leu Pro Ala Gly Thr Lys 
'^05 710 715 720 

Tyr He Ala Trp Arg His Tyr Asp Cys Thr Asp Met Phe Phe Leu Leu 
725 730 735 

/U Leu Asp Asp He Thr Val Tyr Arg Ser Thr Glu Thr Val Pro Glu Pro 
740 745 750 

Val Thr Asp Phe Val Val Ser Leu He Glu Asn Asn Lys Gly Arg Leu 

755 760 765 

Lys Trp Asn Tyr Pro Asn Gly Tyr Glu Pro Asp Lys Thr Asp Asp Lys 
/5 770 775 780 
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Pro 


Leu Gin 


Leu 


Thr 


Gly 


Tyr 


Asn 


He 


Tyr 


Al a 


Asn 


Gly 


Ser 


Leu 




7 85 








1 90 










7 95 










800 






Va 1 


li i £ lie 


Gin 


Asp 


Pro 


Thr 


Val 


Leu 


Glu 


Tyr 


He 


Asp 


Glu 


Thr 










805 










810 










815 








er 


Ser Ar^ 


Asp 


vj J, y 


vjj. n 


V a J. 


Glu 


M*^ t 


\JX u 


Tyr 


Cys 


V a X 


1 nr 


Ml a 




























P "5 n 
o J u 








Val 


Tyr 


Asn Asp 


Asn 


He 


(jIu 


Ser 


Gin 


Se t 


Val 


Cys 


Asp 


Lys 


Leu 


Asn 








835 








840 










845 








10 


Tyr 


Thr 


He Thr 


Ser 


Leu 


Asp 


Asn 


He 


Gin 


Ser 


Asp 


Thr 


Ser 


Leu 


Lys 




850 








855 










860 












lie 


Tyr 


Pro Asn 


Pro 


Ala 


Ser 


Tyr 


Val 


Val 


Arg 


He 


Glu 


Gly 


Leu 


Ser 




865 








870 










875 










8 80 




Arg 


Ser 


Lys Ser 


Thr 


He 


Glu 


Leu 


Tyr 


Asn 


Ala 


Leu 


Gly 


He 


Cys 


He 


15 








885 










890 










8 95 




Leu 


Arg 


Glu Glu 


Thr 


His 


Ser 


Glu 


Lys 


Thr 


Glu 


He 


Asp 


Val 


Ser 


Arg 








900 










905 










910 








Leu 


Asn 


Asp Gly 


Val 


Tyr 


Leu 


He 


Lys 


Val 


Val 


Gly 


Gly 


Asn 


Lys 


Thr 








915 








920 










925 








20 


Thr 


Thr 


Glu Lys 


Val 


Glu 


He 


Lys 


Arg 


Pro 
















930 








935 






















(2) 


IJIE'ORI lATIOM 


FOR 


SEQ 


ID NO: 3^10 




















(i ) 


SEQUEHCE CHARACTERISTICS; 




















(A) LEMGTH: 606 amino 


acids 




















(B) Tj 


'PE: 


amino acid 
























(D) TOPOLOGY: 


linear 


















ou 




(ii) 


MOLECULE TYPE: 


protein 


















































Uii) 


HYPOTHETICAL: YES 
























(vi) 


ORIGItlAL SOURCE: 




















oo 






{A} ORG All ISM: 


Porphyromon; 


iS gingivalis 










(ix) 


FEATURE : 






























(A) NAI4E/KEY: 


misc feature 




















(B) LOCATION 1 . . . 606 


















AO 




(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 340 












lie 


He 


Phe Cys 


Thr 


He 


His 


His 


Ser 


Glu 


Leu 


Glu 


He 


Met 


Asn 


Ser 




1 






5 










10 










15 






He 


Met 


Lys Tyr 


Gin 


Leu 


Tyr 


Thr 


Ala 


Val 


He 


Met 


Ala 


Leu 


Ser 


Val 






20 










25 










30 








Ser 


Ser 


Val Cys 


Gly 


Gin 


Thr 


Pro 


Arg 


Asn 


Thr 


Glu 


Thr 


Lys 


Arg 


Pro 








35 








40 










45 










Asp 


Thr 


Leu Arg 


Arg 


Glu 


Leu 


Thr 


He 


Val 


Asn 


Asp 


Gin 


Thr 


Val 


Glu 


ou 




50 








55 










60 










Met 


Glu 


His Ala 


Asp 


Pro 


Leu 


Pro 


Ala 


Ala 


Tyr 


Lys 


Ala 


He 


Glu 


Pro 




65 








70 










75 










80 




Arg 


Leu 


Lys Pro 


Phe 


Arg 


Pro 


Glu 


Tyr 


Asn 


Lys 


Arg 


Thr 


Phe 


Gly 


Phe 










85 










90 










95 






Va] 


Pro 


Glu Val 


Ser 


Ser 


Ser 


Gly 


Arg 


Asn 


Asn 


Leu 


Pro 


Asn 


He 


Leu 






100 










105 










110 








Pro 


Thr 


Glu Gly 


His 


Met 


Lys 


His 


Arg 


Gly 


Tyr 


Leu 


Asn 


He 


Gly 


He 








115 








120 










125 










Gly 


His 


Thr Leu 


Asn 


Gin 


Arg 


Met 


Asp 


Ala 


Gl y 


Tyr 


Arg 


Leu 


He 


Asp 


60 




130 








135 










140 










Ala 


Glu 


Gin Glu 


Arg 


Leu 


Asn 


Leu 


Phe 


Leu 


Ser 


Tyr 


Arq 


Gly 


Met 


Lys 




1'15 








150 










155 










160 




Ser 


Ala 


Phe Asn 


Thr 


Gly 


Asp 


Phe 


Asp 


Gly 


Asp 


Arg 


Lys 


Asp 


Arg 


Arg 










165 










170 














65 


Met 


Met 


Ala Gly 


Val 


Asp 


Tyr 


Glu 


Gin 


Arg 


Arg 


Pro 


Ser 


Phe 


Val 


Leu 






180 










185 








190 








Ala 


Thr 


Gly Leu 


Tyr 


Tyr 


Ser 


Asn 


His 


Tyr 


Phe 


Asn 


Asn 


Tyr 


Gly 


Arg 








195 








200 










205 










Gly 


Ala 


Thr Thr 


Asn 


Val 


Gly 


Ser 


He 


Pro 


Gin 


Leu 


Ser 


Thr 


Pro 


Val 


70 




210 








215 










220 










Thr 


Pro 


Gin Met 


Asp 


Asn 


Gly 


Thr 


His 


Asn 


Val 


Arg 


Val 


Tyr 


Leu 


Gly 




225 








230 










235 










240 




Ala 


Lys 


Asn Asp 


Val 


He 


Asp 


Ala 


Arg 


He 


Asp 


Tyr 


Arg 


Phe 


Phe 


Arg 










245 










250 










255 




75 


Ser 


He 


Pro Tyr 


Leu 


Gly 


Thr 


Asp 


Pro 


Me t 


Lys 


Ala 


Leu 


Thr 


Glu 


His 






260 










265 








270 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Thr 


Pro 


G 1 u 


L'^'U 


Asn 


1 

V SIX 


Thr 


Met 


Ser 


Asn 


Glu 


Leu 


Ser 


Asp 


Asp 


lie 






27 5 










9 R n 










285 






Lys 


L'='U 


Gly 


Va 1 


Gl u 


Va 1 


Arg 


1 nr 


Giy 




Leu 


trne 


Phe 


Ala 


Lys 


Asn 




2 90 










2 95 










300 










Ser 


Gl u 


Met 


lie 


Gin 


Thr 


Gly 


V a J. 




oe r 


LjJ. u 


Th r 


Asp 


Arg 


Asn 


Leu 


305 










310 










J ± o 








320 




T v.- r 


Va 1 


Gl u 


Lii y 


Al a 


Pro 


I nr 


J. ± e 


J. y 


Phe 


V a 1 


Gly Asp 


Ser 


Asp 










325 










J J 'J 










■3 "3 t; 


sn 


1]*=' t 


Gin 


1 rp 


Gem 

rts n 


r J, e 


Lain 


Al a 


Gly 


V a 1 


Gly 


lie 


Ser 


Ser 


His 


Phe 








34 0 










34 5 










350 








Al a 


Jjj s 


Gly 


Arg 




r ne 


r^ne 


Trp 


Pro 


Lys 


Leu 


Asp 


Ala 


Ser 


Leu 






355 










A 










365 








Ser 


lie 


Phe 


ro 


Ser 


i rp 


Arg 


V a X 


Tyr 


t\x a 


Lys 


r\± a 


Phe 


Gly 


Gly 


Va 1 




37 0 










"7 










380 










J. J. e 


Ar<3 


Asn 


La J. y 


Leu 


n 1 = 
rtl 3 


Asp 


Va 1 


I'le t 


Gin 


Glu 


Glu 


Met 


Pro 


Tyr 


Leu 


3f^5 










J i?U 










3 95 








400 


I lec 


r ro 


As n 


Tnr 


lie 


V a ± 


Leu 


Pro 


Ser 


Arg 


Asn 


Ala 


Leu 


Thr 


Ala 


Gin 










4 05 










410 










415 




Leu 


Gly 


Va 1 


Lys 


Gly 


Asn 


lie 


Ala 


Asp 


Vai 


Val 


Arg 


Met 


Glu 


Val 


Tyr 








4 20 










4 25 










430 






Asp 


Pne 


Se r 


Lys 


Leu 


Thr 


Gly 


Val 


Pro 


Phe 


Tyr 


Thr 


Pro 


Thr 


Leu 
















4 4 0 










4 4 5 








Pro 


Leu 


Tyr 


Asn 


Pro 


Ser 


Asp 


Leu 


Tyr 


Gin 


Tyr 


Asn 


Val 


Ser 


Phe 


Leu 




4 50 










4 55 










460 










Pro 


J. J. e 


1 yr 


Al a 


Asp 


Gly 


Ser 


Arg 


Trp 


Arg 


Ala 


Giy 


Gly 


Lys 


Leu 


Glu 












4 7 0 










4 7 5 










4 80 


1 y L 


e r 


I J r 


Arg 


Asp 


Met 


Leu 


Arg 


Phe 


Leu 


Va 1 


Asp 


Ala 


Ser 


Tyr 


Gly 










4 85 










4 90 










4 95 




i r p 


Asn 


L*^u 


Asp 


vj J. y 


Gly 


Leu 


Va 1 


Al a 


Ser 


Met 


Gin 


Pro 


Asp 


Leu 








500 










OU ~) 










510 




lie 


Leu 


Ly^ 


Al a 


Gl u 


Val 




V d X 


nl s 


Pro 


-Lie 


Ml a 


Pro 


Leu 


Asp 


Val 






515 










520 










525 








Arg 


Leu 


Arg 


Tyr 


Thr 


Gin 


Leu 


Asn 


Gly 


Arg 


Tyr 


Arg 


Tyr 


Ser 


Phe 


Gly 




530 










535 










540 








Ser 


Ala 


Gly 


Ser 


Glu 


Ala 


Leu 


Gly 


lie 


Gly 


Asn 


Val 


His 


Leu 


Leu 


Ser 


5^5 










550 










555 










560 


Ala 


Asp 


Val 


Ser 


Tyr 


Lys 


Leu 


Lys 


Lys 


Asn 


Leu 


Ser 


Leu 


Tyr 


Leu 


Lys 










565 










570 










575 




lie 


Asp 


Asn 


Met 


Leu 


Ala 


Glu 


Thr 


Thr 


Glu 


Leu 


He 


Gly 


Tyr 


Tyr 


Pro 








580 










585 








590 




Met 


Gin 


Pro 


Phe 


His 


Cys 


Phe 


Ala 


Gly 


Phe 


Ser 


Trp 


Thr 


Phe 










595 










600 










605 









( 2 ) IN FORf lATI OH FOR SEQ ID MO : 3 4 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 357 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAME/KEY: mi sc_f ea tur e 

(B) LOCATION 1 . . . 357 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 341 



65 


Asn 

1 


Ser 


His 


Ala 


Thr 
5 


Val 


Ser 


Val 


He 


Cys 
10 


Ser 


Met 


Met 


Glu 


Lys 
15 


Cys 




He 


Phe 


Ala 


His 
20 


Tyr 


Pro 


His 


Asn 


Leu 
25 


Val 


Phe 


Met 


He 


Arg 
30 


Lys 


His 




Phe 


Gly 


He 


He 


Leu 


Gly 


Phe 


Leu 


Ser 


Leu 


Val 


Phe 


Ser 


Ala 


Gly Ala 


70 






35 










40 










45 








Gin 


Gin 

50 


Glu 


Lys 


Gin 


Val 


Phe 
55 


His 


Phe 


Leu 


Asn 


Leu 

60 


Pro 


Ala 


Thr 


Ala 




Gin 


Ala 


Leu 


Ala 


Ala 


Gly 


Giy 


Lys 


Ala 


He 


Thr 


He 


Val 


Asp 


Asp 


Asn 




65 










70 










75 








80 


75 


Pro 


Gly 


Leu 


Ala 


Phe 


Glu 


Asn 


Pro 


Ala 


Leu 


Leu 


Gly 


Tyr 


Glu 


Ser 


Gly 










85 










90 






95 
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Gly 


Arg 


Ala 


Phe 
100 


Leu 


Ser 


Tyr 


Leu 


Tyr 
105 


Tyr 


Met 


Ser 


Gly 


Ser 
110 


His 


Met 




Gly 


Asn 


Ala 
115 


Cys 


Tyr 


Ala 


Ser 


Ser 

120 


Val 


Gly 


Glu 


Arg 


Gly 
125 


Met 


Trp 


Gly 


5 


Val 


Gly 
130 


I'let 


Arg 


Phe 


Leu 


Asn 
135 


Tyr 


Gly 


Ser 


Met 


Gin 
14 0 


Gly 


Tyr 


Asp 


Gin 




Asn 


Ala 


lie 


Ala 


Thr 


Gly 


Ser 


Phe 


Ser 


Ala 


Ser 


Asp 


He 


Ala 


Val 


Gin 




145 










150 










155 










160 




Gly 


Phe 


Tyr 


Ser 


His 


Glu 


Leu 


Ser 


Asn 


His 


Phe 


Arg 


Gly 


Gly 


Val 


Ser 


10 








165 










170 










175 






Leu 


Lys 


Ala 


Leu 
180 


Tyr 


Ser 


Ser 


He 


Glu 
185 


Thr 


Tyr 


Ser 


Ser 


Phe 
190 


Gly 


Leu 




Gly 


Val 


Asp 
195 


Val 


Gly 


He 


Ser 


Tyr 
200 


Tyr 


Asp 


Asp 


Asp 


Lys 
205 


Gly 


Tyr 


Ser 


15 


Ala 


Ser 
210 


Ala 


Leu 


Phe 


Lys 


Asn 
215 


Val 


Gly 


Ala 


Gin 


Leu 
220 


Lys 


Gly 


Tyr 


Asn 




Glu 


Glu 


Arg 


Glu 


Pro 


Leu 


Asp 


Trp 


Asp 


Phe 


Gin 


Leu 


Gly 


Phe 


Ser 


Arg 




225 










230 










235 










240 


20 


Ser 


Phe 


lie 


Asn 


Ala 


Pro 


Phe 


Arg 


Leu 


His 


He 


Thr 


Leu 


Phe 


Asn 


Leu 










245 










250 










255 






Asn 


Pro 


His 


Tyr 
260 


Phe 


Lys 


Arg 


Leu 


Val 
265 


Pro 


Arg 


Asp 


Leu 


Ser 
270 


Lys 


Met 




Gin 


Lys 


Phe 


Leu 


Arg 


His 


Phe 


Ser 


He 


Gly 


Ala 


Glu 


Phe 


Thr 


Pro 


Ser 


25 






275 










280 










285 








Glu 


Arg 
2 90 


Phe 


Trp 


Val 


Gly 


Leu 
295 


Gly 


Tyr 


Thr 


Pro 


Gl n 
300 


He 


Ala 


Gin 


Asp 




Phe 


Glu 


Val 


Glu 


Gly 


Gly 


Asn 


Lys 


Trp 


Gly 


Gly 


Leu 


Ser 


Ala 


Gly 


Val 




305 










310 










315 










320 




Gly 


Phe 


Thr 


Ser 


Gly 


Val 


Val 


Arg 


Val 


Gly 


Val 


Ser 


Ala 


Ala 


Thr 


Tyr 


30 










325 










330 










335 






His 


Pro 


Ala 


Ala 
34 0 


Leu 


Ser 


Phe 


Met 


Cys 
345 


Ser 


Val 


Gly 


He 


Arg 
350 


Leu 


Asp 




Asp 


Lys 


Ser 
355 


lie 


Phe 

























35 



55 



60 



65 



70 



75 



[2) III FOR[ 1 AT I ON FOR S EQ ID MO : 3 4 2 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 337 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 

45 (iii) HYPOTHETICAL: YES 

(vi) ORIGIUAL SOURCE: 

(A) ORGAIIISEt: Porphyromonas gingival is 

50 (ix) FEATURE: 

(A) MAIIE/KEY: mis cofeature 

(B) LOCATION 1 ... 337 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 2 



Pro 


Gin 


Pro 


Val 


Gly 


Leu 


Lys 


Glu 


He 


Thr 


He 


Lys 


Pro 


Met 


Cys 


Leu 


1 








5 










10 










15 




Glu 


Pro 


He 


He 
20 


Ala 


Pro 


He 


Ser 


Ser 
25 


Glu 


Leu 


Leu 


Glu 


Gin 
30 


Glu 


Leu 


Thr 


Ala 


Asp 
35 


Arg 


Phe 


Leu 


Arg 


Met 
40 


Thr 


Asn 


Lys 


Ala 


Gly 
45 


Asn 


Glu 


He 


Tyr 


Val 

50 


Phe 


Thr 


Ala 


Glu 


Glu 
55 


Ala 


Pro 


His 


Cys 


Met 

60 


Lys 


Glu 


Val 


Gly 


Arg 


Leu 


Arg 


Glu 


Glu 


Ala 


Phe 


Arg 


HiE 


Tyr 


Gly 


Gly 


Gly 


Thr 


Gly 


Lys 


65 










70 










75 










80 


Ala 


He 


Asp 


He 


Asp 
85 


Glu 


Phe 


Asp 


Thr 


Met 
90 


Pro 


Gly 


Ser 


Tyr 


Lys 
95 


Gin 


Leu 


li- 


Val 


Trp 
100 


Asp 


Pro 


Gin 


Asn 


Lys 
105 


Ala 


He 


Leu 


Gly 


Gly 
110 


Tyr 


Arg 


Phe 


ne 


Tyr 
115 


Gly 


Arg 


Asp 


Val 


Ala 
120 


Phe 


Asp 


Thr 


Asp 


Gly 
125 


Lys 


Pro 


Leu 


Leu 


Ala 
130 


Thr 


Ala 


Glu 


Met 


Phe 
135 


Arg 


Phe 


Sec 


Asp 


Ala 
14 0 


Phe 


Leu 


His 


Asp 


Tyr 


Leu 


Pro 


Tyr 


Thr 


Val 


Glu 


Leu 


Gly 


Arg 


Ser 


Phe 


Val 


Ser 


Leu 


Gin 


145 










150 






155 










160 
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10 



15 



20 



25 



45 



50 



55 



60 



65 



70 



75 



Tyr 


Gin 


Ser 


Thr 


Arg 


Met 


Gly 


Thr 


Lys 


Ala 


He 


Phe 


Val 


Leu 


Asp 


Asn 










165 










170 










175 




Leu 


Trp 


Asp 


Gly 


He 


Gly 


Ala 


Leu 


Thr 


Val 


Val 


Asn 


Pro 


Glu 


Ala 


Leu 








180 










185 










190 






Tyr 


Phe 


Tyr 


Gly 


Lys 


Val 


Thr 


Met 


Tyr 


Lys 


Asp 


Tyr 


Asp 


Arg 


Arg 


Ala 






195 










200 










205 








Arg 


Asn 


Leu 


He 


Leu 


Tyr 


Phe 


Leu 


Arg 


Lys 


His 


Phe 


Ser 


Asp 


Pro 


Glu 




210 










215 










220 










Gly 


Leu 


Val 


Lys 


Pro 


He 


His 


Pro 


Leu 


Pro 


He 


Glu 


He 


Ser 


Ala 


Glu 


225 










230 










235 










240 


Asp 


Glu 


Ala 


Leu 


Phe 


Ser 


Ser 


Ser 


Asp 


Phe 


Asp 


Thr 


Asn 


Tyr 


Lys 


Thr 










245 










250 










255 




Leu 


Asn 


He 


Glu 


Val 


Arg 


Lys 


Leu 


Gly 


He 


Asn 


He 


Pro 


Pro 


Leu 


Val 








260 










265 










270 






Ser 


Ala 


Tyr 


He 


Ala 


Leu 


Ser 


Pro 


Glu 


Met 


Arg 


Val 


Phe 


Gly 


Thr 


Ala 






27S 










280 










285 








Val 


Asn 


Gl u 


Ser 


Phe 


Gly 


Glu 


Val 


Glu 


Glu 


Thr 


Gly 


He 


Phe 


He 


Ala 




290 










295 










300 










Val 


Gly 


hys 


He 


Leu 


Glu 


Glu 


Lys 


Lys 


Gin 


Arg 


His 


He 


Glu 


Ser 


Phe 


305 










310 










315 










320 


He 


Leu 


Ser 


Arg 


Asn 


Glu 


Lys 


Lys 


Gly 


Leu 


Asp 


Ser 


Ser 


Asn 


Gly 


Arg 










325 










330 










335 




Ser 

































(2; IIJFORMATIOIJ FOR SEQ ID 110:3^3 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 amino acids 
30 (B) TYPE: amino acid 

{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

35 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

40 (ix) FEATURE: 

(A) NAl'IE/KEY: misc^f eature 
(B} LOCATION 1 . . . 566 



(>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 34 3 



Tyr 


Asp 


Gly 


Ala 


Arg 


Leu 


Val 


Tyr 


Thr 


Leu 


Phe 


Arg 


Asn 


Arg 


Asn 


Asp 


1 








5 










10 










15 




He 


His 


Pro 


Met 


Lys 


Thr 


He 


Val 


Arg 


Tyr 


Ser 


Arg 


Leu 


Pro 


Val 


Ala 








20 










25 










30 






Leu 


Phe 


Phe 


Cys 


Leu 


Leu 


Gly 


Ala 


Val 


His 


Leu 


Ser 


Val 


Glu 


Ala 


Gin 






35 










40 










45 








Met 


Leu 


Asn 


Thr 


Pro 


Phe 


Glu 


Leu 


Sec 


Asp 


Gin 


He 


Val 


Leu 


Ser 


Pro 




50 










55 










60 










Tlir 


Glu 


Arg 


Gin 


Tyr 


Arg 


Glu 


He 


Cys 


Val 


Gin 


Thr 


Lys 


Glu 


Lys 


Arg 


65 










70 










75 










80 


Gly Ala 


Asp 


Leu 


Phe 


Pro 


Leu 


Ser 


Asp 


Lys 


Leu 


Arg 


Asp 


Ser 


Ala 


Tyr 










85 










90 










95 




Val 


Arg 


Phe 


Gly 


Ser 


Ala 


Tyr 


Gly 


Asp 


He 


Ala 


Gly 


Asp 


Tyr 


Leu 


Pro 








100 










105 










110 






Tyr 


Asn 


Gly 


Asn 


Asn 


Tyr 


Ser 


Ser 


Leu 


Ser 


Leu 


Glu 


Ser 


Gly 


Gly 


Arg 






115 










120 










125 






He 


Ser 


Val 


Arg 


Asn 


Tyr 


Gly 


Thr 


Leu 


Gin 


Gly 


Ser 


Ala 


Ser 


Tyr 


Ser 




130 










135 










140 








Arg 


Gly 


Met 


His 


Lys 


Arg 


He 


Gly 


Trp 


Asn 


Ala 


Leu 


Arg 


Asn 


Ala 


Glu 


145 










150 










155 










160 


Ala 


Tyr 


Tyr 


Pro 


Tyr 


Leu 


Val 


Ser 


Asp 


Ser 


Thr 


Gly 


Gly 


Asp 


Tyr 


His 










165 










170 










175 




Phe 


Glu 


Asp 


Tyr 


Arg 


Leu 


Ala 


Gly 


Tyr 


Tyr 


Ser 


Phe 


Arg 


Ala 


Gly 


Arg 








180 










185 










190 






Leu 


Pro 


Leu 


Gly 


He 


Gly 


Phe 


Ser 


Tyr 


Arg 


Gly 


Glu 


Val 


Ala 


Tyr 


Arg 






195 










200 










205 








Leu 


Thr 


Asp 


Pro 


Arg 


Thr 


Thr 


Asn 


Thr 


Thr 


Gly 


Ala 


Leu 


Glu 


Leu 


Ser 




210 










215 








220 










Cys 


Ala 


Thr 


Ser 


Leu 


Thr 


Leu 


Pro 


Arg 


Glu 


Asn 


Arg 


Leu 


Ser 


Leu 


Ser 


225 










230 










235 








240 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_L> 




wo 99/29870 

272/490 



PCT/AU98/01023 



50 



65 



70 



75 



Ala 


Al a 


Tyr 


Leu 


Tyr 


His 


Arg 


Gl n 


His 


Leu 


Thr 


Gin 


Tyr 


Asn 


Trp 


Arg 










2 4 5 










250 










2 5 6 




Pro 


Gly 


Gin 


Gin 


Asp 


Lys 


Phe 


Phe 


Val 


Se r 


Tyr 


Gly 


Phe 


Gly 


Gin 


Val 








2 60 










265 










27 0 






Asp 


Va 1 


Ser 


Asn 


Ser 


Pro 


He 


Trp 


Phe 


Gly 


He 


Ser 


Arg 


Met 


Asn 


Tyr 






27 5 










280 










285 








Val 


Asn 


Gly 


Trp 


Lys 


Leu 


Ser 


Ser 


Arg 


Leu 


Asp 


Thr 


Arg 


Arg 


Gl y 


Asp 




2 90 










2 95 










300 










Ala 


lie 


Gly 


Leu 


Asp 


Tyr 


Ser 


Gly 


Tyr 


Phe 


Leu 


Asp 


Thr 


Glu 


Glu 


Arg 


305 










310 










315 










320 


Ser 


Ser 


lie 


Asn 


Leu 


Phe 


Ala 


Leu 


Leu 


Tyr 


Asn 


Arg 


Leu 


Arg 


Leu 


Tyr 










325 










330 










335 




Gly 


Ser 


Trp 


Hi £ 


Leu 


Ser 


Asp 


Phe 


Asp 


Phe 


Ser 


Phe 


Ser 


Ala 


Asp 


Tyr 








3-4 0 










345 










350 






Ala 


Leu 


Arg 


Gin 


Gly 


He 


Glu 


Arg 


He 


Tyr 


Glu 


Asp 


Tyr 


Lys 


Pro 


Asp 






355 










360 










365 








Asp 


Asn 


Tyr 


His 


He 


Tyr 


Asp 


Leu 


Arg 


He 


Leu 


Ala 


He 


Arg 


Arg 


Trp 




37 0 










375 










380 










Tyr 


Met 


Leu 


Asn 


Glu 


Phe 


Ser 


Ala 


Gin 


Ala 


Gin 


Ala 


Ser 


Tyr 


Arg 


He 


385 










3 90 










395 










4 00 


Arg 


Thr 


Asp 


Arg 


Gly 


Cys 


Ala 


Leu 


Arg 


Val 


Ser 


Ala 


Gly 


Ser 


Asp 


Phe 










4 05 










410 










415 




Tyr 


bl y 


Tyr 


Asp 


Gl u 


Thr 


Tyr 


Arg 


Lys 


Hi s 


Gly 


His 


His 


Thr 


Met 


Ser 








4 2 0 










4 25 










4 30 






Gly 


Met 


Leu 


Arg 


Pro 


Phe 


Ala 


Gly 


He 


Ala 


Tyr 


Asp 


His 


Ala 


Gly 


Ser 






4 35 










44 0 










4 4 5 








Lys 


Leu 


Asp 


Phe 


Gly 


Leu 


Ser 


Leu 


Ser 


Ala 


Ala 


Tyr 


Arg 


J let 


Va 1 


Leu 




450 










455 










460 










Thr 


His 


Ser 


Tyr 


Lys 


He 


Arg 


Thr 


He 


Gin 


Lys 


Glu 


Gin 


Leu 


Asp 


Tyr 


465 










470 










475 










480 


Gin 


Leu 


Ala 


Tyr 


Leu 


Pro 


Tyr 


Ala 


Tyr 


Arg 


Asn 


Arg 


Glu 


Gly 


Val 


Glu 










4 85 










4 90 










4 95 




Val 


Arg 


Ser 


Ser 


Leu 


Tyr 


Val 


Ser 


He 


Pro 


Met 


Gin 


Asn 


Thr 


His 


Arg 








500 










505 










510 






Leu 


I-let 


Thr 


Glu 


Leu 


Arg 


Leu 


Tyr 


Gly 


Asp 


Leu 


Met 


Lys 


Arg 


Lys 


Asp 






515 










520 










52 5 








Gly 


lie 


Ala 


Tyr 


Gly 


Lys 


Thr 


Pro 


Gly Val 


He 


Ser 


His 


He 


Leu 


Ser 




530 










535 










540 










Asp 


Pro 


Gin 


Ala 


Glu 


Arg 


Thr 


Ser 


Gly 


His 


Thr 


He 


Gly 


Ala 


He 


Cys 


545 










550 










555 










560 


Asn 


lie 


Ser 


Tyr 


Leu 


Phe 






























565 
























(2) 


lUFORMATIOll 


FOR 


SEQ 


ID MO: 34 4 

















10 



15 



20 



25 



30 



35 



40 



45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 819 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

55 (vi) ORIGINAL SOURCE: 

{IK} ORGANISM: Porphyromonas gingivalis 

{ix} FEATURE: 

(A) MAI-IE/KEY: misc_f eature 
60 (B) LOCATION l..,819 

(:-:i) SEQUEMCE DESCRIPTION; SEQ ID NO: 344 



Arg 


Tlir 


Asn 


Val 


Phe 


Leu 


Ser 


Leu 


Ser 


His 


Lys 


He 


Gly 


Arg 


Arg 


Gly 


1 








5 










10 










15 




Ala 


Ser 


cys 


Ser 

20 


Asn 


Arg 


Asn 


Ala 


Trp 
25 


Met 


Ala 


Glu 


Asn 


Lys 
30 


Pro 


Ser 


Ser 


Pro 


Glu 
35 


Pro 


Asp 


Asn 


Thr 


Gly 
40 


Val 


Gly 


Asn 


Ser 


Pro 
45 


Ser 


Asp 


Tyr 


Leu 


His 
50 


Gly 


Glu 


Ala 


He 


He 
55 


Pro 


Pro 


Leu 


Ser 


ser 
60 


Leu 


Ser 


Asn 


Phe 


Asn 


Asp 


Lys 


Arg 


Phe 


Met 


Lys 


Lys 


Leu 


His 


Met 


He 


Ala 


Ala 


Leu 


Ala 


65 










70 










75 










80 


Val 


Leu 


Pro 


Phe 


Cys 
85 


Leu 


Thr 


Ala 


Gin 


Ala 
90 


Pro 


Val 


Ser 


Asn 


Ser 
95 


Glu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSOOCID: <WO 9929870A1J_> 



wo 99/29870 



PCT/AU98/01023 



35 



273/490 

lie Asp Ser Leu Ser Asn Val Gin Leu Gin Thr Val Gin Val Val Ala 

100 105 110 

Thr Arg Ala Thr Ala Lys Thr Pro Val Ala Tyr Thr Asn Val Arg Lys 
115 120 125 

5 Ala Glu Leu Ser Lys Ser Asn Tyr Gly Arg Asp lie Pro Tyr Leu Leu 
130 135 140 

Met Leu Thr Pro Ser Val Val Ala Thr Ser Asp Ala Gly Thr Gly lie 
145 150 155 160 

Gly Tyr Ser Gly Phe Arg Val Arg Gly Thr Asp Ala Asn Arg lie Asn 
10 165 170 175 

He Thr Thr Asn Gly Val Pro Leu Asn Asp Ser Glu Ser Gin Ser Val 

180 185 190 

Phe Trp Val Asn Met Pro Asp Phe Ala Ser Ser He Glu Asp Leu Gin 
195 200 205 

15 Val Gin Arg Gly Val Gly Thr Ser Thr Asn Gly Ala Gly Ala Phe Gly 
210 215 220 

Ala Ser Val Asn Met Arg Thr Asp Asn Leu Gly Leu Ala Pro Tyr Gly 
225 230 235 2^0 

Arg Val Asp Leu Ser Gly Gly Ser Phe Gly Thr Phe Arg Arg Ser Val 
20 245 250 255 

Lys Leu Gly Ser Gly Arg He Gly Arg His Trp Ala Val Asp Ala Arg 

2G0 265 270 

Leu Ser Lys He Gly Ser Asp Gly Tyr Val Asp Arg Gly Ser Val Asp 
275 280 285 

25 Leu Lys Ser Tyr Phe Ala Gin Val Gly Tyr Phe Gly Ser Asn Thr Ala 
290 295 300 

Leu Arg Phe He Thr Phe Gly Gly Lys Glu Val Thr Gly He Ala Trp 
305 310 315 320 

Asn Gly Leu Ser Lys Glu Asp Glu Ala Lys Tyr Gly Arg Arg Tyr Asn 
30 325 330 335 

Ser Ala Gly Leu Met Tyr Val Asp Ala Gin Gly Val Pro His Tyr Tyr 

340 345 350 

His Asn Thr Asp Asn Tyr Glu Gin Arg His Tyr His Ala He Met Thr 

355 360 365 

His Ser Phe Ser Pro Ser Val lie Leu Asn Leu Thr Ala His Tyr Thr 

370 375 380 

Ala Gly Tyr Gly Tyr Thr Asp Glu Tyr Arg Thr Gly Arg Lys Leu Lys 
385 390 395 400 

Glu Tyr Ala Leu Gin Pro Tyr Val Glu Asn Ser Val Thr Val Lys Lys 
40 405 410 415 

Thr Asp Leu He Arg Gin Lys Tyr Leu Asp Asn Asp Phe Gly Gly Leu 

420 425 430 

He Gly Ser Leu Asn Trp His Thr Gly Ala Trp Asp Leu Gin Phe Gly 

435 440 445 

Ala Ser Gly Asn He Tyr Lys Gly Asp His Phe Gly Arg He Thr Tyr 

450 455 460 

lie Lys Lys Tyr Asn Gin Pro Leu Ala Pro Asp Phe Glu Tyr Tyr Arg 
^^^^ 470 475 480 

Asn Arg Ala Asp Lys Arg Glu Gly Ala Ala Phe Ala Lys Ala Asn Trp 

485 490 495 

Gin He Thr Pro Glu Leu Asn Met Tyr Ala Asp Leu Gin Tyr Arg Thr 

500 505 510 

He Gly Tyr Thr He Asn Gly He Thr Asp Glu Tyr Asp Glu Val Gin 

515 520 525 

Gly Ser Met Gin His He Asp Leu Asp Lys Thr Phe Arg Phe Leu Asn 

530 535 540 

Pro Lys Ala Gly Leu Thr Tyr Ser Phe Asp Asp Ala His Thr Ala Tyr 
545 550 555 560 

Ala Ser Val Ala Val Ala His Arg Glu Pro Asn Arg Thr Asn Tyr Thr 

565 570 575 

Glu Ala Gly He Gly Gin Tyr Pro Thr Pro Glu Arg Leu He Asp Tyr 

580 585 590 

Glu Leu Gly Tyr Arg Tyr Ala Ser Pro Leu Leu Ser Ala Gly Val Gly 

595 600 605 

Leu Tyr Tyr Met Gin Tyr Lys Asp Gin Leu Val Leu Asp Gly Arg Leu 

610 615 620 

Ser Asp Val Gly Gin Met Leu Thr Ser Asn Val Pro Asp Ser Tyr Arg 
625 630 635 640 

Met Gly Leu Glu Leu Thr Leu Gly Trp Gin He Leu Pro Arg Leu Leu 

645 650 655 

Arg Trp Asp Ala Ser Phe Thr Met Ser Arg Asn Lys He Asp Arg Tyr 

660 665 670 

Val Gin Tyr Thr Ser Val Tyr Asp Ala Asp Tyr Asn Trp Leu Glu Leu 
675 680 685 

75 Lys Glu Glu Thr Leu Glu Ser Thr Asp He Ala Tyr Ser Pro Asn Val 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



45 



50 



55 



60 



65 



70 



BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



274/ 490 





690 










695 










7 00 










lie 


Ala 


Gly 


Ser 


Met 


Leu 


Thr 


Leu 


Ser 


His 


Al a 


Gl y 


Phe 


Glu 


Met 


Al a 


705 








710 










715 










7 20 


Trp 


Th i' 


Ser 


Arg 


Phe 


Val 


Ser 


Lys 


Gin 




Leu 


Acn 


Asn 


Thr 


Gin 










725 










730 










735 




Ser 


Asp 


Arg 


Met 


Leu 


Ser 


Ser 


Tyr 


Trp 


Val 


Asn 


A^p 


Leu 


Arg 


Leu 


Gly 








740 










745 










750 






Tyr 


Val 


Leu 


Pro 


Val 


His 


Phe 


Val 


Lys 


Arg 


Val 


Ala 


Leu 


Gly Val 


Gin 






755 










760 










765 








Leu 


Asn 


Asn 


Leu 


Phe 


Asn 


Leu 


Met 


Tyr 


Ala 


Ser 


Asn 


Ala 


Tyr 


lie 


Tyr 




770 










775 










780 










A^p 


Ala 


Gly Tyr 


Val 


Gin 


Ala 


Ser 


Gly 


Glu 


Leu 


Ser 


Ala 


Tyr 


Ala 


Asp 


785 










790 










795 










800 


Leu 


Arg 


Tyr 


Tyr 


Pro 


Gin 


Ala 


Gly 


Phe 


Asn 


Ala 


Leu 


Gly 


Ser 


Leu 


Thr 










805 










810 










815 




He 


Asp 


Phe 





























20 



25 



(2) IKrORMATIOM FOR SEQ ID HO: 345 

(i) SEQUEMCE CHARACTERISTICS: 

(Al LENGTH: 532 amino acids 
(B) TYPE: amino acid 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

{lii) HYPOTHETICAL: YES 



30 (vi) ORIGINAL SOURCE: 

(A) ORGAMISM; Porphyromonas gingival is 

(ix) FEATURE: 

I A) MAI-IE/KEY: misc_feature 
35 (B) LOCATION 1...532 

txi) SEQUEIJCE DESCRIPTIOM: SEQ ID )50:34 5 

Tyr Arg Arg Ser Gly Arg Val Cys Pro Arg lie Leu Arg Asn Lys Arg 
40 1 5 10 15 

Ser Tyr Ala Asp Thr Arg Leu Gin Pro Arg Arg Ala Val Arg Leu Arg 

20 25 30 

Ala Gly Thr Arg Thr Lys Met Lys Arg Arg Phe Leu Ser Leu Leu Leu 
35 40 45 

45 Leu Tyr lie Leu Ser Ser lie Ser Leu Ser Ala Gin Arg Phe Pro Met 
50 55 60 

Val Gin Gly lie Glu Leu Asp Thr Asp Ser Leu Phe Ser Leu Pro Lys 
65 70 75 80 

Arg Pro Trp Arg Ala lie Gly Lys Thr He Gly Val Asn Leu Ala Val 
50 85 90 95 

Trp Gly Phe Asp His Phe lie Met Asn Glu Asp Phe Ala Asp He Ser 

100 105 110 

Trp Gin Thr He Lys Ser Asn Phe Gin Thr Gly Phe Gly Trp Asp Asn 
115 120 125 

55 Asp Lys Phe Val Thr Asn Leu Phe Ala His Pro Tyr His Gly Ser Leu 
130 135 140 

Tyr Phe Asn Ala Ala Arg Ser Asn Gly Leu Ser Phe Arg His Ser Ala 
145 150 155 160 

Pro Phe Ala Phe Phe Gly Ser Leu Met Trp Glu Leu Leu Met Glu Asn 
GO 165 170 175 

Glu Pro Pro Ser He Asn Asp Leu Cys Ala Thr Thr He Gly Gly He 

180 185 190 

Ala Leu Gly Glu Met Gly His Arg Leu Ser Asp Leu Leu He Asp Asn 
195 200 205 

65 Arg Thr Thr Gly Trp Glu Arg Met Gly Arg Glu Val Ala He Ala Leu 
210 215 220 

He Asn Pro Met Arg Phe Leu Asn Arg Leu Thr Ala Gly Glu Val Thr 
225 230 235 240 

Ser Val Gly Ser Arg Ser Gly Gin He Phe Gin Ser Val Pro He Asn 
70 245 250 255 

He Val Val Asp Ala Gly Phe Arg Phe Leu Ala Asp Lys Arg His Ala 

260 265 270 

Arg Thr Gly Ala Thr Ala Leu Thr Leu Asn Leu Arg Phe Asp Tyr Gly 
275 280 285 

75 Asp Pro Phe Arg Ser Glu Thr Phe Ser Pro Tyr Asp Phe Phe Gin Phe 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCtD: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



275/490 



45 



50 



55 



60 



65 



70 



75 





2 90 










295 










300 










Lys 


Ala 


Gly 


Leu 


Ser 


Phe 


Ser 


Glu 


Ser 


Gin 


Pro 


Leu 


Leu 


Ser 


Gin 


He 


305 










310 










315 










320 


Asn 


Leu 


He 


Gly 


He 


Leu 


Ser 


Gly 


Cys 


Gin 


Leu 


Leu 


Ala 


His 


Glu 


Arg 










325 










330 










335 




Thr 


Val 


Leu 


Val 


Gly 


Gly 


Leu 


Phe 


Gin 


His 


Phe 


Asp 


Tyr 


Tyr 


Asn 


Ser 








340 










345 










350 






Glu 


Lys 


Arg 


He 


Ser 


Lys 


Asn 


Ser 


Glu 


Glu 


Val 


Leu 


Val 


Thr 


Pro 


Tyr 






355 










360 










365 






Arg 


lie 


Ser 


Gin 


Val 


Ala 


Ala 


Leu 


Gly 


Gly 


Gly 


Leu 


He 


Phe 


Gin 


His 




37 0 










37 5 






380 










His 


Gly 


Lys 


Phe 


Arg 


Arg 


Arg 


Pro 


Leu 


Glu 


Leu 


Tyr 


Ala 


Glu 


Thr 


Tyr 


"5 O c: 
J □ D 










3 90 










395 










4 00 


Leu 


As n 


V a i 


V a J. 


Pro 


Met 


Gl y 


Ala 


Ser 


Leu 


Ser 


Asp 


His 


Tyr 


Asn 


Val 




















410 










415 




sp 


sn 


rg 


ASp 


Tyr 


Asn 


Le u 


Gly 


Ser 


Gly 


Leu 


Ser 


Gly 


Lys 


Leu 


Tyr 








4 20 










4 25 










4 30 




Leu 


Gly 


Ala 


Thr 


Tyr 


Asn 


Asp 


Leu 


Trp 


Ser 


Trp 


Leu 


Leu 


Gly 


Val 


Glu 






A 35 










440 










445 








Ser 


Tyr 


Arg 


Leu 


Tyr 


Thr 


Trp 


He 


Gly 


Tyr 


Glu 


Glu 


Pro 


His 


Gin 


Lys 




■150 










4 55 








460 








Asn 


Thr 


Asp 


Val 


Ser 


Ser 


Phe 


Met 


Val 


Gin 


Gly 


Asp 


Glu 


Ser 


Lys 


Ala 


465 










470 










475 






480 


Arg 


Leu 


Leu 


Val 


Thr 


Ser 


Ser 


Glu 


Phe 


Ala 


Phe 


His 


Pro 


Gly 


Pro 


Trp 










485 










4 90 










4 95 




His 


Val 


Ala 


He 


Val 


Ala 


Arg 


Arg 


Phe 


He 


Arg 


Lys 


Thr 


Ala 


Tyr 


Gin 








500 










505 










510 




Phe 


Tyr 


Pro 


Asn 


Val 


Ser 


Phe 


Asp 


Thr 


Gly 


Asp 


He 


Gin 


Leu 


Arg 


Val 






515 










520 










525 








Gly 


Phe 


His 


Phe 




























530 






























(2) 


INFORMATION 


FOR 


SEQ 


ID HO: 34 6 

















10 



15 



20 



25 



30 



35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAI4E/KEY: mi sc_f eature 

(B) LOCATION 1 . . . 300 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 6 



Lys 


Arg 


Arg 


Lys 


Lys 


Gin 


Met 


Lys 


Arg 


Leu 


He 


Val 


Phe 


Leu 


Ala 


Met 


1 








c 










10 










15 




Gly 


Gly 


Leu 


Leu 


Phe 


Thr 


Leu 


Ala 


Asn 


Ala 


Gin 


Glu 


Ala 


Asn 


Thr 


Ala 








20 










25 










30 






Ser 


Asp 


Thr 


Pro 


Lys 


Lys 


Asp 


Trp 


Thr 


He 


Lys 


Gly 


Val 


Thr 


Gly 


Leu 






35 










40 








45 






Asn 


Ala 


Ser 


Gin 


Thr 


Ser 


Leu 


Thr 


Asn 


Trp 


Ala 


Ala 


Gly 


Gly 


Glu 


Asn 




50 










55 








60 










Thr 


Val 


Ala 


Gly 


Asn 


Leu 


Tyr 


Leu 


A.sn 


He 


Asp 


Ala 


Asn 


Tyr 


Leu 


Lys 


65 










70 










75 










80 


Asp 


Lys 


Trp 


Ser 


Trp 


Asp 


Asn 


Gly 


Leu 


Arg 


Thr 


Asp 


Phe 


Gly 


Leu 


Thr 










85 










90 










95 




Tyr 


Thr 


Thr 


Ala 


Asn 


Lys 


Trp 


Asn 


Lys 


Ser 


Val 


Asp 


Lys 


He 


Glu 


Leu 








100 










105 










110 






Phe 


Thr 


Lys 


Ala 


Gly 


Tyr 


Glu 


He 


Gly 


Lys 


His 


Trp 


Tyr 


Gly 


Ser 


Ala 






115 










120 










125 






Leu 


Phe 


Thr 


Phe 


Leu 


Ser 


Gin 


Tyr 


Ala 


Lys 


Gly 


Tyr 


Glu 


Lys 


Pro 


Ser 




130 










135 










140 








Asp 


His 


Leu 


Thr 


Gly 


Val 


Lys 


His 


He 


Ser 


Asn 


Phe 


Phe 


Ala 


Pro 


Ala 


145 










150 










155 










160 


Tyr 


Leu 


Thr 


Leu 


Gly 


He 


Gly 


Ala 


Asp 


Tyr 


Lys 


Pro 


Asn 


Glu 


Lys 


Phe 










155 










170 










175 




Ser 


Leu 


Tyr 


Leu 


Ser 


Pro 


Thr 


Thr 


Gly 


Lys 


Leu 


Thr 


Val 


Val 


Ala 


Asp 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929e70A1_l_> 



wo 99/29870 



PCT/AU98/01023 



276/490 



25 



30 









180 










185 










190 






Asp 


Tyr 


Leu 


Ser 


Ser 


Leu 


Gly 


Ala 


Phe 


Gly 


Val 


Lys 


Val 


Gly 


Glu 


Lys 






195 










200 










205 








Thr 


Met 


Phe 


Glu 


Leu 


Gly 


Ala 


Leu 


Val 


Val 


Gly 


Ser 


Ala 


Asn 


He 


Asn 




210 










215 










220 










Leu 


Met 


Giu 


Asn 


Val 


Asn 


Leu 


He 


Thr 


Lys 


Ala 


Ser 


Phe 


Phe 


Ser 


Ala 


225 










230 










235 










240 


Tyr 


Thr 


His 


Asp 


Phe 


Gly 


Asn 


He 


Asp 


He 


Asn 


Trp 


Glu 


Ala 


Met 


Leu 










245 










250 










255 




Ala 


Met 


Lys 


He 


Asn 


Lys 


Phe 


Leu 


Thr 


Ala 


Thr 


He 


Ala 


Thr 


Asn 


Leu 








260 










265 










270 






He 


Tyr 


Asp 


Asp 


Asp 


Val 


L^-s 


He 


Asn 


Asp 


Gly 


Pro 


Lys 


He 


Gin 


Phe 






275 










280 










285 








Lys 


Glu 


Val 


Val 


Gly 


Val 


Gly 


Val 


Ala 


Tyr 


Thr 


Phe 












290 










255 










300 










(2) 


INFORMATION 


FOR 


SEQ 


ID MO: 34 7 

















10 



15 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LEMGTH: 221 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

tvH ORIGIUAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) HAl'IE/KEY: misc_feature 

(B) LOCATION 1 . . .221 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347 



40 



Thr 


Arg 


Glu 


Ser 


Val 


Leu 


His 


Cys 


Arg 


Thr 


Lys 


Leu 


Lys 


Lys 


Glu 


Arg 


1 








5 










10 










15 




Lys 


Met 


Lys 


Lys 


Met 


He 


Leu 


Ala 


Ala 


Thr 


Met 


Leu 


Leu 


Ala 


Thr 


He 








20 










25 










30 






Giy 


Phe 


Ala 


Asn 


Ala 


Gin 


Ser 


Arg 


Pro 


Ala 


Leu 


Arg 


Leu 


Asp 


Ala 


Asn 






35 










40 










45 








Phe 


Val 


Gly 


Ser 


Asn 


Leu 


Met 


Gin 


Lys 


Val 


Ala 


Asn 


Thr 


Ser 


Val 


Asn 




50 










55 










60 










Asn 


Lys 


Het 


He 


Val 


Gly 


Leu 


Arg 


Val 


Gly 


Ala 


Ala 


Ala 


Glu 


Phe 


Ala 


65 










70 










75 










80 


Leu 


Ser 


Asn 


Asp 


Gly 


Phe 


Tyr 


Leu 


Ala 


Pro 


Gly 


Leu 


Ala 


Tyr 


Thr 


Het 










85 










90 










95 




Arg 


Gly 


Ala 


Lys 


Met 


Glu 


Ssr 


Leu 


Ser 


Glu 


Thr 


Thr 


Thr 


Arg 


Leu 


His 








100 










105 










110 






Tyr 


Leu 


Gin 


He 


Pro 


Val 


Asn 


Ala 


Gly 


Het 


Arg 


Phe 


Ser 


Phe 


Ala 


Asp 






115 










120 










125 








Asn 


Het 


Ala 


He 


Ser 


Leu 


Glu 


Ala 


Gly 


Pro 


Tyr 


Phe 


Ala 


Tyr 


Gly 


Val 




130 










135 










140 










Ala 


Gly 


Thr 


He 


Lys 


Thr 


Lys 


Val 


Ala 


Gly 


Val 


Thr 


Ala 


Ser 


Val 


Asp 


145 










150 










155 










160 


Ala 


Phe 


Gly 


Asp 


Asn 


Gly 


Tyr 


Asn 


Arg 


Phe 


Asp 


Leu 


Gly 


Leu 


Gly 


Leu 










165 










170 










175 




Ser 


Ala 


Ala 


Leu 


Ser 


Tyr 


Asp 


Arg 


Tyr 


Tyr 


Val 


Gin 


He 


Gly 


Tyr 


Glu 








180 










185 










190 






His 


Gly 


Leu 


Leu 


Asn 


Met 


Leu 


Lys 


Asp 


Ala 


Pro 


Asp 


Lys 


Thr 


Ser 


Leu 






195 










200 










205 








Arg 


Asn 


His 


Asp 


Phe 


Phe 


Val 


Gly 


Leu 


Gly 


Val 


Arg 


Phe 










210 










215 










220 










(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 34 8 

















45 



50 



55 



60 



65 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 amino acids 
70 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

75 (iii) HYPOTHETICAL; YES 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929a70A1_L> 



wo 99/29870 



PCT/AU98/01023 



277/490 



50 



65 



70 



75 



(vi) ORIGINAL SOURCE: 

(A) ORGAWISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) HAI-1E/KEY: misc^f eat ure 

(B) LOCATIOri 1 ... 248 

(xi) SEQLfEMCE DESCRIPTIOM: SEQ ID IJO:348 



10 



15 



20 



25 



30 



35 



40 



45 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

iix) FEATURE: 

{A) HAJ^IE/KEY: misc_feature 
60 (B) LOCATION 1...211 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 9 



lie 


Lys 


Arg 


He 


Glu 


Met 


Lys 


Arg 


He 


Phe 


Thr 


Val 


Ala 


Leu 


Val 


Leu 


1 








5 










10 










15 




Leu 


Ala 


Ser 


Val 


Thr 


Met 


Ala 


He 


Gly Gin 


Ser 


Arg 


Pro 


Ala 


Leu 


Arg 








2 0 










25 










30 






Val 


Asp 


Ala 


Asn 


Phe 


Val 


Gly 


Ser 


Asn 


Gin 


Ser 


Met 


Lys 


Arg 


Asp 


Gly 






35 










40 










45 








Tyr 


Val 


Trp 


Asp 


Thr 


Lys 


Met 


Asn 


Val 


Gly 


Leu 


Arg 


Val 


Gly 


Ala 


Ala 




50 










55 










60 










Ala 


Glu 


Phe 


Met 


He 


Gly 


Ser 


Arg 


Gly 


Phe 


Tyr 


Leu 


Ala 


Pro 


Gly 


Leu 


65 










70 










75 










80 




Tyr 


Th r 


Me+' 


Jj^ s 


GT ^' 






Thr 


Glu 


Trp 


P 


He 




Glu 


t 










85 










90 










95 




Val 


Pro 


Gly 


Thr 


Tyr 


He 


Thr 


Met 


Val 


Ser 


Thr 


Arg 


Leu 


His 


Tyr 


Leu 








100 










105 










HO 






Gin 


Leu 


Pro 


li- 


Asn 


Ala 


Gly 


Met 


Arg 


Phe 


Asp 


Leu 


Met 


Asn 


Asp 


Met 






115 










120 










125 








Ala 


Val 


Ser 


ne 


Glu 


Ala 


Gly 


Pro 


Phe 


Leu 


Ala 


Tyr 


Gly 


He 


Tyr 


Gly 




130 










135 










140 










Thr 


Tyr 


Arg 


Gin 


Lys 


Leu 


Glu 


Gly 


Trp 


Lys 


Pro 


Asn 


Asn 


Tyr 


Ser 


Thr 


145 










150 










155 










160 


Glu 


Pile 


Phe 


Gly 


Pro 


Thr 


Leu 


Gly 


Gly 


Pro 


Thr 


Asn 


He 


Arg 


Trp 


Asp 










165 










170 










175 




He 


Gly 


Ala 


Asn 


lie 


He 


Ala 


Ala 


Phe 


His 


Tyr 


Lys 


Arg 


Tyr 


Tyr 


He 








180 










185 










190 






Gin 


He 


Gly 


Tyr 


Glu 


His 


Gly 


Phe 


Val 


Asp 


He 


Val 


Ser 


Gly 


Gly 


Gly 






195 










200 










205 








Ser 


Asp 


He 


Pro 


Arg 


Leu 


Asn 


Asp 


Asn 


Arg 


Gin 


Ser 


Ser 


Ser 


Thr 


Thr 




210 










215 










220 










Ala 


Leu 


Arg 


Glu 


Lys 


Gly 


Asn 


Asn 


Glu 


Tyr 


Ala 


Tyr 


Asn 


Arg 


Asp 


Phe 


225 










230 










235 










240 


Phe 


Val 


Gly 


He 


Gly 


Tyr 


Arg 


Phe 


























245 
























(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 34 9 

















Lys 


Arg 


Lys 


Ser 


Met Lys 


Arg 


Met 


Leu 


Leu 


Leu 


Leu 


Val 


Val 


Leu 


Leu 


1 
















10 










15 




Tyr 


Gly 


He 


Ala 
20 


Gly Arg 


Leu 


Ala 


Ala 
25 


Gin 


Asp 


Val 


He 


Arg 
30 


Pro 


Trp 


Ser 


Leu 


Gin 
35 


Val 


Gly Ala 


Gly 


Tyr 
40 


Ser 


Asp 


Thr 


Glu 


Asn 
45 


He 


Pro 


Gly 


Gly 


Phe 
50 


Thr 


Tyr 


Gly Phe 


Tyr 
55 


Leu 


Gly 


Lys 


Arg 


Met 

60 


Gly 


Ser 


Phe 


Leu 


Glu 


Val 


Gly 


Leu 


Ser Met 


Tyr 


Asn 


Ser 


Thr 


Arg 


Gin 


Thr 


Ala 


Asn 


Asn 


65 








70 










75 










80 


Ala 


Asp 


Ser 


Phe 


Ala Ser 
85 


Asn 


Glu 


Gly 


Asp 
90 


Gly 


Ser 


Phe 


Gin 


Val 
95 


Asn 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCtD: <WO 9929870A1J_> 



wo 99/29870 



PCT/AU98/01023 



278/490 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Met 


Ser 


Ser 


Pro 
100 


Asn 


Glu 


Lys 


Trp 


Ser 
1 05 


Phe 


Phe 


Asp 


Ala 


Gly 
110 


Ser 


Ala 


As n 


Cys 


Tyr 
115 


l^iet 


He 


Val 


Val 


Gly 
120 


Val 


Asn 


Pro 


Leu 


His 
125 


Leu 


Phe 


Trp 


Gin 


Asn 
130 


Ser 


Arg 


His 


Asn 


Leu 
135 


Phe 


Leu 


Ala 


Val 


Gin 
140 


Ala 


Gly 


Leu 


Ser 


A^n 


Lys 


His 


Asn 


He 


His 


Phe 


He 


Tyr 


Gly 


Asp 


Lys 


Gly 


Ala 


Lys 


Val 


1-35 










150 










155 










160 


Ser 


lie 


Tyr 


Thr 


Asn 


Ser 


Asn 


Thr 


Tyr 


He 


Gly Tyr 


Gly 


Ala 


Arg 


Val 










165 










170 










175 




Ala 


Tyr 


Glu 


Tyr 
180 


Gin 


He 


His 


Lys 


Asn 
185 


Val 


Gly 


Ala 


Gly 


Ala 
190 


Ala 


Val 


Met 


Tyr 


Asp 
195 


His 


Gly 


Asn 


Lys 


Met 
200 


Leu 


Thr 


Ala 


Met 


Ala 
205 


Thr 


Leu 


Ser 


Thr 


His 
210 


Phe 





























10 



15 



(2) lUFORMATION FOR SEQ ID HO:360 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 953 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (ii^ MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Porphyromonas gingivali£ 

(ix) FEATURE: 

(A) NAI-IE/KEY: mis cofeature 

(B) LOCATION 1. . . 953 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350 



He 


Arg 


Met 


Arg 


Val 


Ser 


Asp 


Leu 


Cys 


Ser 


Arg 


Leu 


Ser 


Trp 


Leu 


Leu 


1 








5 










10 










15 




Pro 


Val 


He 


Leu 


Val 


Gly 


Leu 


Leu 


Cys 


Ala 


Thr 


Leu 


Val 


Ala 


Ala 


Glu 








20 










25 










30 






Arg 


Pro 


Met 


Ala 


Gly 


Ala 


Val 


Gly 


Leu 


His 


His 


Arg 


Arg 


His 


Ala 


Ala 






35 










40 










45 








Leu 


Ser 


Asp 


Ser 


Thr 


Ala 


Lys 


Asp 


Thr 


Val 


Pro 


Leu 


Ala 


Lys 


Pro 


He 




50 










55 










60 










Pro 


Asp 


Ser 


Ala 


Phe 


Arg 


Asp 


Ser 


Leu 


Pro 


Ala 


Asp 


Ser 


Thr 


Gly 


Ser 


65 










70 










75 










80 


Met 


Arg 


Gin 


Asp 


Ser 


Val 


Tyr 


Asp 


Asp 


Glu 


Phe 


Glu 


Leu 


Glu 


Asp 


He 










85 










90 










95 




Val 


Glu 


Tyr 


Glu 


Ala 


Ala 


Asp 


Ser 


He 


Val 


Leu 


Leu 


Gly 


Gin 


Asn 


Arg 








100 










105 










110 






Ala 


Tyr 


Leu 


Phe 


Gly 


Lys 


Ser 


Tyr 


Val 


Ser 


Tyr 


Gin 


Lys 


Ser 


Arg 


Leu 






115 










120 










125 








Glu 


Ala 


Asn 


Phe 


Met 


Tyr 


Leu 


Asn 


Thr 


Asp 


Ser 


Ser 


Thr 


Val 


Tyr 


Thr 




130 










135 










140 










Arg 


Tyr 


Val 


Leu 


Asp 


Thr 


Ala 


Gly 


Tyr 


Pro 


Met 


Ala 


Phe 


Pro 


Val 


Phe 


145 










150 










155 










160 


Lys 


Asp 


Gly 


Glu 


Gin 


Ser 


Phe 


Glu 


Ala 


Lys 


Asn 


Phe 


Thr 


Tyr 


Asn 


Phe 










165 










170 










175 




Arg 


Thr 


Gl u 


Lys 


Gly 


He 


He 


Ser 


Gly 


Val 


He 


Thr 


Gin 


Gin 


Gly 


Glu 








180 










185 










190 






Gly 


Tyr 


Leu 


Thr 


Ala 


Gly 


Lys 


Thr 


Lys 


Lys 


Met 


Pro 


Asp 


Asn 


He 


Met 






195 










200 










205 








Phe 


Met 


Gin 


Gly 


Gly 


Arg 


Tyr 


Thr 


Thr 


Cys 


Asp 


Asn 


His 


Asp 


His 


Pro 




210 










215 










220 










His 


Phe 


Tyr 


He 


Asn 


Leu 


Ser 


Lys 


Ala 


Lys 


Val 


His 


Pro 


Glu 


Lys 


Asp 


225 










230 










235 










240 


He 


Val 


Thr 


Gly 


Pro 


Val 


Asn 


Leu 


Val 


He 


Ala 


Asp 


Met 


Pro 


Leu 


Pro 










245 










250 










255 




He 


Gly 


Leu 


Pro 


Phe 


Gly 


Tyr 


Phe 


Pro 


Phe 


Ser 


Asn 


Lys 


Tyr 


Ser 


Ser 








260 










265 










270 






Gly 


He 


Leu 


Met 


Pro 


Thr 


Tyr 


Gly 


Glu 


Asp 


Asn 


Arg 


Tyr 


Gly 


Phe 


Tyr 






275 










280 










285 








Leu 


Arg 


Asn 


Gly 


Gly 


Tyr 


Tyr 


Phe 


Ala 


Phe 


Ser 


Asp 


Tyr 


He 


Asp 


Leu 




290 










295 










300 











SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSCXX;ID- <WO 9929870A1_I_> 



wo 99/29870 



PCT/AU98/01023 



279/490 

Ala Leu Arg Gly Glu He Phe Ser Lys Gly Ser Trp Gly He Ser Ala 
305 310 315 320 

Gin Ser Lys Tyr Lys Lys Arg Tyr Lys Tyr Asn Gly Ser Phe Glu Ala 
325 330 335 

5 Asn Tyr Leu Val Ser Lys Ser Gly Asp Lys Tyr Val Pro Gly Asp Tyr 
340 345 350 

Ser Lys Thr Thr Ser Leu Asn He Arg Trp Thr His Ser Gin Asp Pro 

355 360 365 

Lys Ala Asn Pro Leu Gin Thr Leu Ser Ala Asn Val Asn Phe Ala Thr 
10 370 375 380 

Gly Ser Tyr Phe Gin Asn Ser Leu Asn Thr Thr Tyr Asp Val Asn Ala 
385 390 395 400 

Arg Thr Ala Thr Thr Arg Ser Ser Ala Val Ser Tyr Ser Arg Lys Phe 
405 410 415 

15 Pro Gly Thr Pro Phe Ser He Thr Gly Ser Met Asp He Ser Gin Asn 
420 425 430 

Met Arg Asp Thr Thr Val Ser Leu Thr Leu Pro Asn Leu Ser He Asn 

435 440 445 

Met Ser Thr Arg Tyr Pro Phe Lys Arg Lys Thr Arg Val Gly Pro Glu 
20 450 455 460 

Arg Trp Tyr Glu Lys Leu Ser Val Gly Tyr Ser Gly Gin Leu Arg Asn 
465 470 475 480 

Ser He Leu Thr Lys Glu Lys Asp Leu Leu Gin Ser Asn Leu Val Arg 
495 490 495 

25 Asp Trp Lys Asn Gly Ilet Arg His Ser Val Pro He Ser Leu Thr Val 
500 505 510 

Pro Leu Leu Asp Tyr He Asn Leu Thr IJet Gly Val Asn Tyr Asn Glu 

515 520 525 

Trp Trp Tyr Thr Lys Gly He Arg Lys Ser Trp Asn Glu Asp Lys Lys 
30 530 535 540 

Thr Phe Leu Pro Ser Asp Thr Thr Tyr Lys Phe Arg Arg Leu Tyr Asp 
S45 550 555 560 

Tyr Ser Leu Ser Ala Gly Leu Ser Thr Thr Leu Tyr Gly Met Phe Lys 
565 570 575 

o5 Pro Trp Lys Pro Phe Ser Phe Gly Gly Asn Leu He Met He Arg His 
580 585 590 

Arg Phe Thr Pro Thr Val Ser Phe Ser Tyr Ilet Pro Asp Phe Thr Lys 

595 600 605 

Arg Arg Tyr Gly Phe Trp Glu Leu Leu Glu His Thr Asp Gin Asn Gly 
40 610 615 620 

Lys Leu His Thr Leu Leu Tyr Ser Pro Tyr Phe Glu Gin He Phe Gly 
625 630 635 640 

Ala Pro Ser Met Gly Asn Ala Gly Ser Val Asn Phe Ser Phe Asp Asn 

645 . 650 655 

Asn Leu Glu Ala Lys He Lys Ser Lys Ser Asp Ser Thr Gly He Lys 

660 665 670 

Lys He Ser Leu He Asp Gin Phe Thr Trp Ser Tl^r Ser Tyr Asn Met 

675 680 685 

Phe Ala Asp Ser He Arg Trp Ser Asn He Ser Ala Ser Leu Ala Leu 
5U 690 695 700 

Arg Leu Ser Lys Ser Phe Thr Leu Arg Leu Ser Gly Leu Phe Asp Pro 
705 710 715 720 

Tyr Leu Thr Lys Tyr Tyr Glu Gly Glu Asp Gly Lys He He Pro Tyr 
725 730 735 

OO Lys Ser Asn Asp Leu Arg He Phe Asn Gly Lys Gly Leu Ala Arg Leu 
740 745 750 

He Ser Thr Gly Thr Ser Phe Ser Tyr Thr Leu Asn Lys Glu Ser Leu 

755 760 765 

Ser Gly Leu He Ala Leu Phe Ser Gly Lys Lys Glu Arg Arg Asp Glu 
OU 770 775 780 

Lys Lys Asn Thr Gly Ala Thr Pro His Glu Gly Asp Asp Ala Ala Asp 
785 790 795 800 

He Leu Glu Gly Gly Arg Pro Gin Asn Glu Ser Gly Gly Ser Leu Leu 

805 810 815 

Glu Arg Asn Arg Gin Gly Gly Ala Val Asp Gin Asp Gly Tyr Phe Ala 

820 825 830 

Tyr Ser He Pro Trp Ser Leu Ser Phe Asp Tyr Ser Trp Asn He Ala 

835 840 845 

Thr Asp Tyr Asn Arg Tyr Asn Val Asn Lys Met Glu His Tyr Tyr Arg 

850 855 860 

Val Thr Gin Asn Leu Ser Phe Arg Gly Asn He Gin Pro Thr Pro Asn 
86S 870 875 880 

Trp Ser Phe Gly Phe Asn Ala Asn Tyr Asn Phe Asp Leu Lys Lys He 

885 890 895 

Thr Ser Leu Thr Cys Asn Val Thr Arg Asp Met His Cys Trp Ala He 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



45 



65 



75 



BNSDCX;iD: <WO 9929870A1_L> 



wo 99/29870 PCT/AU98/01023 

280/490 



10 



15 



35 



40 



45 



50 



55 



60 



65 



70 



75 



900 905 910 

Ser Ala Ser Phe lie Pro lie Gly Ala Tyr Ly^ Ser Tyr Asn Phe Val 

S15 920 925 

lie Ser Val Lys Ser Ser Leu Leu Gin Asp Leu Lys Tyr Gin Gin Ser 

930 935 940 

Asn Arg Pro He Thr Asn Thr Trp Tyr 
945 950 

(2) 111 FORMAT I ON FOR SEQ ID MO: 351 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

{ix) FEATURE; 

(A) NAME/ KEY: mis cofeature 
25 (B) LOCATION 1...1251 

Ixi) SEQUENCE DESCRIPTION: SEQ ID NO; 351 



30 





Trp 


liV S 


T oi 1 

j-ieu 


Al a 


Cys 


f\i a 


fne 


Asp 


Cys 


Ala 


Cys 


Cys 


Phe 


Asp 


Pro 




















1 0 










1 5 




Phe 


Val 


Val 


Thr 




Glu 


Va 1 


He 


-L J. e 


rje L 


Met 


Lys 


Arg 


Tyr 


Thr 


He 








20 










2 5 










30 






lie 


Leu 


Ala 


Val 


Phe 


Leu 


Leu 


Phe 


Cys 


Thr 


Val 


Phe 


Thr 


Phe 


Gl n 


He 






35 










4 0 










A 5 








Lys 


Ala 


Arg 


Pro 


Tyr 


Glu 


Arg 


Phe 


Ala 


Asp 


Val 


Giu 


Lys 


Pro 


Trp 


He 




50 










55 










60 










Gin 


Lys 


His 


Ser 


Met 


Asp 


Ser 


Lys 


Leu 


Val 


Pro 


Ala 


Asn 


Lys 


Gly 


Asn 


65 










70 










75 










80 


Leu 


He 


Gin 


Ala 


Glu 


He 


Val 


Tyr 


Gin 


Ser 


Val 


Ser 


Glu 


His 


Ser 


Asp 










85 










90 










95 




Leu 


Val 


He 


Ser 


Pro 


Val 


Asn 


Glu 


He 


Arg 


Pro 


Ala 


Asn 


Arg 


Phe 


Pro 








100 










105 










110 






Ser 


His 


Arg 


Lys 


Ser 


Phe 


Phe 


Ala 


Glu 


Asn 


Leu 


Arg 


Ala 


Ser 


Pro 


Pro 






115 










120 










125 








Val 


Val 


Pro 


Val 


Ala 


Val 


Asp 


Lys 


Tyr 


Ala 


Val 


Pro 


Val 


Ala 


Asn 


Pro 




130 










135 










140 










Met 


Asp 


Pro 


Glu 


Asn 


Pro 


Asn 


Ala 


Trp 


Asp 


Va 1 


Thr 


Leu 


Lys 


He 


Thr 


1^5 










150 










155 










160 


Thr 


Lys 


Ala 


Val 


Thr 


Val 


Pro 


Val 


Asp 


Val 


Val 


Met 


Val 


He 


Asp 


Gin 










165 










170 










175 




Ser 


Ser 


Ser 


Met 


Gly 


Gly 


Gin 


Asn 


He 


Ala 


Arg 


Leu 


Lys 


Ser 


Ala 


He 








180 










185 










190 






Ala 


Ser 


Gly 


Gin 


Arg 


Phe 


Val 


Lys 


Lys 


Met 


Leu 


Pro 


Lys 


Gly 


Thr 


Ala 






195 










200 










205 








Thr 


Giu 


Gly 


Val 


Arg 


He 


Ala 


Leu 


Val 


Ser 


Tyr 


Asp 


His 


Glu 


Pro 


His 




210 










215 








220 










Arg 


Leu 


Ser 


Asp 


Phe 


Thr 


Lys 


Asp 


Thr 


Ala 


Phe 


Leu 


Cys 


Gin 


Lys 


He 


225 










230 










235 








240 


Arg 


Ala 


Leu 


Thr 


Pro 


He 


Trp 


Gly 


Thr 


His 


Thr 


Gin 


Gly 


Gly 


Leu 


Lys 










245 










250 










255 




Met 


Ala 


Arg 


Asn 


He 


Met 


Ala 


Thr 


Ser 


Thr 


Ala 


Val 


Asp 


Lys 


His 


He 








260 










265 










270 






He 


Leu 


Met 


Ser 


Asp 


Gly 


Leu 


Ala 


Thr 


Glu 


Gin 


Tyr 


Pro 


Val 


Lys 


Asn 






275 










280 










285 








Val 


Thr 


Thr 


Ala 


Asp 


Phe 


He 


Gly 


Lys 


Thr 


Gly 


Asn 


Ala 


Asn 


Asp 


Pro 




290 










295 










300 










He 


Asp 


Leu 


Val 


He 


Gin 


Gly 


Ala 


He 


Asn 


Phe 


Pro 


Thr 


Asn 


Tyr 


Val 


305 










310 










315 








320 


Ser 


Asn 


Asn 


Pro 


Ser 


Thr 


Pro 


Leu 


Thr 


Pro 


Asn 


Tyr 


Pro 


Thr 


His 


Ser 










325 










330 










335 




Ser 


Lys 


Val 


Gly 


Arg 


Arg 


Asn 


Leu 


Pro 


Glu 


Ser 


Lys 


Phe 


Asp 


Tyr 


Ser 








340 










345 










350 




Asn 


Leu 


Ser 


Ala 


Arg 


He 


Thr 


Phe 


Asp 


Gly 


Val 


Ala 


Gly 


Ala 


Leu 


Val 






355 










360 










366 








Tyr 


Glu 


Pro 


Arg 


Phe 


Pro 


His 


Pro 


Tyr 


Tyr 


Tyr 


Tyr 


Phe 


Pro 


Cys 


Asn 
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370 375 380 

Ala Ala lie Asn Glu Ala Gin Phe Ala Lys Asn Ser Gly Tyr Thr He 
385 390 395 400 

His Thr He Gly Tyr Asp Leu Gly Asp Phe Ala Leu Ala Asn Asn Ser 
5 405 410 415 

Leu Lys Leu Thr Ala Thr Asp Glu Asn His Phe Phe Thr Ala Thr Pro 

420 425 430 

Ala Asn Leu Ala Ala Ala Phe Asp Asn He Ala Gin Thr He Asn He 
435 440 445 

10 Gly He Gin Arg Gly Glu Val Thr Asp Phe Val Ala Pro Gly Phe He 
4 50 4 55 4 60 

Val Lys Asn Leu Thr Gin Ser Gly Asp Val Thr His Leu Leu Asn Val 
465 470 475 480 

Ser Asn Gly Thr Val His Tyr Asp Val Ser Thr Lys Lys Leu Thr Trp 
15 485 4 90 4 95 

Thr Thr Gly Thr He Leu Ser Ser Ser Glu Ala Thr He Thr Tyr Arg 

500 505 510 

He Tyr Ala Asp Leu Asp Tyr He Gin Asn Asn Asp He Pro Val Asn 
515 520 525 

20 Thr Thr Ser Ala He Gly Pro Asp Leu Gly Gly Phe Asp Thr Asn Thr 
530 535 540 

Glu Ala Lys Leu Thr Tyr Thr Asn Ser Asn Gly Glu Pro Asn Gin Gin 
545 550 555 560 

Leu He Phe Pro Arg Pro Thr Val Lys Leu Gly Tyr Gly Val He Lys 
25 565 570 575 

Arg His Tyr Val Leu Val Asn Lys Asp Gly Gin Pro He Gin Ala Asn 

SSO 585 590 

G]y Thr Val Val Ser Ser Leu Ser Glu Ala His Val Leu Gin Ser Gin 

595 600 605 

Asp Phe Phe Leu Pro Ser Gly Gly Gly His He Val Pro Lys Trp He 

610 615 620 

Lys Leu Asp Lys Thr Thr Glu Ala Leu Gin Tyr Tyc Ser Val Pro Pro 
625 630 635 640 

Thr Asn Thr Val He Thr Thr Ala Asp Gly Lys Arg Tyr Arg Phe Val 

645 650 555 

Glu Val Pro Gly Ser Thr Pro Asn Pro Gly Gin He Gly He Ser Trp 

660 665 670 

Lys Lys Pro Ala Gly Asn Ala Tyr Phe Ala Tyr Lys Leu Leu Asn Tyr 

675 680 685 

Trp Met Gly Gly Thr Thr Asp Gin Gin Ser Glu Trp Asp Val Thr Ser 

690 695 700 

Asn Trp Thr Gly Ala Gin Val Pro Leu Thr Gly Glu Asp Val Glu Phe 
705 710 715 720 

Ala Thr Thr Glu Asn Phe Gly Ser Pro Ala Val Ala Asp Leu His Val 
45 725 730 735 

Pro Thr Thr Asn Pro Lys He He Gly Asn Leu He Asn Asn Ser Asp 

740 745 750 

Lys Asp Leu Val Val Thr Thr Ser Ser Gin Leu Thr He Asn Gly Val 
755 760 765 

50 Val Glu Asp Asn Asn Pro Asn Val Gly Thr He Val Val Lys Ser Ser 
770 775 780 

Lys Asp Asn Pro Thr Gly Thr Leu Leu Phe Ala Asn Pro Gly Tyr Asn 
785 790 795 800 

Gin Asn Val Gly Gly Thr Val Glu Phe Tyr Asn Gin Gly Tyr Asp Cys 
55 805 810 815 

Ala Asp Cys Gly i-Iet Tyr Arg Arg Ser Trp Gin Tyr Phe Gly He Pro 

820 825 830 

Val Asn Glu Ser Gly Phe Pro He Asn Asp Val Gly Gly Asn Glu Thr 
835 840 845 

60 Val Asn Gin Trp Val Glu Pro Phe Asn Gly Asp Lys Trp Arg Pro Ala 
850 855 860 

Pro Tyr Ala Pro Asp Thr Glu Leu Gin Lys Phe Lys Gly Tyr Gin He 
865 870 875 880 

Thr Asn Asp Val Gin Ala Gin Pro Thr Gly Val Tyr Ser Phe Lys Gly 
d5 885 890 895 

Met He Cys Val Cys Asp Ala Phe Leu Asn Leu Thr Arg Thr Ser Gly 

900 905 910 

Val Asn Tyr Ser Gly Ala Asn Leu He Gly Asn Ser Tyr Thr Gly Ala 

915 920 925 

He Asp He Lys Gin Gly He Val Phe Pro Pro Glu Val Glu Gin Thr 

930 935 94 0 

Val Tyr Leu Phe Asn Thr Gly Thr Arg Asp Gin Trp Arg Lys Leu Asn 
945 950 955 960 

Gly Ser Thr Val Ser Gly Tyr Arg Ala Gly Gin Tyr Leu Ser Val Pro 
965 970 975 
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35 



40 



70 
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Lys Asn Thr Ala Gly Gin Asp Asn Leu Pro Asp Arg lie Pro Ser Met 

980 985 990 

His Ser Phe Leu Val Lys Met Gin Asn Gly Ala Ser Cys Thr Leu Unk 
995 1000 1005 

5 lie Leu Tyr Asp Lys Leu Leu Lys Asn Thr Thr Val Asn Asn Gly Asn 
1010 1015 1020 

Gly Thr Gin lie Thr Trp Arg Ser Gly Asn Ser Gly Ser Ala Asn Met 
1025 1030 1035 1040 

Pro Ser Leu Val Met Asp Val Leu Gly Asn Glu Ser Ala Asp Arq Leu 
10 1045 1050 1055 

Trp He Phe Thr Asp Gly Gly Leu Ser Phe Gly Phe Asp Asn Gly Trp 

1060 1065 1070 

Asp Gly Arg Lys Leu Thr Glu Lys Gly Leu Ser Gin Leu Tyr Ala Met 
1075 1080 1085 

15 Ser Asp He Gly Asn Asp Lys Phe Gin Val Ala Gly Val Pro Glu Leu 
1090 1095 1100 

Asn Asn Leu Leu He Gly Phe Asp Ala Asp Lys Asp Gly Gin Tyr Thr 
1105 1110 1115 1120 

Leu Glu Phe Ala Leu Ser Asp His Phe Ala Lys Gly Ala Val Tyr Leu 
20 1125 1130 1135 

His Asp Leu Gin Ser Gly Ala Lys His Arg lie Thr Asn Ser Thr Ser 

11-50 1145 1150 

Tyr Ser Fhe Asp Ala Lys Arg Gly Asp Ser Gly Ala Arg Phe Arg Leu 
1155 1160 1165 

25 Ser Tyr Gly Cys Asp Glu Asn Val Asp Asp Ser His Val Val Ser Thr 
1170 1175 1180 

Asn Gly Arq Glu He He He Leu Asn Gin Asp Ala Leu Asp Cys Thr 
1185 1190 1195 1200 

Val Thr Leu Phe Thr He Glu Gly Lys Leu Leu Arg Arg Leu Lys Val 
30 1205 1210 1215 

Leu Ala Gly His Arg Glu Val Met Lys Val Gin Thr Gly Gly Ala Tyr 

1220 1225 1230 

He Val His Leu Gin Asn Ala Phe Thr Asn Asp Val His Lys Val Leu 
1235 1240 1245 

35 Val Glu Tyr 
1250 

{2) INFORMATION FOR SEQ ID HO: 352 

40 (i) SEQUEHCE CHARACTERISTICS: 

(A) LEHGTH: 4 26 amino acids 

(B) TYPE: amino acid 
(D} TOPOLOGY: linear 

45 {ii) MOLECULE TYPE: protein 

(ill ) HYPOTHETICAL: YES 

(vi) ORIGIHAL SOURCE: 
50 (A) ORGAHISM: Porphyromonas gingival is 

(i>:) FEATURE: 

(A) NAT-IE/KEY: misc_feature 

( B) LOCATION 1 . . , 426 

55 



60 



65 



70 



75 





(xi ) 


SEQUEHCE DESCRIPTION: SEQ 


ID NO: 352 










Thr 


Met 


Glu 


Val 


Lys 


Lys 


Asn 


Thr 


Val 


Val 


Leu 


Arg 


Leu 


Leu 


He 


Trp 


1 








c 










10 










15 




Phe 


Val 


Ala 


He 


Leu 


Leu 


Phe 


His 


Ser 


Ser 


Arg 


Leu 


Trp 


Gly 


Gin 


Glu 








20 










25 










30 






Gly 


Glu 


Gly 


Ser 


Ala 


Arg 


Tyr 


Arg 


Phe 


Lys 


Gly 


Phe 


Val 


Asp 


Thr 


Tyr 






35 










40 










45 








His 


Ala 


Val 


Arg 


Ser 


Ser 


Ser 


Pro 


Phe 


Asp 


Phe 


I'let 


Ser 


Ser 


Arg 


Thr 




50 










55 










60 








Arg 


Val 


Arg 


Gly 


Glu 


Leu 


Glu 


Arg 


Ser 


Phe 


Gly 


Asn 


Ser 


Lys 


Val 


Ala 


65 










7 0 










75 










80 


Val 


Ser 


Val 


Asn 


Ala 


Thr 


Tyr 


Asn 


Ala 


Leu 


Leu 


Lys 


Asp 


Glu 


Thr 


Gly 










85 










90 








95 


Leu 


Arg 


Leu 


Arg 


Glu 


Ala 


Phe 


Phe 


Glu 


His 


Gin 


Glu 


Glu 


His 


Trp 


Gly 








100 










105 










110 






Leu 


Arg 


Leu 


Gly 


Arg 


Gin 


He 


Val 


lis 


Trp 


Gly 


Ala 


Ala 


Asp 


Gly Val 






115 










120 










125 








Arg 


He 


Thr 


Asp 


Leu 


He 


Ser 


Pro 


Met 


Asp 


Met 


Thr 


Glu 


Phe 


Leu 


Ala 




130 










135 








140 
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Gin Asp Tyr Asp Asp He Arg Met Pro Val Asn Ala Leu Arg Phe Ser 
1^5 150 155 160 

Val Phe Asn Glu Ser Met Lys Val Glu Val Val Val Leu Pro Val Phe 
165 170 175 

5 Glu Gly Tyr Arg Leu Pro Val Asp Pro Arg Asn Pro Trp Asn He Phe 
180 185 190 

Ser Leu Ser Pro He Ala Gin Gly Met Asn He Val Trp Lys Glu Glu 

195 200 205 

Ala Gly Lys Pro Ala Phe Lys Val Ala Asn He Glu Tyr Gly Ala Arg 
10 210 215 220 

Trp Ser Thr Thr Leu Ser Gly He Asp Phe Ala Leu Ala Ala Leu Hi^ 
225 230 235 240 

Thr Trp Asn Lys Met Pro Val He Glu Val Gin Gly He Val Pro Thr 
245 250 255 

15 Glu He He Val Ser Pro Arg Tyr Tyr Arg Met Gly Phe Val Gly Gly 
260 265 270 

Asp Leu Ser Val Pro Val Gly Gin Phe Val Phe Arg Gly Glu Ala Ala 

275 280 285 

Phe Asn He Asp Lys His Phe Thr Tyr Lys Ser His Ala Glu Gin Glu 
20 290 295 300 

Gly The Gin Thr He Asn Trp Leu Ala Gly Ala Asp Trp Tyr Ala Pro 
305 310 315 320 

Gly Glu Trp Met He Ser Gly Gin Phe Ser Met Glu Ser He Phe Arg 
325 330 335 

25 Tyr Arg Asp Phe He Ser Gin Arg Gin His Ser Thr Leu He Thr Leu 
340 " 345 350 

Asn Val Ser Lys Lys Phe Phe Gly Ser Thr Leu Gin Leu Ser Asp Phe 

355 360 365 

Thr Tyr Tyr Asp Leu Thr Gly Lys Gly Trp Phe Ser Arg Phe Ala Ala 
30 370 375 380 

Asp Tyr Ala Leu Asn Asp Gin He His Leu Met Ala Gly Tyr Asp Trp 
385 390 395 400 

Phe Ser Ser Lys Gly Ser Gly He Phe Asp Arg Tyr Lys Asp Asn Ser 
405 410 415 

35 Glu Leu Trp Phe Lys Ala Arg Tyr Ser Phe 
420 425 

(2) INFORMATION FOR SEQ ID MO: 353 

40 (i) SEQUEl'iCE CHARACTERISTICS: 

(A) LENGTH: 4 64 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
50 (A) ORGAN I SH: Porphyromon as gingival is 

(iK) FEATURE: 

(A) NAT'IE/KEY: mis cofeature 
{B) LOCATION 1 ... 464 



55 



60 



65 



70 



(Mi) SEQUENCE DESCRIPTION: SEQ ID NO: 353 



Tyr 


•31 y 


Lys 


Arg 


Arg 


Lys 


Leu 


Gly 


Thr 


Ser 


Val 


Arg 


Fro 


Ser 


Val 


1 


















10 










15 


Thr 


Gin 


He 


Arg 


Phe 


He 


Leu 


Asp 


Leu 


His 


Leu 


He 


Thr 


Asp 


Phe 








20 










25 










30 




Glu 


Gly 


Leu 


Arg 


Val 


Asn 


Pro 


He 


Gly 


Ala 


Ala 


Ala 


He 


Val 


Ala 






35 










40 










45 






He 


He 


Asp 


Leu 


Leu 


Leu 


Leu 


Cys 


Cys 


Ser 


Ala 


Phe 


Met 


Ser 


Ser 




50 










55 










60 








Glu 


Val 


Ala 


Tyr 


Phe 


Ser 


Leu 


Lys 


Pro 


He 


Asp 


Leu 


Gin 


Asn 


He 


65 










70 










75 










Glu 


Arg 


Asn 


His 


Ser 


Ser 


Asp 


He 


Ala 


Leu 


Ser 


Asn 


Leu 


Leu 


Asp 










85 










90 










95 


Ser 


Asn 


Gin 


Leu 


Leu 


Ala 


Thr 


He 


Leu 


He 


Gly 


Asn 


Asn 


Val 


He 








100 










105 










110 




Val 


Ala 


He 


Val 


He 


Leu 


Ser 


Asn 


Tyr 


Ala 


He 


Glu 


Gin 


Thr 


Phe 






115 










120 










125 






Phe 


Ser 


Ser 


Pro 


He 


He 


Gly 


Phe 


Leu 


He 


Gin 


Thr 


He 


Leu 


Leu 



80 



75 130 13 



140 
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10 



15 



20 



25 



30 



35 



40 



50 



55 



60 



65 



70 



75 



Th f 


Va 1 


Leu 


L*^u 


Leu 


Phe 


Gly 


Glu 


He 


Leu 


Pro 


L' ''s 


Val 


Tyr 


f\± a 


Arg 


± 'i O 










X ;j u 










XOO 










X ou 


L-i S 


n o r-i 


P jto 


eu 


Gl 11 


Tyr 


Ser 


Arg 


rne 


Ser 




MX a 


MX a 


Hot- 
lie t 


Ser 


vai 










X oo 










X / u 










17 5 




± J. € 


Tyr 


L,ys 


lie 


Leu 


oer 


Pro 


rJie 


Ser 


Lys 


Leu 


Tail 

Xieu 


va X 


Lys 


Ser 


Thr 








18 0 










185 










190 






Gl y 


He 


Vd 1 


Thr 


Arg 


Gly 


He 


Ser 


Lys 


Lys 


Lys 


Tyr 


Asp 


Met 


Ser 


Val 






1 95 










2 00 










205 








Asp 


Glu 


Leu 


Se r 


Lys 


Ala 


Val 


Ala 


Leu 


Thr 


Thr 


Thr 


Glu 


Gly 


Glu 


Pro 




210 




















220 












Gl u 


Lys 


Glu 


rie I, 


X X e 


Asn 


Glu 


He 


He 


Lys 


Phe 


Tyr 


As n 


Lys 


Thr 


C> 9 c: 
Z O 






























240 




Cys 


Glu 


T 1 *s 


Met 


va X 


Pro 


Arg 


He 


Asp 


He 


V a X 


Asp 


Va 1 


Asp 


Leu 




















250 










255 




SeiT 


Trp 


P ITO 


r n s 


Arg 


Xjys 


M e t 


Leu 


Asp 


Phe 


\r~ 1 
V d X 


Val 


Ser 


Ser 


Gl y 


Tyr 








2 60 




















27 0 




Ssir 




Leu 


Pro 


Val 


Ser 


Glu 


tjx y 


oe L 


Glu 


Asp 


Asn 


X X e 


Lys 


Gly 


Va 1 






775 










280 










285 








He 


Tyr 


H *^ 


L^''s 


sp 


eu 


T 1 cj 

X X e 


Pro 


nx s 


Met 


Asp 


Lys 


Gly 


Asp 


Gl u 


Phe 




2 90 




















300 












i LP 


His 


ro 


eu 


He 


Arg 


Lys 


Al a 


Tyr 


Ph"^ 


V a X 


Pro 


UXU 


Asn 


T - -e- 

XjyS 


r* R 




















316 










320 


Aig 


1 1 


Asp 




Leu 


T oil 


uX U 


Glu 


Ph e 


Arg 


Ala 


Asn 


Lys 


Val 


His 


Val 










"5 "? c. 
O o 










330 










335 






He 


Va 1 


Va 1 


Asp 


Glu 


PK (a 

r ne 


ux y 


Gi V 


1 nr 


Cys 


Giy 


T Di ^ 

Lteu 


Tin 

X X e 


Thr 


H e t 








J 4 u 










34 5 










350 






L, U 




T 1 o 

IX- 


L'^^-u 


G U 


Glu 


X X e 


va X 


Gly 


Glu 


He 


Thr 


Asp 


Glu 


Tyr 


Asp 






355 










360 


















Glu 


Glu 


Glu 


Leu 


Pro 


Phe 


Ly^ 


Val 


Leu 


Gly 


Asp 


Gly 


Ser 


Tyr 


Leu 


Phe 




37 0 










375 










380 










Glu 


Gly 


Lys 


Thr 


Ser 


Leu 


Ser 


Asp 


Val 


Arg 


His 


Tyr 


Leu 


Asp 


Leu 


Pro 


385 










390 








395 










400 


Glu 


Asn 


Ala 


Phe 


Gly 


Glu 


Leu 


Gly 


Asp 


Glu 


Val 


Asp 


Thr 


Leu 


Ser 


Gly 










405 










410 










415 




Leu 


Phe 


Leu 


Glu 


He 


Lys 


Gin 


Glu 


Leu 


Pro 


His 


Val 


Gly 


Asp 


Thr 


Ala 








420 










425 










430 






Val 


Tyr 


Glu 


Pro 


Phe 


Arg 


Phe 


Gin 


Val 


Thr 


Gin 


Met 


Asp 


Lys 


Arg 


Arg 






435 










440 










445 








He 


He 


Glu 


He 


Lys 


He 


Phe 


Pro 


Phe 


Glu 


Arg 


Thr 


Trp 


Glu 


Val 


Glu 




450 










455 










460 











(2) INFORMATION FOR SEQ ID HO: 354 



(i) SEQUEMCE CHARACTERISTICS: 
45 (A) LENGTH: 266 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(il) MOLECULE TYPE: protein 

(iii ) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) nAl'lE/KEY: misc^f eature 

(B) LOCATION 1 ... 266 





(xi) 


SEQUENCE DESCRIPTION: SEQ 


ID NO: 354 










He 


He 


Tyr 


Arg 


Ser 


Thr 


Met 


Lys 


Leu 


Leu 


Leu 


Tyr 


Leu 


Leu 


Leu 


Val 


1 








5 










10 










15 




Leu 


Ser 


Thr 


Leu 


Ser 


Pro 


Met 


Tyr 


Ser 


Gin 


Met 


Leu 


Phe 


Ser 


Glu 


Asn 








20 










25 










30 






Leu 


Thr 


Met 


Asn 


He 


Asp 


Ser 


Thr 


Lys 


Thr 


He 


Gin 


Gly 


Thr 


He 


Leu 






35 










40 










45 








Pro 


Val 


Leu 


Asp 


Phe 


Lys 


Thr 


Glu 


Lys 


Glu 


Asn 


Val 


Phe 


Thr 


Phe 


Lys 




50 










55 










60 








Asn 


Thr 


Ala 


Asn 


Leu 


Asn 


Leu 


Leu 


He 


Lys 


His 


Gly 


Gin 


Val 


He 


Asn 


65 










70 








75 








80 


Leu 


He 


Asn 


Lys 


Leu 


Glu 


Phe 


Ser 


Thr 


Tyr 


Gly 


Asn 


Lys 


Val 


Thr 


Val 










B5 










90 








95 




Ser 


Gly Gly 


Tyr 


Val 


His 


Thr 


Glu 


Tyr 


Arg 


Tyr 


Leu 


Leu 


His 


His 


Val 








100 
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Phe 


Glu 


Val 
115 


Tyr 


Pro 


Tyr 


Val 


Glu 
120 


Ser 


Gin 


Trp 


Ala 


Glu 
125 


Ser 


Arg 


Gly 




Met 


Lys 
130 


Tyr 


Lys 


Val 


Ser 


Thr 
135 


Gly 


Leu 


Gin 


Ser 


Arg 
140 


Tyr 


Arg 


Leu 


Val 


5 


A^n 
14S 


Ser 


Asp 


Asn 


Cys 


Leu 
150 


Met 


Phe 


Ala 


Thr 


Leu 
155 


Gly 


Val 


Phe 


Phe 


Glu 
160 




Phe 


Glu 


Lys 


Trp 


Glu 

165 


Gin 


Pro 


Ala 


Thr 


Ser 
170 


Leu 


Phe 


Ala 


Gly 


Thr 
175 


Tyr 


10 


Aia 


Tyr 


Ser 


Arg 


Ser 


He 


Lys 


Ser 


His 


Leu 


Ser 


He 


Ser 


Phe 


Arg 


His 








1 80 










185 










190 






Arg 


Leu 


Gly 
195 


Glu 


His 


Trp 


Glu 


Phe 
200 


Thr 


Thr 


Thr 


Ala 


He 
205 


His 


Gin 


Gly 




Ly^ 


Fro 


Asp 


Ser 


Tyr 


Phe 


Lys 


Lys 


Ala 


Arg 


Phe 


Gly 


Gly 


Ala 


He 


Asp 


15 




210 










215 










220 










L&u 
225 


Lys 


Tyr 


His 


He 


Thr 
230 


Pro 


Thr 


He 


Gly 


He 
235 


Arg 


Gly 


Ala 


Tyr 


Arg 
240 




lie 


lie 


Tyr 


Asp 


Thr 
245 


Ala 


Pro 


He 


Val 


Pro 
250 


Val 


Arg 


Lys 


Asp 


Tyr 
255 


Asn 


20 


Thr 


Val 


Asp 


Val 


Gly 


He 


Asp 


He 


Ser 


Phe 




















2 60 










265 

















(2) iriFORUATION FOR SEQ ID MO: 355 

(i) SEQUEIICE CHARACTERISTICS: 
25 ^A) LENGTH: 907 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGV: linear 
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(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAt-IISM: Porphyromonas gingivalis 

(i:-.) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATIOIJ 1 , . . 907 



40 (>:i) SEQUEHCE DESCRIPTION: SEQ ID HO:355 
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Thr 


He 


Cys 


Val 


Ala 


Phe 


Leu 


Ser 


Ala 


Pro 


Val 


Ala 


Ala 


Leu 


Arg 


Glu 


1 








5 










10 










15 




Ser 


Pro 


Pro 


Met 
20 


Gly 


Ala 


Glu 


Arg 


Lys 
25 


Thr 


Pro 


Ser 


Leu 


Leu 
30 


Pro 


Leu 


Leu 


Phe 


Glu 
35 


His 


Ser 


Asp 


Lys 


Gly 
^0 


Lys 


Gly 


Phe 


Asp 


Tyr 
45 


Arg 


Leu 


Phe 


Thr 


Ser 


Asn 


Lys 


Leu 


Lys 


Val 


Phe 


Ala 


Thr 


Gly 


Asn 


Ser 


Arg 


Tyr 


He 
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Lys 
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He 


He 
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Ala 
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Lys 
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He 
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Ser 


65 










70 
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80 


Ser 


Phe 


Leu 


Phe 


Val 
85 


Leu 


Ser 


He 


Leu 


Ser 
90 


Leu 


Met 


Ala 


Gin 


Asn 
95 


Asn 


Thr 


Leu 


Asp 


Val 
100 


His 


He 


Ser 


Gly 


Thr 
105 


He 


Lys 


Asp 


Ala 


Ser 
110 


Ser 


Gly 


Glu 


Pro 


Val 
115 


Pro 


Tyr 


Ala 


Thr 


Val 
120 


Ser 


He 


Arg 


Leu 


Thr 
125 


Gly 


Ala 


Asp 


Thr 


Thr 


Gin 


Val 


Phe 


Arg 


Gin 


Val 


Thr 


Asp 


Gly 


Asn 


Gly 


Tyr 


Phe 


Val 




130 










135 








lAO 










He 


Gly 


Leu 


Pro 


Ala 


Ala 


Pro 


Ser 


Tyr 
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Leu 


Thr 


Ala 


Ser 


Phe 


Val 


145 










150 










155 










160 


Gly 


Met 


Lys 


Thr 


His 
165 


Thr 


i-Ie t 


Gin 


He 


Ser 
170 


Arg 


Gly 


Asn 


Gly 


Gin 
175 


His 


Asp 


He 


Lys 


Ser 


He 


Asp 


He 


Ser 


Leu 


Glu 


Ser 


Glu 


Asp 


Lys 


Gin 


Leu 








180 










185 








190 






Ser 


Thr 


Val 
195 


Thr 


Val 


Ser 


Ala 


Ala 
200 


Arg 


Pro 


Leu 


Val 


Lys 
205 


Met 


Glu 


He 


Asp 


Arg 
210 


Leu 


Ser 


Tyr 


Asn 


Met 
215 


Lys 


Asp 


Asp 


Pro 


Ala 

220 


Aia 


Lys 


Thr 


As n 


Asn 


Leu 


Leu 


Glu 


Met 


Leu 


Arg 


Asn 


Val 


Pro 


Leu 


Val 


Thr 


Val 


Asp 


Gly 


225 










230 










235 










240 


Gin 


Gly 


Asn 


He 


Gin 
245 


Val 


Lys 


Gly 


Ser 


Ser 
250 


Asn 


Phe 


Lys 


He 


His 
255 


Leu 


Asn 


Gly 


Arg 


Pro 


Ser 


Thr 


Met 


Val 


Ser 


Ser 


Asn 


Pro 


Lys 


Glu 


Val 


Phe 
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Arg 


Ser 


He 


Pro 


Ala 


His 


Thr 


He 


Lys 


Arg 


Val 


Glu 


Val 


He 


Thr 


Asp 








275 










280 










285 










Pro 


Gly 


Val 


Lys 


Tyr 


Asp 


Ala 


Glu 


Gly 


Thr 


Ser 


Ala 


He 


Leu 


Asp 


He 
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300 










c 
O 


Val 


Thr 


Giu 


Glu 


Gly 


Lys 


Lys 


Leu 


Glu 


Gly 


Tyr 


Ser 


Gly 


Ser 


He 


Thr 




305 










310 










315 










320 




Ala 


Ser 


Val 


Ser 


Asn 


Asn 


Pro 


Thr 


Ala 


Asn 


Gly 


Ser 


He 


Phe 


Leu 


Thr 
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330 










335 




10 


Ala 


Lys 


Ser 


Gly 


Lys 


Val 


Gly 


Leu 


Thr 


Thr 


Asn 


Tyr 


Asn 


Tyr 


Tyr 


Gly 








340 
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Gly 


Lys 


Asn 


Lys 


Gly 


Ser 


Arg 


Tyr 


Phe 


Thr 


Glu 


Arg 


Thr 


Thr 


Ser 


Met 








355 










360 










365 










Leu 


Gin 


Thr 


He 


Glu 


Glu 


Gly 


Lys 


Gly 


Gin 


Glu 


Thr 


Phe 


Gly 


Gly 


His 


15 




370 










375 










38 0 










Phe 


Gly 


A5n 


Ala 


Leu 


Leu 


Ser 


Phe 


Glu 


He 


Asp 


Ser 


Leu 


Asn 


Leu 


Phe 




385 










390 










395 










400 




Thr 


Val 


Gly 


Gly 


Asn 


Val 


Arg 


Leu 


Trp 


Glu 


Met 


Thr 


Thr 


Asp 


Arg 


Asn 












405 










410 










415 






Ser 


Val 


Glu 


Lys 


Ser 


Phe 


Ala 


Gly 


Ser 


Asn 


Leu 


i'let 


Ser 


Tyr 


He 


Asp 








420 










425 










4 30 






Arg 


Lys 


Leu 


Lys 


Thr 


Gin 


I'let 


Asp 


Ala 


Gly 


Ser 


Tyr 


Glu 


Leu 


Asn 


Ala 








435 










4 4 0 










445 










A£-p 


Tyr 


Gin 


His 


Ser 


Thr 


Arg 


Leu 


Pro 


Gly 


Glu 


Leu 


Leu 


Thr 


Val 


Ser 
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Tyr 


Arg 


Fhe 


Thr 
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Asn 


Pro 


Asn 


Asn 


Ser 


Glu 


Thr 


Phe 


He 


Asp 


Gin 
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475 










4 80 




Trp 


Lys 


Arg 


Asp 


Pro 


Leu 


Asn 


Thr 


Ala 


Asn 


Thr 


He 


Gin 


Tyr 


Ala 


Gly 












485 










490 










4 95 




30 


Gin 


His 


Ser 


Lys 


Ser 


Asp 


Ala 


Gly 


Met 


Asp 


Glu 


His 


Thr 


Ala 


Gin 


Val 








500 










505 










510 








A^p 


Tyr 


Thr 


Arg 


Pro 


Leu 


Gly 


Gin 


Ala 


His 


Ser 


Leu 


Glu 


Ala 


Gly 


Leu 








515 










520 










525 








Lys 


Tyr 


He 


Tyr 


Arg 


His 


Ala 


Thr 


Ser 


Asp 


Pro 


Leu 


Tyr 


Glu 


He 


Arg 


35 




530 










535 










540 








Pro 


Ser 


Glu 


Asp 


Ala 


Pro 


Trp 


Gin 


Pro 


Gly 


Ser 


Leu 


Tyr 


Ala 


Gin 


Asn 




5^5 










550 










555 










560 




Pro 


Ser 


Asn 


Gly 


Lys 


Phe 


Arg 


His 


Asp 


Gin 


Tyr 


He 


Gly 


Ala 


Ala 


Tyr 












565 










570 










57 5 




A O 

4U 


Ala 


Gly 


Tyr 


Asn 


Tyr 


Arg 


Lys 


Asp 


Gin 


Tyr 


Ser 


Leu 


Gin 


Thr 


Gly 


Leu 








580 










595 










590 






Arg 


Val 


Glu 


Ser 


Ser 


Arg 


Leu 


Lys 


Ala 


Leu 


Phe 


Pro 


Glu 


Asn 


Ala 


Ala 








595 










600 










605 










Ala 


Asp 


Phe 


Ser 


His 


Asn 


Ser 


Phe 


Asp 


Trp 


Val 


Pro 


Gin 


Leu 


Thr 


Leu 


4o 




610 










615 










620 










Gly 


Tyr 


Thr 


Pro 


Ser 


Pro 


Met 


Lys 


Gin 


Leu 


Lys 


Leu 


Ala 


Tyr 


Asn 


Phe 




625 










630 










635 










640 




Arg 


He 


Gin 


Arg 


Pro 


Ala 


He 


Gly 


Gin 


Leu 


Asn 


Pro 


Tyr 


Arg 


Leu 


Gin 












645 










650 










655 




c;n 
ou 


Thr 


Asn 


Asp 


Tyr 


Gin 


Val 


Gin 


Tyr 


Gly 


Asn 


Pro 


Asp 


Leu 


Lys 


Ser 


Glu 








660 










665 










670 








Lys 


Arg 


His 


His 


Val 


Gly 


Leu 


Ser 


Tyr 


Asn 


Gin 


Tyr 


Gly 


Ala 


Lys 


Val 








675 










680 










685 










Met 


Leu 


Thr 


Ala 


Ser 


Leu 


Asp 


Tyr 


Asp 


Phe 


Cys 


Asn 


Asn 


Ala 


He 


Gin 


Do 




690 










695 










700 










Asn 


Tyr 


Thr 


Phe 


Ser 


Asp 


Pro 


Ala 


Asn 


Pro 


Asn 


Leu 


Phe 


His 


Gin 


Thr 




7 05 










710 










715 










720 




Tyr 


Gly 


Asn 


He 


Gly 


Arg 


Glu 


His 


Ser 


Phe 


Ser 


Leu 


Asn 


Thr 


Tyr 


Ala 












725 










730 










735 




DU 


Met 


Tyr 


Thr 


Pro 


Ala 


Val 


Trp 


Val 


Arg 


He 


Met 


Leu 


Asn 


Gly 


Asn 


He 








740 










745 










750 








Asp 


Arg 


Thr 


Plie 


Gin 


Lys 


Ser 


Gl u 


Ala 


Leu 


Gly 


He 


Asp 


Val 


Asn 


Ser 








755 










760 










765 










I rp 


Ser 


•ji y 


|.|ci 


Va 1 


Tyr 


S*^ r 


ijx y 


Leu 


Met 


Phe 


Thr 


Leu 


Pro 


Lys 


Asp 


65 




770 










775 










780 










Trp 


Thr 


Val 


Asn 


Leu 


Phe 


Gly 


Gly 


Tyr 


Tyr 


His 


Gly 


Gly 


Arg 


Ser 


Tyr 




785 










790 










795 










800 




Gin 


Thr 


Lys 


Tyr 


Asp 


Gly 


Asn 


Val 


Phe 


Asn 


Asn 


He 


Gly 


He 


Ala 


Lys 












805 










810 










815 


70 


Gin 


Leu 


Phe 


Asp 


Lys 


Lys 


Leu 


Arg 


Val 


Ser 


Leu 


Ser 


Ala 


Asn 


Asn 


He 








820 










825 










830 








His 


Ala 


Lys 


Tyr 


Ser 


Thr 


Trp 


Lys 


Ser 


Arg 


Thr 


He 


Gly 


Asn 


Gly 


Phe 








835 










840 










845 










Thr 


He 


Tyr 


Ser 


Glu 


Asn 


Ala 


Gly 


He 


Gin 


Arg 


Ser 


Val 


Ser 


Leu 


Ser 


75 




850 










865 










860 










Leu 


Thr 


Tyr 


Ser 


Phe 


Gly 


Lys 


Met 


Asn 


Thr 


Gin 


Val 


Arg 


Lys 


Val 


Glu 
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865 870 875 880 

Arg Thr He Val Asn A^p Asp Leu Lys Gin Thr Ser Ser Gin Gly Gin 

885 890 895 

Gin Gly Gly Gly Gin Gly A^n Pro Thr Gly Asn 
5 900 905 

(2) INFORllATIOH FOR SEQ ID MO: 356 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 450 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

{vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromona s gingiva lis 

(ix) FEATURE: 

(A) NAI IE/KEY: misc_feature 

(B) LOCATION 1 ... 450 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356 



30 



Trp 


Cys 


Gin 


Ser 


Asp 


Pro 


He 


Pro 


His 


Cys 


His 


Ala 


Val 


Tyr 


Arg 


Asn 


1 








5 










10 










15 




Ala 


Gin 


Gly 


His 


Gin 


Gin 


Gly 


Arg 


Ser 


Asp 


Gin 


Asp 


Arg 


Leu 


Pro 


Pro 
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25 










30 






Val 


Arg 


Gly 


Tyr 


Arg 


Tyr 


Ala 


Arg 


Cys 


Leu 


Gly 


Arg 


Tyr 


Glu 


Gly 


Phe 






35 










40 










45 








Gly 


Glu 


Arg 


Met 


Arg 


Leu 


Ser 


Ala 


He 


Leu 


He 


Ala 


Leu 


He 


Val 


Met 




50 










55 










60 










Leu 


Pro 


Ala 


Val 


Leu 


Ser 


Gly 


Gin 


His 


Tyr 


Tyr 


Ser 


Met 


Ala 


Gly 


Glu 


65 










70 








75 








80 


Avg 


Leu 


Glu 


Thr 


A.sp 


Ser 


He 


Arg 


Pro 


Asn 


Giu 


Leu 


Ser 


Ala 


Ser 


He 










85 










90 










95 




Arg 


Ser 


Ala 


Leu 


Phe 


Phe 


Arg 


Asn 


Asn 


Glu 


Tyr 


Asn 


Ala 


Arg 


Ser 


Val 








100 










105 










110 






Lys 


Gly 


Tyr 


Thr 


Leu 


Pro 


Gly 


Ala 


Arg 


Val 


Ser 


Ala 


Phe 


Ala 


Ser 


Tyr 






115 










120 










125 








Ser 


Leu 


Pro 


Ala 


Ala 


His 


Gly 


Val 


Lys 


Leu 


Ser 


Leu 


Gly 


Val 


Ser 


Thr 




130 










135 










140 










Leu 


Asn 


Tyr 


Trp 


Gly 


Ala 


Ser 


Arg 


Tyr 


Pro 


Ala 


Gly 


He 


Ala 


Tyr 


Ser 


1)5 










150 










155 










160 


Asp 


Leu 


Pro 


Tyr 


Trp 


Thr 


Asp 


Tyr 


Asn 


Asp 


Tyr 


Val 


Arg 


Leu 


Arg 


He 










165 










170 










175 




Leu 


Pro 


Tyr 


Val 


Gin 


Ala 


Met 


Leu 


Lys 


Pro 


Thr 


Ala 


Thr 


Thr 


Ala 


Leu 








180 










185 










190 






llet 


Leu 


Gly 


Asn 


lie 


Ala 


Gly 


Gly 


Thr 


Ala 


His 


Gly 


Leu 


He 


Glu 


Pro 






195 










200 










205 








lie 


Tyr 


Asn 


Pro 


Glu 


Leu 


Asp 


Leu 


Thr 


Ala 


Asp 


Pro 


Glu 


Ala 


Gly 


Val 




210 










215 










220 








Gin 


Phe 


Arg 


Gly 


Asp 


Trp 


Thr 


Arg 


Phe 


Arg 


Met 


Asp 


Val 


Trp 


Val 


Asn 


225 










230 










235 










240 


Trp 


Met 


Ser 


Met 


lie 


Phe 


Lys 


Asn 


Asp 


Asn 


His 


Gin 


Glu 


Ser 


Phe 


Val 










245 










250 










255 




Phe 


Gly 


Leu 


Ser 


Thr 


Thr 


Ser 


Lys 


Leu 


Leu 


Ser 


Gly 


Glu 


Gly 


Lys 


Trp 








260 










265 










270 






Arg 


Leu 


Glu 


Leu 


Pro 


Leu 


Gin 


Ala 


He 


Ala 


Thr 


His 


Arg 


Gly 


Gly 


Glu 






275 










280 










285 








Tyr 


Asn 


Trp 


Ala 


Gin 


Gin 


Asp 


Thr 


Val 


His 


Thr 


Trp 


Val 


Asn 


Gly 


Ala 




290 










295 










300 








Val 


Gly 


Leu 


hys 


Leu 


Ser 


Tyr 


Arg 


Pro 


Arg 


Thr 


Asp 


Lys 


Pro 


Met 


Gin 


305 










310 










315 










320 




Trp 


Gly 


Ser 


Ala 


Tyr 


Gly 


Val 


Ala 


Ala 


Leu 


Ser 


Ser 


Gly 


Gly 


Tyr 










325 










330 










335 




Phe 


Pro 


Tyr 


Glu 


Arg 


Gly 


Trp 


Gly 


Gly 


Tyr 


Leu 


Ser 


Leu 


Gly 


Met 


Asp 








340 










345 










350 






Leu 


Glu 


His 


Phe 


Ala 


Phe 


Arg 


Thr 


Asp 


Tyr 


Trp 


Tyr 


Gly 


Arg 


His 


Tyr 






355 










360 










365 








Val 


Ser 


Pro 


Phe 


Ala 


Ala 


Pro 


Phe 


Ala 


Asn 


Ser 


Leu 


Thr 


Tyr 


Asp 


Lys 




370 










375 










380 




Gin 


Pro 


Leu 


Thr 


Asn 


Gly 


Trp 


Gly 


Asp 


Tyr 


He 


Arg 


Leu 


Tyr 


Ala 


Asp 
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385 390 395 400 

Tyr Ser Trp Arg Met Ala Arg Ser Val Ser Leu Ala Ala Val Ala Arg 

405 410 415 

Val Trp Phe Gin Pro Ser Asp Arg Phe Ala I let Ser His Ala Leu Glu 

420 425 430 

Leu Thr Het Arg He Asp Pro hys Phe Pro He Ala Phe Leu Lys Gly 
435 440 445 

Asn Hi^ 
450 

(2) INFORMATION FOR SEQ ID HO: 357 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii| MOLECULE TYPE: protein 

20 (iiil HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

25 (ix) FEATURE: 

(A) HAI4E/KEY: misc_feature 

(B) LOCATION 1 . . .4 47 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357 
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Pro 




Trp 


V U J. 
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Ml d 
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Al a 
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Ser 


Ser 


Val 


Arg 


JL 








O 
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Thr 


Arg 


Pro 


Cys 


Ser 


Thr 


Val 


Ser 


Ser 


Thr 


Ala 


Ala 


Ser 


Ser 


His 


Arg 








20 
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Arg 


Ser 


Glu 


His 


Met 


Arg 


Ser 


Ala 


Asp 


Ser 


Ser 


Pro 


Ala 


Tyr 


Ser 


Pro 
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40 










45 








He 


Gly 


Ser 


Leu 


Ser 


Ser 


Ser 


His 


Ser 


Phe 


Asp 


Ser 


He 


Pro 


Asp 
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55 
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Arg 


I let 
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Lys 


Ser 


Leu 


Leu 


Ser 


Leu 


Ala 


Cys 


Leu 


He 


Leu 


Cys 


Gly 


65 










70 
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80 


Met 


Pro 


Ala 


He 


Ala 


Gin 


Gin 


Thr 


Gly 


Pro 


Ala 


Glu 


Arg 


Ser 


Gly 


Glu 










85 










90 










95 




Pro 


Ser 


Leu 


Ala 


Glu 


Arg 


Val 


Phe 


Gly 


Leu 


Glu 


Gin 


Lys 


Gin 


Lys 


Lys 








100 










105 










110 






Leu 


Lys 


Val 


Tyr 


Leu 


Gly 


He 


Gin 


Ser 


Phe 


Tyr 


Asp 


Gin 


Pro 


Leu 


Val 






115 










120 










125 








Asp 


Asp 


Glu 


Ser 


His 


He 


Gly 


His 


Phe 


Lys 


Val 


Gin 


Glu 


Leu 


Arg 


Het 




130 










135 










140 










Ser 


Ala 


His 


Gly 


Glu 


Leu 


Asn 


Arg 


His 


Leu 


Ser 


Phe 


Asp 


Trp 


Arg 


Gin 


145 










150 










155 










160 


Arg 


Leu 


Asn 


Arg 


Ala 


Ala 


Asp 


Gly 


Thr 


Ser 


Phe 


Ala 


Asp 


Asn 


Leu 


Ser 










165 










170 










175 




Asn 


Ala 


He 


Asp 


He 


Ala 


Gly 


Val 


Asp 


Trp 


His 


Pro 


Asn 


Asp 


Lys 


Val 








180 










185 










190 






Ser 


Phe 


Phe 


Phe 


Gly 


Arg 


Gin 


Tyr 


Ala 


Arg 


Phe 


Gly 


Gly 


He 


Glu 


Tyr 






195 










200 










205 








Asp 


Met 


Asn 


Pro 


Val 


Glu 


He 


Tyr 


Gin 


Tyr 


Ser 


Asp 


Leu 


Val 


Asp 


Tyr 




210 










215 










220 








Met 


Thr 


Cys 


Tyr 


Thr 


Ser 


Gly 


Val 


Asn 


Phe 


Ala 


Trp 


Asn 


Phe 


His 


Pro 


225 










230 










235 










240 


Glu 


Gin 


Gin 


Leu 


Gin 


Leu 


Gin 


Val 


Leu 


Asn 


Ala 


Tyr 


Asn 


Asn 


Arg 


Phe 










245 










250 










255 




Ala 


Asp 


Arg 


Tyr 


His 


Val 


Thr 


Pro 


Asp 


Val 


Ala 


Thr 


Ala 


Thr 


Ser 


Tyr 








260 










265 










270 






Pro 


Leu 


Leu 


Tyr 


Ser 


Ala 


Gin 


Trp 


Asn 


Gly 


Thr 


Leu 


Leu 


Gly 


Gly 


Ala 






275 










280 










285 








Leu 


His 


Met 


Arg 


Tyr 


Ala 


Val 


Ser 


Met 


Ala 


His 


Gin 


Ala 


Gin 


Glu 


Arg 




290 










295 










300 








As n 


Met 


Trp 


Tyr 


Phe 


Thr 


Ala 


Gly 


Asn 


Leu 


Phe 


Asn 


Pro 


Gly 


Lys 


Arg 


305 










310 










315 










320 


He 


Asn 


Gly 


Tyr 


Leu 


Asp 


Leu 


Thr 


Tyr 


Ser 


He 


Glu 


Gly 


Leu 


Asp 


Asp 










325 










330 










335 




Lys 


Gly 


He 


Met 


Thr 


Ala 


Arg 


Tyr 


Gly 


Lys 


Gly 


Lys 


Thr 


Leu 


Thr 


Asp 








340 










345 










350 




Val 


Lys 


Tyr 


Tyr 


Ala 


Leu 


Val 


Ser 


Lys 


Trp 


Asn 


Phe 


Arg 


He 


Phe 


Asp 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCtD: <WO 9929870A1_L> 



wo 99/29870 



PCTyAU98/01023 



289/ 490 



30 



35 



40 



45 



50 



55 



60 



65 



70 







355 










360 










365 








Gin 


Val 
37 0 


Asn 


Leu 


Phe 


Leu 


Lys 
375 


Gly 


Met 


Tyr 


Glu 


Asn 
380 


Gly 


Tyr 


Ala 


Pro 


Ala 


Gin 


Tyr 


Gly 


Glu 


Ser 


Ser 


His 


Thr 


Arg 


His 


Ser 


Tyr 


Gly 


Tyr 


Met 


385 










390 










395 




400 


Gly 


Gly 


Val 


Glu 


Tyr 


Tyr 


Pro 


Thr 


Glu 


Thr 


Asn 


Phe 


Arg 


Leu 


Phe 


Val 










405 










410 








415 




Thr 


Tyr 


He 


Gly 


Arg 


His 


Tyr 


Arg 


Tyr 


Ser 


Ala 


Thr 


Glu 


Thr 


Glu 


Ser 
















425 










430 






Thr 


Asn 


Ala 
435 


Leu 


Arg 


Ala 


Gly 


Leu 
440 


He 


Tyr 


Gin 


He 


Pro 
445 


Phe 


Leu 





10 



(2) INFORMATION FOR SEQ ID MO: 358 

15 (i) SEQLfEIJCE: CHARACTERISTICS: 

(A) LENGTH: 227 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Po rphyromonas gingivalis 

{!>:) FEATURE: 

(A) IJAI-IE/KEY: misc_feature 

(B) LOCATION 1 . . .227 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358 



Tyr 


Lys 


Gin 


Ala 


He 


Met 


Lys 


His 


Leu 


Phe 


Lys 


Ser 


Thr 


Leu 


Val 


Leu 


1 








5 










10 










15 




Leu 


Cys 


Ala 


Leu 


Ser 


Phe 


Ser 


Gly 


Thr 


Tyr 


Thr 


Phe 


Ala 


Gin 


Gl u 


Asn 








20 










25 










30 






Asn 


Thr 


Glu 


Lys 


Ser 


Arg 


Phe 


Asp 


Phe 


Ser 


Val 


Arg 


Leu 


Gly 


Gin 


Gly 






35 










40 










45 




Tyr 


He 


Ala 


Gly 


Ser 


Thr 


Thr 


Asn 


Leu 


Met 


Tyr 


Gly 


Tyr 


Thr 


Ser 


Ala 




50 










55 










60 








Asn 


Asp 


Arg 


Leu 


Leu 


Ser 


Gly 


Ala 


He 


Tyr 


Leu 


Gly 


Leu 


Thr 


Pro 


Ser 


65 










70 








75 








80 


Lys 


Lys 


Glu 


Asn 


Ala 


Thr 


Gly 


Val 


Ala 


Phe 


Arg 


Phe 


Leu 


Ser 


Pro 


Ser 










85 










90 










95 




Pro 


Gly 


Tyr 


Tyr 


Val 


Asp 


He 


Ser 


Gly 


Lys 


Glu 


Asn 


Thr 


Leu 


Asn 


Tyr 








100 










105 










110 






Ala 


Phe 


Tyr 


Val 


Val 


Gly 


Ala 


Tyr 


Asn 


Arg 


He 


Ala 


He 


Pro 


He 


Arg 






115 










120 










125 






Pro 


He 


Lys 


Asn 


Phe 


Asn 


Phe 


He 


Phe 


Ser 


Thr 


Glu 


Val 


Gly Met 


Ala 




130 










135 










140 










Trp 


Met 


Ser 


Arg 


His 


Glu 


Gin 


He 


Tyr 


Asn 


Ser 


Thr 


Ser 


Gin 


Thr 


Trp 


145 










150 










155 










160 


Asp 


Lys 


Gin 


Arg 


Lys 


Ser 


Arg 


Ser 


Gly 


Leu 


Asp 


Phe 


Gly 


Leu 


Gly 


Met 










165 










170 






175 




His 


Leu 


Gin 


Unk 


His 


He 


Asn 


Lys 


Thr 


Val 


Tyr 


Phe 


Met 


Ala 


Gly 


Thr 








180 










185 










190 




Asp 


Leu 


Thr 


Ser 


Cys 


Met 


Phe 


Gly 


Lys 


Arg 


He 


Asn 


Asp 


Tyr 


Gin 


Gin 






195 










200 










205 






Lys 


Asp 


Arg 


Thr 


Phe 


He 


Ala 


Leu 


He 


Asp 


Asn 


Ser 


He 


Gly 


He 


Gly 




210 










215 










220 






Leu 


Asn 


Leu 




























225 
































(2) 


1 11 FORMAT I ON 


FOR 


SEQ 


ID NO:359 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 06 amino acids 

(B) TYPE: amino acid 
{D} TOPOLOGY: linear 



(it) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 
75 (vi) ORIGINAL SOURCE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 



PCT/AU98/01023 



290/ 490 



( A ) ORGAN I SM : Porphyromonas gingivalis 

(ix) FEATURE: 

{A) (W)E/KEV: misc_feature 
5 (B) LOCATIOII 1 ... 406 

SEQUENCE DESCRIPTION: SEQ ID 110:359 

He Phe He Asp Pro Asp Lys Asn Thr Lys Gin Asn Glu Arg Asn Met 
10 I 5 10 15 

He He Lys Lys Met Leu Lys Asn Lys Leu Ala Pro Leu Ala lie Leu 

20 25 30 

Phe Leu Phe Ala Pro Lys Ala Met Lys Ala Gin Glu Gin Leu Asn Val 
35 40 45 

15 Val His Thr Ser Val Pro Ser Leu Asn He Ser Pro Asp Ala Arg Ala 
60 55 60 

Ala Gly Met Gly Asp He Gly Val Ala Thr Thr Pro Asp Ala Tyr Ser 
65 70 75 80 

Gin Tyr Trp Asn Pro Ser Lys Tyr Ala Phe Met Asp Thr Lys Ala Gly 
20 85 90 95 

He Ser Phe Ser Tyr Thr Pro Trp Leu Ser Lys Leu Val Asn Asp He 

100 105 110 

Ala Leu Met Gin Met Thr Gly Phe Tyr Lye Leu Gly Thr Asp Glu Asn 
115 120 125 

25 Gin Ala He Ser Ala Ser Leu Arg Tyr Phe Thr Leu Gly Lys Leu Glu 
130 135 140 

Thr Phe Asp Glu Leu Gly Glu Ser Met Gly Glu Ala His Pro Asn Glu 
145 150 155 160 

Phe Ala Val Asp Leu Gly Tyr Ser Arg Gin Leu Ser Glu Asn Phe Ser 
30 1G5 170 176 

Met Ala Val Ala Leu Arg Tyr He Arg Ser Asp Gin Ser Thr His Asn 

180 185 190 

Thr Gly Glu Asn Gin Ala Gly Asn Ala Phe Ala Ala Asp He Ala Gly 
195 200 205 

35 Tyr Leu Gin Lys Tyr Val Leu Leu Gly Asn Ala Glu Ser Leu Trp Ser 
210 215 220 

Leu Gly Phe Asn Val Lys Asn He Gly Thr Lys He Ser Tyr Asp Gly 
225 230 235 240 

Gly Val Thr Ser Phe Phe He Pro Thr Ser Leu Asn Leu Gly Thr Gly 
40 245 250 255 

Leu Leu Tyr Pro He Asp Asp Tyr Asn Ser He Asn Phe Asn Leu Glu 

260 265 270 

Leu Ser Lys Leu Leu Val Pro Thr Pro Pro He Met Asp Gin Asn Asp 
275 280 285 

45 Gin Ala Gly Tyr Glu Ala Ala Leu Lys Lys Tyr Gin Glu Thr Ser Ser 
290 295 300 

He Ser Gly He Phe Ser Ser Phe Gly Asp Ala Pro Gly Gly Leu Lys 
305 310 315 320 

Glu Glu Phe Arg Glu He Thr Trp Gly Leu Gly A] a Glu Tyr Ser Tyr 
50 325 330 335 

Asp Asp Lys Phe Phe Val Arg Ala Gly Tyr Ser Tyr Leu His Pro Thr 

340 345 350 

Lys Gly Asn Leu Gin Tyr Phe Thr Ala Gly Ala Gly Phe Lys Met Asn 
355 360 365 

55 He Phe Arg He Asp Ala Ser Tyr Leu Leu Ser Thr He Gin Ser Asn 
370 375 380 

Pro Leu Asp Gin Thr Leu Arg Phe Thr Leu Ala Phe Asp Met Asp Gly 
385 390 395 400 

Leu Arg Asn Leu Phe His 
60 405 

(2) INFORMATION FOR SEQ ID NO: 36 0 

(i) SEQUENCE CHARACTERISTICS: 
65 (A) LENGTH: 452 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



70 



75 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1„t_> 




wo 99/29870 PCT/AU98/01023 

291/490 

(ix) FEATLfRE: 

<A) NAME/KEY: misc_feature 
(B) LOCATION 1 ... 452 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID MO: 3 60 

Leu lie lie Glu Lys Glu Met Lys Thr Thr Val Gin Gin lie He Leu 
1 5 ■ 10 15 

Cys Leu Ala Leu Met Met Ser Gly Val Leu Gly Gly Asn Ala Gin Ser 
10 20 25 30 

Phe Trp Glu Glu He Ala Pro Pro Phe He Ser Asn Glu Pro Asn Val 

35 40 45 

Lys Tyr He He Pro Asn Met Gly He Asp Ser Lys Gly Thr He Tyr 
50 55 60 

15 Val Thr Val Thr Lys Arg He Gin Gin Gly Ala Asn Tyr Thr Ser Glu 
65 70 75 80 

Gin Leu Gly Met Tyr Tyr Acg Pro Leu Gly Asp Asn Glu Gin Trp Trp 

85 90 95 

Lys His Asp Pro Tyr Phe Asp Asp Lys He Val Ala Asp He Gin Thr 
20 100 105 110 

Asp Ala Tyr Gly Arg Val Tyr Val Cys Thr Thr Ser Ser Arg Asp Gin 

115 120 125 

Glu Tyr Gin Leu Tyr He Asn Glu Gin Asn Glu Trp Arg Cys He Phe 
130 135 140 

25 Lys Thr Ser Val Ser Thr Tyr Glu His Gly Met Ala Val Phe Arg Ser 
1*^5 150 155 160 

Ser Thr Gly Val Thr Tyr He Gly Thr Arg His His He Phe Ala Ser 

165 170 175 

Gly Val Asn Asp Phe Glu Phe Asn Thr He Tyr Glu Asp Ser Thr Pro 
30 180 185 190 

Met Ser Cys Arg Phe Ala Glu Ala Thr Asn Ser Gly Thr He Tyr Leu 

195 200 205 

Ala Leu Met His Glu Thr Thr Met Ser Thr Thr He Leu Thr Tyr Gin 
210 215 220 

35 Asn Gly Glu Phe Val Asp He Ser Glu Ser Glu Leu Ser Asn Ser He 
225 230 235 240 

He Ala Ser Het Cys Ser Asn Lys Glu Gly Asp He He Ala Leu Val 

245 250 255 

Thr Ser Tyr Thr Gly Phe Met Ser Gly Thr Leu Ala He Arg Lys Ala 
40 260 265 270 

Asp Glu Gly Lys Trp Gin Leu Val Gly Gly Asp He Gin Asn Ala He 

275 280 285 

Val Gin Asn He Cys Met Met Asp Asp Asn Lys He Ala Cys Glu Val 

290 295 300 

Phe Gly Thr Pro Asn Gly Val Asp Gly Arg Thr Arg Val Cys Val Ser 
305 310 315 320 

Asp Ala Ser Val Phe Asp Phe Glu Trp Tyr Glu Asp Glu He Tyr Gly 

325 330 335 

Gly Leu He Phe Asp Thr Phe Phe Tyr Ser Pro Trp Asp Lys Leu Leu 

340 345 350 

Tyr Ala Lys Phe Gly Gly He Met Leu Arg Ser Lys Glu Ser Phe He 

355 360 365 

Thr Ser Phe He Ser Pro Thr Val Val Gin Gly Val Asp Val Tyr Thr 
370 375 380 

55 Leu Ala Gly Lys He Arg He Glu Ser Glu Thr Pro Val Ser Glu Val 
385 390 395 400 

Leu Leu Phe Asp Leu Ala Gly Arg Met Val Leu Arg Gin Thr He Asp 

4 05 410 415 

Asn Lys He Tyr Ser Asp He Asp Thr Asn Gly Leu Lys Arg Ser Gly 

420 425 430 

He Tyr Val Val Ser Val Arg Leu Ser Ser Gly Gin Val Phe Ser His 

4 35 4 40 4 45 

Lys Val Gin Val 
450 



45 



50 



60 



65 



(2) IMFORriATIOH FOR SEQ ID HO: 361 



(i) SEQUEUCE CHARACTERISTICS: 

(A) LENGTH: 331 amino acids 
70 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

75 (iii) HYPOTHETICAL: YES 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_I_> 




wo 99/29870 PCT/AU98/01023 

292/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(vi) ORIGINAL SOURCE: 

(A) ORGAI^IISH: Porphyromon as gingiva lis 

(ix) FEATURE: 

(A) tlAl^IE/KEY: misc__f ea ture 

(B) LOCATION 1 . . .331 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 361 



Gly 


Leu 


Tyr 


Gly 


Met 


Ser 


Val 


Val 


Pro 


He 


He 


He 


Tyr 


Leu 


Cys 


Gly 


1 








5 










10 










15 


He 


Ser 


Asn 




Ala 


Arg 


Leu 


Met 


He 


He 


Arg 


Cys 


Leu 


He 


Arg 


Arg 








20 










25 










30 






Pro 


Arg 


Th r 


Val 


Leu 


Phe 


Gl y 


Leu 


He 


Phe 


Val 


Val 


Gly 


Leu 


Phe 


Ser 






35 










40 










45 








Ala 


Met 


Al a 


Gin 


Glu 


Lys 


Lys 


Asp 


Ser 


Leu 


Ser 


Thr 


Val 


Gin 


Pro 


Val 




50 










55 










60 










Pro 


Asn 


Ser 


Ser 


Met 


Val 


Glu 


Gin 


Thr 


Pro 


Leu 


Leu 


Ser 


He 


Asp 


His 


65 










70 










75 










80 


Pro 


Val 


Leu 


Pro 


Ala 


Ser 


Phe 


Gin 


Asn 


Thr 


Arg 


Thr 


Leu 


Lys 


Arg 


Phe 










85 










90 










95 




Arg 


Asp 


Lys 


His 


Leu 


Ser 


Asp 


Ala 


Leu 


Leu 


Asn 


Gly 


Leu 


Lys 


Pro 


His 








1 00 










105 










110 






Arg 


Ser 


Ser 


Leu 


Gin 


Leu 


Asn 


Glu 


Glu 


Leu 


Asn 


Phe 


Ala 


Ala 


Glu 


Arg 






115 










120 










125 








Arg 


A^p 


Phe 


Val 


Ser 


Pro 


Leu 


Leu 


Gin 


Thr 


Arg 


His 


Ala 


Ala 


Gly 


Val 




130 










135 










14 0 










Leu 


Ser 


Trp 


Arg 


Pro 


Thr 


Asp 


Arg 


Met 


His 


Phe 


Tyr 


Thr 


Ser 


Gly 


Asn 


14 5 










150 










155 










160 


He 


Gly 


Leu 


Gly 


His 


Asp 


Leu 


Leu 


Thr 


Gly 


Val 


Arg 


Lys 


Asp 


Phe 


Gly 










165 










170 










175 




Trp 


Asn 


Ala 


Gly 


Ala 


Asp 


Phe 


Leu 


Leu 


Ser 


Gin 


Asn 


Leu 


Thr 


Ala 


His 








180 










185 










190 






Val 


Gin 


Gly 


Gly 


Trp 


Gin 


Gin 


Asn 


Phe 


Gly 


Phe 


He 


Pro 


Met 


Thr 


Ala 






195 










200 










205 








Val 


Asn 


Gly 


Gin 


Leu 


Arg 


Trp 


Gin 


Ala 


Thr 


Glu 


Arg 


Leu 


Ser 


Phe 


Thr 




210 










215 










220 










Thr 


Gly 


He 


Asp 


Tyr 


Arg 


Gin 


Val 


Gin 


Trp 


Asn 


Ala 


Phe 


Asp 


Asn 


Arg 


225 










230 










235 










240 


Thr 


Phe 


Ser 


Leu 


Lys 


Gly 


Ser 


Ala 


Arg 


Tyr 


Glu 


Val 


Met 


Asp 


Asn 


Val 










245 










250 










255 




Phe 


Val 


Asn 


Gly 


Phe 


Gly 


Ser 


Tyr 


Pro 


Leu 


Tyr 


Ser 


Ser 


Thr 


Arg 


Ser 








260 










265 










270 






Gly 


Leu 


Asn 


Met 


Ala 


Val 


Pro 


Met 


His 


Gly 


Phe 


Gly 


Pro 


Gin 


Tyr 


Gly 






275 










280 










285 








Gly 


Ser 


Leu 


Glu 


Leu 


Lys 


Val 


Ser 


Glu 


Arg 


Phe 


Gly 


Phe 


Ala 


Val 


Gly 




290 










295 










300 










Met 


Glu 


Arg 


Glu 


Tyr 


Asn 


He 


Trp 


Thr 


Arg 


Ara 


Trp 


Glu 


Thr 


His 


Tyr 


305 










310 










315 










320 


Phe 


Ala 


Tyr 


Pro 


Val 


Phe 


Tyr 


Gly 


Asp 


Lys 


Lys 




















325 










330 















(2) INFORMATION FOR SEQ ID HO: 362 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

65 {vi) ORIGINAL SOURCE: 

(A) ORGAflISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAJ-1E/KEY: mi sc_f ea ture 
70 (B) LOCATION 1.-.32 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 362 

Glu Thr Asn Ser Trp Val Ser Ser Asp Cys Asn Ser Thr Thr Met Lys 
75 1 5 10 15 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCtD: <WO 9929870A1J_> 



wo 99/29870 



PCT/AU98/01023 



293/490 



50 



55 



60 



65 



70 



75 



Thr 


Asn 


Arg 


Arg 


Tyr 


Ala 


Phe 


Val 


Leu 


Pro 


Leu 


Leu 


Leu 


Leu 


Thr 


Gly 








20 










25 










30 




Leu 


Leu 


Ala 


Trp 


Gly 


Gl n 


Asp 


Ser 


Ser 


His 


Gly 


Ser 


Asn 


Thr 


Ala 


Phe 






35 










40 










45 








Ala 


Thr 


Asp 


Ser 


Ser 


Ser 


Arg 


Glu 


Leu 


Pro 


Thr 


Glu 


Gin 


Ser 


Ala 


Tyr 




50 










55 










60 








Arg 


He 


His 


Ser 


Ala 


Tyr 


Met 


Val 


Gly 


Gly 


Gly 


Gly 


Ser 


He 


Thr 


Arg 


65 










70 










75 










80 " 


Asp 


Thr 


Tyr 


Leu 


Ser 


Pro 


Leu 


Arg 


Tyr 


Gly Gly 


Trp 


Thr 


Leu 


Asn 


Leu 










85 










90 










95 




Leu 


Gly 


Glu 


Lys 


Thr 


Phe 


Pro 


Leu 


Lys 


Ala 


Ser 


Asp 


Ser 


Arg 


Trp 


Met 








100 










105 










110 




He 


Arg 


Thr 


Gly 


His 


Glu 


Leu 


Asp 


Phe 


Ala 


Leu 


Met 


Asp 


Asn 


Fro 


Ala 






115 










120 










125 








Asn 


Asn 


Ala 


His 


Phe 


Tyr 


Ser 


Leu 


Leu 


Tyr 


Asn 


Gly 


Ser 


Ala 


Ala 


Ala 




130 










135 










140 










Leu 


Tyr 


Arg 


Leu 


Gly 


Ala 


Lys 


His 


Leu 


Arg 


Ala 


Ala 


Trp 


Met 


Asp 


Asn 


145 










150 










155 






160 


Leu 


Arg 


Leu 


Ala 


Phe 


Gly 


Pro 


Gly 


Leu 


Glu 


He 


Gly 


Leu 


Gly 


Gly 


He 










165 










17 0 










175 




Tyr 


Ser 


Thr 


Arg 


Asn 


Gly 


Asn 


Asn 


Pro 


Ala 


Thr 


Leu 


Lys 


Leu 


Tyr 


Thr 








180 










185 










190 






Asn 


Ala 


He 


Ala 


Gin 


Ala 


Ser 


He 


Gly 


Tyr 


Tyr 


Val 


Pro 


Ser 


Glu 


Thr 






195 










200 










205 








Phe 


Pro 


Leu 


Tyr 


Phe 


Arg 


Leu 


Leu 


Ser 


Gin 


He 


Asn 


Leu 


Phe 


Gly 


He 




210 










215 










220 








Ala 


Tyr 


Gly 


Asn 


Gly 


Phe 


Gly 


Glu 


Ser 


Tyr 


Tyr 


Glu 


Asn 


Phe 


Leu 


Leu 


225 










230 










235 










2^0 


Am 


Asn 


Gl V 
K3± y 


He 


Al a 


ijiy 


o e r 


Leu 


Hi s 


Phe 


Thr 


Tyr 


Pro 


Gly 


Lys 


Phe 










2^15 










250 








255 




Thr 


Arg 


Phe 


Thr 


Thr 


Leu 


He 


Thr 


Ala 


Asp 


He 


Fro 


He 


Arg 


Asn 


Phe 








2 60 










265 










210 






Cys 


Thr 


Leu 


Arg 


Val 


Gly 


Tyr 


Arg 


Tyr 


Ser 


His 


Leu 


Gly 


Ser 


Ser 


Leu 






275 










280 










285 








Asn 


Ala 


Leu 


Asp 


Thr 


Arg 


He 


His 


Ser 


His 


Thr 


Ala 


Phe 


He 


Gly 


Phe 




290 










295 










300 








Val 


Thr 


Glu 


Phe 


Tyr 


Arg 


Phe 


Arg 


Gly 


Arg 


Lys 


Ala 


Met 


Asn 


Thr 


Gly 


305 










310 










315 










320 


Arg 


Arg 


Thr 


Ser 


Leu 


Tyr 


Tyr 


His 


Asp 
























325 
























(2) 


ItlFORI'IATION 


FOR 


SEQ 


ID NO: 363 

















10 



15 



20 



25 



30 



35 



40 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LEllGTH: 319 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . , 319 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363 



Phe 


Thr 


Ser 


Gly 


Thr 


He 


Phe 


Val 


Arg 


He 


Ala 


He 


Ser 


Giy 


Arg 


Val 


1 








5 










10 








15 




Val 


Cys 


7\rg 


Glu 


Val 


Glu 


Asp 


Asn 


Gly 


Arg 


Arg 


Glu 


Ala 


Val 


Arg 


His 








20 










25 










30 




Asp 


Gly 


Met 


Val 


Val 


Thr 


Leu 


Leu 


Val 


He 


Val 


Gly 


He 


Val 


Val 


Val 






35 










40 








45 








Val 


Arg 


Tyr 


Ser 


Leu 


Arg 


Val 


His 


Val 


His 


Lys 


Thr 


Gly 


Thr 


Val 


Val 




50 










55 










60 








Ser 


Ala 


Ala 


He 


Phe 


Gly 


Phe 


He 


Leu 


Leu 


Gly 


Lys 


Thr 


Val 


Pro 


Cys 


65 










70 










75 










8 0 


Asp 


Thr 


Arg 


Asn 


Phe 


Phe 


Ser 


Ser 


Glu 


Ser 


Asp 


Glu 


Pro 


Glu 


Ser 


Arg 










85 










90 










95 


Val 


Ala 


Thr 


Glu 


He 


Ala 


His 


Leu 


Cys 


Glu 


He 


Gly 


Phe 


Gin 


He 


His 








100 










105 








110 







SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 



PCT/AU98/01023 



294/ 490 



Ala Ser Ser He Hxs Val Ala Val Arg Thr Asp Phe Gly Gin Ala Gly 

115 120 126 

He His Cys Pro Met Ala Thr Asp Ala Ser Ala Thr Glu Phe Asp Arg 
130 135 140 

5 Ser Ala Glu Cys Ala Glu Arg Thr Ser Ala Gin He Asp Thr Ala He 
145 150 156 160 

Ara Ser Gin Ser Gin He He Arg Thr His lie Asp Thr Cys Pro Lys 

165 170 175 

Ser Ser Gly Thr He Gly Gly Ser Thr His Thr Ser Leu His Leu Lys 
10 180 185 190 

Val Phe Asp Gly Arg Gly Glu Val Gly His He His Pro Lys Asp Gly 

195 200 205 

Leu Arg Phe Gly Val Val Glu Gly Tyr Ser He Gly Ser Tyr Val Asp 
210 215 220 

15 Ala He Gly He Gly Ala Thr His Ala Lys Ala Gly He Ser Asp Thr 
225 230 235 240 

Arg Thr Gly He Ala Gly Gly Tyr His Gly Gly Ser Gin His Gin Gin 

245 250 255 

He Gly Asp Val Thr Thr He He Gly Leu Gly Lys Phe Gly Leu Ala 
20 260 265 270 

Asn Val Gly Val Ser Asp Arg Gly Phe Arg Arg Gly Ala Ser Ser Tyr 

275 280 285 

Asp Leu Tyr Gly Leu Glu Leu His He Ala Lys Thr He Tyr Leu Ala 
290 295 300 

25 Val Gly Asp Gly Cys Leu Cys Arg Gin Ala Glu Arg Gin Asp Gly 
305 310 315 

(2) IMFORMATIOII FOR SEQ ID NO: 364 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH: 614 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: protein 

i iii ) HYPOTHETICAL: YES 

(vi) ORIGIMAL SOURCE: 
40 (A) ORG7^4ISI't: Porphyromonas gingival is 

(i:-:) FEATURE: 

(A) NAtlE/KEY: misc_f eatur e 

(B) LOCATIOM 1 ... 614 



45 



t::i) SEQUENCE DESCRIPTION: SEQ ID 110:364 



Pro Tyr Asn Val Gin Ser He Ser Asn Lys Thr He Lys Lys Gin Met 
15 10 15 

50 Glu Asn Leu Lys Asn He Gin Pro Arg Glu Asp Phe Asn Trp Glu Glu 
20 25 30 

Phe Glu Ala Gly Gly Val His Ala Ala Val Ser Arg Gin Glu Gin Glu 

35 40 45 

Ala Ala Tyr Asp Lys Thr Leu Asn Thr He Lys Glu Lys Glu Val Val 
55 50 55 60 

Met Gly Arg Val Thr Ala He Asn Lys Arg Glu Val Val He Asn Val 
65 70 75 80 

Gly Tyr Lys Ser Glu Gly Val Val Pro Ala Thr Glu Phe Arg Tyr Asn 
85 90 95 

60 Pro Glu Leu Lys Val Gly Asp Glu Val Glu Val Tyr He Glu Asn Gin 
100 105 110 

Glu Asp Lys Lys Gly Gin Leu Val Leu Ser His Arg Lys Gly Arg Ala 

115 120 125 

Ala Arg Ser Trp Glu Arg Val Asn Glu Ala Leu Glu Lys Asp Glu He 
65 130 135 140 

Val Lys Gly Tyr Val Lys Cys Arg Thr Lys Gly Gly Met He Val Asp 
145 150 155 160 

Val Phe Gly He Glu Ala Phe Leu Pro Gly Ser Gin He Asp Val Arg 
165 170 175 

70 Pro He Arg Asp Tyr Asp Ala Phe Val Glu Lys Thr Met Glu Phe Lys 
180 185 190 

He Val Lys He Asn Gin Glu Tyr Lys Asn Val Val Val Ser His Lys 

195 200 205 

Val Leu He Glu Ala Glu Leu Glu Gin Gin Lys Lys Glu He He Gly 
75 210 215 220 
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60 



65 



70 



Lys 


Leu 


Glu 


Lys 


Gly 


Gin 


Val 


Leu 


Glu 


Gly 


He 


Val 


Lys 


Asn 


He 


Thr 


225 










230 










235 










240 


Ser 


Tyr 


Gly 


Val 


Phe 


He 


Asp 


Leu 


Gly 


Gly 


Val 


Asp 


Gly 


Leu 


He 


His 










245 










250 










255 




He 


Thr 


A^p 


Leu 


Ser 


Trp 


Gly 


Arg 


Val 


Ala 


His 


Pro 


Glu 


Glu 


He 


Val 








260 










265 










270 






Gin 


Leu 


Asp 


Gin 


Lys 


He 


Asn 


Val 


Val 


He 


Leu 


Asp 


Phe 


Asp 


Glu 


Asp 






275 










280 










285 








Arg 


Lys 


Arg 


He 


Ala 


Leu 


Gly 


Leu 


Lys 


Gin 


Leu 


Met 


Pro 


His 


Pro 


Trp 




290 










295 










300 










Asp 


Ala 


Leu 


Asp 


Ser 


Glu 


Leu 


Lys 


Val 


Gly 


Asp 


Lys 


Val 


Lys 


Gly 


Lys 


305 










310 










315 










320 


Val 


Val 


Val 


Met 


Ala 


Asp 


Tyr 


Gly 


Ala 


Phe 


Val 


Glu 


He 


Ala 


Gin 


Gly 










325 










330 










335 




Val 


Glu 


Gly 


Leu 


He 


His 


Val 


Ser 


Glu 


Met 


Ser 


Trp 


Thr 


Gin 


His 


Leu 








340 










345 










350 






Arg 


Ser 


Ala 


Gin 


Asp 


Phe 


Leu 


His 


Val 


Gly 


Asp 


Glu 


Val 


Glu 


Ala 


Val 






355 










360 










365 








He 


Leu 


Thr 


Leu 


Asp 


Arg 


Glu 


Glu 


Arg 


Lys 


Met 


Ser 


Leu 


Gly 


Leu 


Lys 




370 










375 










380 










Gin 


Leu 


Lys 


Pro 


Asp 


Pro 


Trp 


Ala 


Asp 


He 


Glu 


Thr 


Arg 


Phe 


Pro 


Val 


385 










390 










395 










400 


Gly 


Ser 


Arg 


His 


His 


Ala 


Arg 


Val 


Arg 


Asn 


Phe 


Thr 


Asn 


Phe 


Gly 


Val 










405 










410 










415 




Phe 


Val 


Glu 


He 


Glu 


Glu 


Gly 


Val 


Asp 


Gly 


Leu 


He 


His 


He 


Ser 


Asp 








420 










425 










430 






Leu 


Ser 


Trp 


Thr 


Lys 


Lys 


He 


Lys 


His 


Pro 


Ser 


Glu 


Phe 


Thr 


Glu 


Val 






435 










440 










445 








Gly 


Ala 


Asp 


He 


Glu 


Val 


Gin 


•Val 


He 


Glu 


He 


Asp 


Lys 


Glu 


Asn 


Arg 




450 










455 










460 










Arg 


Leu 


Ser 


Leu 


Gly 


His 


Lys 


Gin 


Leu 


Glu 


Glu 


Asn 


Pro 


Trp 


Asp 


Val 


465 










470 










475 








480 


Phe 


Glu 


Thr 


Val 


Phe 


Thr 


Val 


Gly 


Ser 


He 


His 


Glu 


Gly 


Thr 


Val 


He 










485 










490 










4 95 




Glu 


Val 


Met 


Asp 


Lys 


Gly 


Ala 


Val 


Val 


Ser 


Leu 


Pro 


Tyr 


Gly 


Val 


Glu 








500 










505 










510 






Gly 


Phe 


Ala 


Thr 


Pro 


Lys 


His 


Met 


Val 


Lys 


Glu 


Asp 


Gly 


Ser 


Gin 


Ala 






515 










520 










525 








Val 


Leu 


Glu 


Glu 


Lys 


Leu 


Pro 


Phe 


Lys 


Val 


He 


Glu 


Phe 


Asn 


Lys 


Asp 




530 










535 










540 










Ala 


Lys 


Arg 


He 


He 


Val 


Ser 


His 


Ser 


Arg 


Val 


Phe 


Glu 


Asp 


Glu 


Gin 


545 










550 










555 










560 


Lys 


Ilet 


Ala 


Gin 


Arg 


Glu 


Ala 


Asn 


Ala 


Glu 


Arg 


Lys 


Ala 


Glu 


Ala 


Lys 










565 










570 










575 




Ala 


Ala 


Gin 


Lys 


Glu 


Ala 


Ala 


Ala 


Glu 


Ala 


Ala 


Asn 


Pro 


Ala 


Gin 


Ala 








580 










585 










590 






Val 


Glu 


hys 


Ala 


Thr 


Leu 


Gly 


Asp 


Leu 


Gly 


Glu 


Leu 


Ala 


Ala 


Leu 


Lys 






595 










600 










605 








Glu 


Lys 


Leu 


Ser 


Glu 


Asn 
























610 






























(2} 


IN FORMATION 


FOR 


SEQ 


ID NO: 365 

















10 



15 



20 



25 



30 



35 



40 



45 



50 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 243 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

(ix) FEATURE: 

(A) NAI-4E/KEY: misc_feature 

(B) LOCATION H . .243 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365 



Ser Thr He Met Lys Lys Ala He Leu Ser Gly Ala Ala Leu Leu Leu 

15 10 15 

Gly Leu Cys Ala Asn Ala Gin Asn Val Gin Leu His Tyr Asp Phe Gly 
75 20 25 30 
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40 



55 



60 



65 



70 



75 



His 


Ser 


lie 


Tyr 


Asp 


Gl u 


Leu 


Asp 


Gl y 


Arg 


Pro 


Lys 


Leu 


Thr 


Thr 


Thr 






35 










40 










4 5 








Val 


Glu 


Asn 


Phe 


Thr 


Pro 


Asp 


Lys 


Trp 


Gly 


Ser 


Thr 


Phe 


Phe 


Phe 


He 




50 










55 










60 










Asp 


Met 


Asp 


Tyr 


Thr 


Gly 


Lys 


Gl y 


He 


Gin 


Ser 


Al a 


Tyr 


Trp 


Glu 


He 


65 










7 0 










7 5 










80 




Arg 


Glu 


Leu 


Lys 


Phe 


Trp 


Gin 


Ala 


Pro 


Val 


Ser 


He 


His 


Leu 


Glu 










8 5 










90 










95 




Tyr 




Gly Gly 


ijeu 


Ser 


Thr 


Ser 


Phe 


Thr 


Phe 


Gly 


His 


Asp 


Al a 


Leu 








100 










105 










110 






lie 


Gly 


Ala 


Thr 


Tyr 


Thr 


Tyr 


Asn 


Asn 


Pro 


Ser 


Phe 


Thr 


Arg 


Gly 


Phe 






115 










120 










125 




Thr 


lie 


Thr 


Pro 


Met 


Tyr 


Lys 


His 


Leu 


Gly 


Ala 


His 


Asp 


Phe 


His 


Thr 




130 










135 










14 0 










Tyr 


Gin 


He 


Thr 


Gly 


Thr 


Trp 


Tyr 


Ilet 


His 


Phe 


Leu 


Asp 


Gly 


Leu 


Leu 


145 










150 










155 








160 


Thr 


Phe 


Asn 


Gly 


Phe 


Leu 


Asp 


Leu 


Trp 


Gly 


Phe 


Pro 


Gin 


Glu 


Asn 


Pro 










165 










170 










175 




lie 


Gly 


Gly 


Pro 


Val 


Leu 


Lys 


Glu 


Gly 


Asp 


Lys 


Phe 


Val 


Phe 


Leu 


Ser 








180 










185 










190 






Glu 


Pro 


Gin 


Phe 


Trp 


He 


Asn 


Leu 


Asn 


Arg 


He 


Lys 


Gly 


He 


Asp 


Lys 






195 










200 










205 








Asp 


Phe 


Asn 


Leu 


Ser 


He 


Gly 


Thr 


Glu 


Met 


Glu 


He 


Ser 


Arg 


Asn 


Phe 




210 










215 










220 








Ala 


Arg 


Ilet 




Lys 


Phe 


Ser 


Cys 


He 


Pro 


Thr 


Leu 


Ala 


Val 


Lys 


Trp 












230 










235 










24 0 


Thr 


Phe 


Asn 





























10 



15 



20 



25 



30 (2) IMFORMATIOIJ FOR SEQ ID NO: 366 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 35 amino acids 

(B) TYPE; amino acid 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



ivi) ORIGIHAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(iK) FEATURE: 
45 (A) NWIE/KEY: mis cofeature 

(B) LOCATION 1 . . . 235 



50 





(xi 


SEQUENCE DESCRIPTIOH: SEQ ID NO: 366 










Met 


Lys 


Thr 


He 


Ser 


Lys 


Asn 


His 


Ala 


Ala 


Arg 


He 


Cys 


Ala 


Ala 


He 


1 








5 










10 










15 




Ala 


Leu 


Phe 


Ala 


Val 


Cys 


Asn 


Gly 


Arg 


He 


Ala 


Ala 


Gin 


Asp 


Phe 


Leu 








20 










25 










30 






Tyr 


Glu 


He 


Gly 


Gly 


Gly 


Phe 


Gly 


Ala 


Ala 


Gin 


Tyr 


Phe 


Gly 


Asp 


Ala 






35 










40 










45 






Asn 


Arg 


Gly 


Leu 


Phe 


Gly 


Ser 


Ser 


Gly 


Val 


Gly 


Leu 


Glu 


Leu 


Val 


Gly 




50 










55 








60 










Arg 


Tyr 


Asn 


Tyr 


Asn 


Phe 


Arg 


Trp 


Ala 


Phe 


Ser 


Thr 


Met 


Leu 


Asp 


Trp 


65 










70 










75 










80 


Arg 


Thr 


Leu 


Arg 


Gly 


Asp 


Thr 


Asp 


Lys 


Ser 


Gly 


Asn 


Val 


Phe 


Pro 


Asp 










85 










90 










95 




Phe 


Ala 


Gin 


Ala 


Asp 


Phe 


Lys 


Val 


Gly 


Leu 


Thr 


Gin 


Leu 


His 


Val 


Arg 








100 










105 










110 






Ser 


Glu 


Phe 


Asn 


Phe 


Leu 


Pro 


Tyr 


Ser 


Asp 


Gly 


Tyr 


Lys 


Tyr 


Leu 


Gly 






115 










120 










125 




Thr 


Ala 


Arg 


Leu 


Ser 


Pro 


Tyr 


Val 


Ala 


Ala 


Gly 


Leu 


Ser 


Leu 


Gly 


Phe 




130 










135 








140 








Ala 


Ser 


Gly 


Ala 


Lys 


Gly 


Ser 


Ala 


Phe 


Ala 


Pro 


Gly 


He 


Thr 


Ala 


Gly 


145 










150 










155 










160 


Met 


Gly 


Val 


Lys 


Tyr 


Lys 


Leu 


Lys 


Pro 


Arg 


He 


Asn 


Val 


Gly 


He 


Glu 










165 










170 










175 




Tyr 


Ser 


Phe 


Thr 


Gly 


Leu 


Leu 


Thr 


Asp 


Ala 


Leu 


Asp 


Ala 


Leu 


Thr 


Asp 








180 










185 










190 






Lys 


Ser 


Val 


Trp 


Leu 


Glu 


Asp 


Pro 


Tyr 


Lys 


He 


Asn 


Asp 


Ser 


Trp 


Val 






195 










200 










205 
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Lys Aen Lys Asp Ala Thr Gly Ala Leu Val Leu Arg lie Thr Tyr Asp 

210 215 220 

Phe Gly Leu Arg Lys Thr Phe Cys Asn Lys Gin 

225 230 235 

(2) IllFORI-lATIOrJ FOR SEQ ID NO: 367 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 436 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

15 (iii) HYPOTHETICAL: YES 

(vi) ORIGIHAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

20 (ix) FEATURE: 

(A) NAIIE/KEY: misc__f eatu re 

(B) LOCATION 1. . .436 



25 



30 



40 



60 



70 



(xi) SEQUEMCE DESCRIPTION: SEQ ID MO:367 



A^rn II- Met Tyr Lys Asp Tyr Lys Gly Leu Tyr Ala Ser Leu Arg Trp 
i 5 10 15 

Tyr Ala Leu lie lie Gly Leu Leu Phe Ala Ala Asp Gly lie Gin Ala 

20 25 30 

Gin Asn Asn Asn Phe Thr Glu Ser Pro Tyr Thr Arg Phe Gly Leu Gly 

35 40 45 

Arg Leu Gly Glu Arg Thr Thr lie Ser Gly His Ser Met Gly Gly Leu 

50 55 60 

Gly Val Gly Leu Arg Gin Gly Thr Tyr Val Asn Ala Val Asn Pro Ala 
o5 65 70 75 80 

Ser Tyr Ser Ala Val Asp Ser Met Thr Phe lie Phe Asp Phe Gly Ala 

85 90 95 

Ser Thr Gly He Thr Trp Tyr Ala Glu Asn Gly Lys Lys Asp Asn Arg 

100 105 110 

Lys Met Gly Asn He Glu Tyr Phe Ala Met Leu Phe Pro He Ser Lys 

115 120 125 

Ser He Ala Met Ser Ala Gly Val Leu Pro Tyr Ser Ala Ser Gly Tyr 

130 135 140 

Gin The Gly Ser Val Asp Gin Val Glu Gly Gly Ser Val Gin Tyr Thr 
45 145 150 155 160 

Arg Lys Tyr Leu Gly Thr Gly Asn Leu Asn Asp Leu Tyr Val Gly He 

165 170 175 

Gly Ala Thr Pro Phe Lys Asn Phe Ser He Gly Ala Asn Ala Ser Ser 
180 1B5 190 

OU Leu Phe Gly Arg Phe Thr His Ser Arg Gin Val He Phe Ser Thr Glu 
195 200 205 

Ala Pro Tyr Asn Pro Val His Leu Ser Thr Leu Tyr Leu Lys Ala Ala 

210 215 220 

Lys Phe Asp Phe Gly Met Gin Tyr His Leu Leu Leu Lys Ser Asp Arg 
55 225 230 235 240 

Ser Leu Val He Gly Ala Val Tyr Ser Pro Arg Val Lys Met His Ser 

245 250 255 

Glu Leu Thr Gin He Lys Asn Gin Val Gin Asn Gly Val Val Val Glu 

260 265 270 

Ser Glu Thr Gin Glu Tyr He Lys Gly Met Asp Tyr Tyr Thr Leu Pro 

275 280 285 

His Thr Leu Gly He Gly Phe Ser Tyr Glu Lys Lys Asp Lys Leu Leu 

290 295 300 

Leu Gly Ala Asp Val Gin Tyr Ser Ly.s Trp Lys Gly Glu Lys Phe Tyr 
o5 305 310 315 320 

Lys Ser Asp Cys Lys Phe Gin Asp Arg He Arg Val Ser Leu Gly Gly 

325 350 335 

Glu He He Pro Asp He Asn Ala Val Gly Met Trp Pro Lys Val Arg 

340 345 350 

Tyr Arg Phe Gly Leu His Gly Glu Asn Ser Tyr Leu Lys Val Pro Thr 

355 360 365 

Lys Gly Gly Val Tyr Gin Gly Tyr His He Val Gly Ala Val Phe Gly 

370 375 380 

He Gly He Pro Leu Asn Asp Arg Arg Ser Phe Val Asn Val Ser Leu 
/O 385 390 395 400 
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Glu Tyr Asp Arg Leu He Pro Lys Glu Gly Met He Lys Glu Asn Ala 

405 410 415 

Leu Lys Leu Thr Phe Gly Leu Thr Phe Asn Glu Ser Trp Phe Lys Lys 
420 425 430 

5 Leu Lys Leu Asn 
435 

(2) INFORMATION TOR SEQ ID 110:368 

10 (1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 945 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



20 



25 



30 



40 



45 



50 



60 



( V i ) OR I G I HAL SOU RC E : 

(A) ORGAMISM: Porphyromonas gingiva lis 

(ix) FEATURE: 

(A) HAI-IE/KEY: misc_feature 

(B) LOCATION 1 ... 945 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368 



Asn Thr He Ser Glu Asn Gly Asn Asp Ser Thr Tyr Phe Ser Phe Gin 
15 10 15 

Thr Phe Leu Met Arg Ser He Tyr Gin Leu Leu Leu Ser He Leu Leu 

20 25 30 

Ala Ser Leu Gly Phe Val Gly Leu Glu Ala Gin Gin Ala Gly Val Ala 

35 40 45 

Gly Arg Val Leu Asp Glu Glu Gly Asn Pro Met He Gin Ala Asn Val 
35 50 55 60 

Gin Leu Val Gin Ser Thr Gly Gin Val Ala Val Ala Ala Gly Ala Thr 
^5 70 75 80 

Asn Glu Lys Gly Leu Phe Ser Leu Lys Thr Ser Gin Glu Gly Asp Tyr 

85 90 95 

He Leu Arg Val Ser Tyr Val Gly Tyr Thr Thr His Asp Glu Lys He 

100 105 110 

Ser Leu Arg Asn Gly Gin Thr He Thr Leu Lys Asp He Ser Met Asn 

115 120 125 

Glu Asp Ala Arg Leu Leu Gin Ser Val Thr Val Gin Ala Lys Ala Ala 

130 135 140 

Glu Val Val Val Arg Asn Asp Thr Leu Glu Phe Asn Ala Gly Ser Tyr 
145 150 155 160 

Thr Val Ala Gin Gly Ala Ser He Glu Glu Leu He Lys Lys Leu Pro 

165 170 175 

Gly Ala Glu He Gly Ser Asp Gly Lys He Thr He Asn Gly Lys Asp 

180 185 190 

He Ser Lys He Leu Val Asp Gly Lys Glu Phe Phe Ser Lys Asp Pro 

195 200 205 

Gin Val Ala He Lys Asn Leu Pro Ala Asp Met Val Asn Lys Val Gin 
55 210 215 220 

Val Leu Asn Lys Leu Ser Glu Leu Ser Arg Met Ser Gly Phe Asp Asp 
225 230 235 240 

Gly Glu Glu Glu Thr Val He Asn Leu Thr Val Lys Pro Glu Lys Lys 

245 250 255 

Lys Gly Leu Phe Gly Thr Leu Gin Ala Gly Tyr Gly Thr Asp Gin Arg 

260 265 270 

Tyr Met Ala Gly Gly Asn Val Asn Arg Phe Asp Gly Asn Lys Gin Trp 

275 280 285 

Thr Leu He Gly Ser Ala Asn Asn Thr Asn Asn Met Gly Phe Ser Glu 
b5 290 295 300 

Met Asp Ser Glu Met Gly Ser Met Thr Phe Phe Ser Pro Gin Gly Glv 
305 310 315 320 

Gly Arg Arg Gly Phe Gly Asn Ser Gly Gly Val Thr Ser Ser Ser M-t 
7n 330 335 

/U Leu Gly Gly Asn Phe Ser Val Glu Phe Ser Ser Ala Leu Asn Thr Gly 
340 345 350 

Gly Asp Ala Arg Tyr Gly Tyr Asn Asp Lys Ala He Glu Thr Thr Lys 

355 360 365 

Arg Val Glu Asn He Leu Ala Glu Gly Asn Thr Tyr Met Asp Glu Asn 
370 375 380 



75 
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lie Leu Glu Arg Ser Phe Ser His Asn Gly Gin Ala Arg Phe Arg Met 
385 390 395 ^00 

Gin Trp Lys Pro Ser Glu Arg Thr Glu Val Val Phe Glu Pro Asp Leu 
405 410 415 

5 Ser He Ser Lys He Asp Gly Phe Phe Asn Asp Thr Tyr Glu Thr Lys 
<i20 425 430 

Asp Ala Thr Gly He Ser He Asn Lys Gly Ser He His Gin Thr Thr 

435 440 445 

Gin Gly Asn Asn Phe Arg Leu Asn Gly Glu Leu Asp He Ser His Lys 
10 450 455 460 

Leu Asn Asp Glu Gly Arg Thr He Ser Ala Ser Val Ser Gly Gly Leu 
465 470 475 480 

Thr Asp Glu Asp Gly Asp Gly He Tyr Gin Ala Val Leu Gin Ser Val 
485 490 495 

15 Glu Thr Asn Gin Lys Gin Phe Asn Asp Asn Ser Asn Leu Gin Tyr Arg 
500 505 510 

Leu Arg Leu Ser Tyr Val Glu Pro Leu Gly Lys Asn Tyr Phe Ala Gin 

515 520 525 

Ala He Leu Asn Arg Arg Phe Ser Arg Arg Asn Ser Asp Arg Glu Val 
20 530 535 540 

Tyr Arg Leu Gly Asp Asp Gly Gin Tyr Ser He Leu Asp Ser Gin Tyr 
545 550 555 560 

Gly Leu Ser Tyr Ser Asn Glu Phe Thr Gin Tyr Arg He Gly Leu Asn 
565 570 575 

25 Leu Lys Lys He Ala Lys Thr Trp Asp Tyr Thr Val Gly Phe Asn Val 
580 585 590 

Asp Pro Asn Arg Thr Val Ser Tyr Arg Ser Val Ala Gly Val Glu Gin 

595 600 605 

Asp Lys Leu Ala Phe Asn Arg Val Asn Leu Ser Pro Met Leu Arg He 
30 610 615 620 

Asn Tyr Lys Pro Ser Arg Thr Thr Asn Leu Arg Val Asp Tyr Arg Gly 
625 630 635 640 

Arg Thr Thr Gin Pro Ser He Asn Gin He Ala Pro Val Gin Asp He 
645 650 655 

35 Thr Asn Pro Leu Phe Val Thr Glu Gly Asn Pro Gly Leu Lys Pro Ser 
660 665 670 

Tyr Ser Asn Asn Val Met Ala Met Phe Ser Asp Phe Asp Ala Lys Ser 

675 680 685 

Gin Arg Ala Phe Asn He Val Phe Phe Gly Asn Tyr Thr Phe Asp Asp 
40 690 695 700 

He Val Pro Asn Thr His Tyr Asp Pro Ser Thr Gly He Ara Thr Thr 
"705 710 715 ' 720 

Arg Tyr Glu Asn Ala Ser Gly Thr Trp Gin Ala Asn Leu His Gly Thr 
725 730 735 

45 Leu Ser Leu Pro Leu Lys Asn Arg Ala Phe Ser Phe Arg Met Ser Leu 
740 745 750 

Phe Asn Arg Leu Ala Glu Gly Gin Ser Phe He Asn Asp Asp Lys Asn 

"755 760 765 

Lys Ala Leu Ser Phe Arg Thr Arg Glu Arg Leu Thr Leu Thr Tyr Arg 
50 770 775 700 

Asn Asn Trp He Asp Thr Ser He Gly Gly Asn He Gly Phe Tyr Met 
785 790 795 800 

Ala Asn Asn Ser Leu Ser Gly Gin Lys Asp Ser Arg Thr Tyr Asp Phe 
805 810 815 

55 Gly Gly Asn Tyr Gin Val Ala Leu Thr Leu Pro Tyr Gly Phe Arg He 
820 825 830 

Asp Ser Asp Val Glu Tyr Asn Thr Asn Ser Gly Tyr Ser Gly Gly Phe 

835 840 845 

Ser Leu Asp Glu Trp Leu Trp Asn Ala Ser Leu Ser Tyr Ser Phe Leu 
OO 850 855 860 

Arg Asp Lys Ala Gly Thr Leu Arg Val Asn Gly Tyr Asp He Leu Gly 
865 870 875 880 

Gin Arg Ser Ser He Ser Arg Ser Ala Ser Ala He Asn He Glu Glu 

885 890 895 

Ser Met Ser Asn Thr He Gly Arg Tyr Val Met Val Asp Phe He Tyr 

900 905 910 

Arg Phe Asn Ala Phe Ser Gly Gly Gly Ser Arg Ser Asp His Gin Arg 

915 920 925 

Gly Asn Met Asn Arg Pro Gly Pro Pro Phe Gly Gly Gly Arg Arg Pro 
/O 930 935 940 

Ser 
945 

(2) I M FORMATION FOR SEQ ID 110:369 



65 



75 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

10 (vi) ORIGINAL SOURCE: 

(A) ORGAHISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAIIE/K'EY: misc_feature 
15 (B) LOCATIOH 1...412 

(xi) SEQUENCE DESCRIPTION; SEQ ID HO: 369 



20 



Gly 


Glu 


Tyr 


Pro 


Ala 


Asn 


Ser 


Asn 


Asp 


Lys 


Lys 


Glu 


Met 


Val 


Met 


Lys 


1 








5 










10 










15 




Leu 


He 


Lys 


Arg 


Ser 


Leu 


Leu 


Leu 


Leu 


Gly 


Ala 


Val 


Leu 


Leu 


He 


Thr 








20 










25 










30 






Leu 


Pro 


Ala 


Tyr 


Ser 


Gin 


Asn 


Asp 


Asp 


He 


Phe 


Glu 


Asp 


Asp 


He 


Tyr 






35 










40 










45 








Thr 


Ser 


Arg 


Lys 


Glu 


He 


Arg 


Lys 


Gin 


Asn 


Gin 


Val 


Lys 


Asp 


Trp 


Gin 




50 










55 










60 










Asn 


Gin 


Glu 


Asp 


Gly 


Tyr 


Gly 


Asp 


Asp 


Thr 


Glu 


Tyr 


Thr 


Val 


Ala 


Ser 


65 










70 










75 










80 


Asp 


Arg 


Asp 


He 


Asp 


Ala 


Tyr 


Asn 


Arg 


Arg 


Asp 


Gly 


Gin 


Ser 


Tyr 


Asp 










85 










90 










95 




Gly 


Lys 


Lys 


Leu 


Ser 


Lys 


Asp 


Lys 


Lys 


Arg 


Asp 


Ser 


Thr 


Arg 


Ser 


Ser 








100 










105 










110 






Val 


Pro 


Gly 


Arg 


Tyr 


Ser 


Arg 


Arg 


Leu 


Ala 


Arg 


Phe 


Tyr 


Lys 


Pro 


Asn 






115 










120 










125 








Thr 


He 


Val 


He 


Ser 


Gly 


Ala 


Asp 


Asn 


Val 


Tyr 


Val 


Thr 


Asp 


Asp 


Gly 




130 










135 










140 










Glu 


Tyr 


Phe 


Val 


Tyr 


Gly 


Asp 


Glu 


Tyr 


Tyr 


Asp 


Asp 


Ala 


Ser 


Ser 


Val 


145 










150 










155 










160 


Asn 


He 


Tyr 


He 


Asn 


Ser 


Pro 


Trp 


Cys 


Asp 


Pro 


Phe 


Pro 


Tyr 


Thr 


Ser 










165 










170 










175 




Trp 


Tyr 


Pro 


Ser 


Phe 


Ser 


Gly 


Trp 


Tyr 


Asn 


Tyr 


Thr 


Trp 


Asn 


Tyr 


Pro 








180 










185 










1 90 






Trp 


Phe 


Tyr 


Tyr 


Gly 


Ser 


His 


He 


Gly 


Trp 


Gly 


Gly 


Tyr 


Tyr 


Pro 


Gly 






195 










200 










205 








Tyr 


Asn 


Trp 


Tyr 


Trp 


Ser 


Tyr 


Tyr 


Tyr 


Asp 


Pro 


Phe 


Tyr 


Asn 


Pro 


Tyr 




210 










215 










220 










Gly 


He 


Gly 


Met 


Gly 


Trp 


Gly 


Tyr 


Pro 


Tyr 


Gly 


Trp 


Gly 


Ser 


Tyr 


Tyr 


225 










230 










235 










240 


Gly 


Trp 


Gly 


Gly 


Tyr 


Pro 


Gly 


Val 


He 


His 


His 


Tyr 


His 


His 


Tyr 


Pro 










245 










250 










255 




hys 


Lys 


Thr 


Tyr 


Ser 


Asn 


Gly 


Gin 


His 


Ser 


Gly 


Ala 


Tyr 


Tyr 


Ser 


Tyr 








260 










265 










270 






Gly Arg 


Pro 


Asn 


Arg 


He 


Lys 


Gly 


Gly 


Thr 


Ser 


Gly 


Ala 


Lys 


Leu 


Gly 






275 










280 










285 








Thr 


Gly 


Arg 


Tyr 


Asp 


Arg 


He 


Gin 


Asn 


Ser 


Ser 


Ser 


Gin 


Lys 


Asn 


Lys 




290 










295 










300 










Phe 


Gly 


Leu 


Gin 


Ser 


Asn 


Lys 


Pro 


Asn 


Asn 


Asn 


Leu 


Gin 


Asn 


Val 


Lys 


305 










310 










315 










320 


Ser 


Gly 


Arg 


Thr 


Gly 


Arg 


Ala 


Asn 


Arg 


Asp 


Arg 


Asn 


He 


Glu 


Thr 


Val 










325 










330 










335 




Thr 


Pro 


Asn 


Asn 


Gly 


Gin 


Lys 


Gin 


Asn 


Arg 


Pro 


Val 


Phe 


Gin 


Gin 


Asn 








340 










345 










350 






Gin 


Ser 


Gly 


Asn 


Asp 


Arg 


Pro 


Thr 


Gly 


Arg 


Asn 


He 


Arg 


Ser 


Glu 


Arg 






355 










360 










365 








Gin 


Gly 


Glu 


Asn 


Asn 


Asp 


Arg 


Thr 


Phe 


Ser 


Thr 


Pro 


Ser 


Arg 


Ser 


Asn 




37 0 










375 










380 










Ser 


Asn 


Gly 


Gly 


Phe 


Ser 


Thr 


Pro 


Ser 


Arg 


Ser 


Ser 


Ser 


Gly 


Ser 


Met 


385 










390 








395 










400 


Ser 


Gly 


Gly 


Gly 


Gly 


Arg 


Ser 


Gly 


Arg 


Gly 


Arg 


Asn 


















405 










410 














(2) 


I M FORMAT I Oi l 


FOR 


SEQ 


ID MO: 370 

















25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



(i) SEQUENCE CHARACTERISTICS: 
75 (A) LENGTH: 601 amino acids 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I_> 




wo 99/29870 PCT/AU98/01023 

301/490 



10 



60 



65 



70 



75 



( B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIIJAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAI1E/KEY: misc_feature 

(B) LOCATIOIi 1 . . . 601 



15 (xi) SEQUEMCE DESCRI PTIOIJ : SEQ ID MO:370 

Ser A.sn Ser Ser Ser His Lys Trp Leu lie Tyr Tyr His lie Glu Lys 
1 5 10 15 

Thr Lys Ser He Met He Arcj Lys Leu He Leu Leu Leu Ala Leu Met 
20 20 25 30 

Pro Val Ala Ser Val Ala Phe Ala Val Pro Thr Asp Ser Thr Glu Ser 

35 40 45 

Lys Asp Asn Arg He Leu Thr Ser Met Gin Ser Ser Ser Leu Asn Arg 
50 55 60 

25 Asp Asp Ala Pro Asp Lys Trp Gin Pro Met His Ala Asn Phe Ser He 
65 70 75 80 

Gin Ser Asp Met Leu Leu Ser Thr Ala Gin Lys Ser Lys Asn Thr Trp 

85 90 95 

Phe Gly Asn Ser Tyr He Met Gly He He Lys Asn Asn Tyr Leu Glu 
30 100 105 110 

Phe Gly Ala Arg Phe Glu Asp Leu Tyc Lys Pro Leu Pro Gly His Glu 

H5 120 125 

Pro Glu Met Gly Arg Gly Val Pro His Met Tyr Val Lys Gly Ser Tyr 
130 135 140 

35 His Trp Ala Glu Leu Thr Met Gly Asp Phe Tyr Asp Gin Phe Gly Ser 
I'iS 150 155 160 

Gly Met Val Phe Arg Thr Tyr Glu Glu Arg Asn Leu Gly He Asp Asn 

165 170 175 

Ala Val Arg Gly Gly Arg He Val Leu Thr Pro Phe Asp Gly Val Arg 
40 180 185 190 

Val Lys Gly He Ala Gly Gin Gin Arg Asn Tyr Phe Asp Arg Thr Gly 

195 200 205 

Lys Val Phe Asn Ser Gly Arg Gly Tyr Leu Leu Gly Ser Asp Leu Glu 
210 215 220 

45 Leu Asn Val Glu Arg Trp Ser Ser Ala Met Arg Asp Asn Asp Tyr His 
225 230 235 240 

Leu Ala He Gly Gly Ser Phe Val Ser Lys His Glu Ala Asp Glu Asp 

^•55 250 255 

He Phe Val Gly Val Gly Glu Asp Arg Lys Ara Leu Asn Leu Pro Leu 
50 260 265 " 270 

Asn Val Pre He Met Gly Leu Arg Thr Asn Phe Gin Lys Gly Gly Leu 

275 280 285 

Ala Leu Tyr Ala Glu Tyr Gly Tyr Lys Tyr Asn Asp Pro Ser Ala Asp 
290 295 300 

55 Asn Asp Tyr He Tyr His Asp Gly Gin Ala Ala Leu Leu Ser Ala Ser 
305 310 315 320 

Tyr Ser Lys Lys Gly Met Ser He Leu Leu Gin Ala Lys Arg Cys Glu 

325 330 335 

Asn Phe Ala Phe Arg Ser Lys Arg Ser Ala Gin Leu Thr Pro Leu Met 

3-10 345 350 

He Asn Tyr Met Pro Ala Phe Thr Gin Ala His Thr Tyr Thr Leu Ala 

355 360 365 

Ala He Tyr Pro Tyr Ala Thr Gin Pro Gin Gly Glu Trp Ala Phe Gin 

370 375 380 

Gly Glu Leu Arg Tyr Asn Phe Ala Arg Arg Thr Ala Leu Gly Gly Arg 
385 390 395 400 

Tyr Gly Thr Gly Leu Arg He Asn Val Ser His Val Arg Gly Leu Asp 

405 410 415 

Lys Lys Met Leu Lys Glu Asn Pro Asp Glu Leu He Gly Thr Aso Gly 

420 425 430 

Tyr Thr Val Ser Phe Phe Gly Met Gly Asp Leu Tyr Tyr Ser Asp He 

435 440 445 

Asp Val Glu He Thr Lys Lys Val Ser Pro Gly Phe Asn Phe Thr Leu 

450 455 460 

Thr Tyr Leu Asn Gin He Tyr Asn Asn Lys Val Leu His Gly Ala Ala 
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4 65 










470 










475 










480 




Gl y 


Glu 


Lys 


Pro 


Glu 
485 


Lys 


lie 


Tyr 


Ala 


Asn 
490 


He 


Phe 


Val 


Tyr 


Asp 
4 95 


Gly 




hys 


Tyr 


Lys 


Leu 


Ser 


Asn 


Lys 


Val 


Ala 


Leu 


Arg 


Thr 


Glu 


Leu 


Gin 


Tyr 


5 








500 










505 










510 








Leu 


His 


Thr 


Lys 


Gin 


Asp 


Gin 


Gly 


Asp 


Trp 


He 


Tyr 


Gly Met 


Ala 


Glu 








515 










520 










525 










Leu 


Ser 


lie 


Leu 


Pro 


Ser 


Leu 


Met 


Leu 


Ser 


Leu 


Ser 


Glu 


Gin 


Tyr 


Asn 


10 




530 










535 










540 








lie 


Gly 


Glu 


Thr 


Lys 


Lys 


His 


Tyr 


Val 


Met 


Gly 


Ser 


Val 


Thr 


Tyr 


Thr 




545 










550 










555 








560 




His 


Gly 


Ala 


His 


Arg 
565 


Val 


Ala 


Phe 


Ser 


Ala 
570 


Gly 


Lys 


Thr 


Arg 


Ala 

575 


Gly 


15 


Met 


Asn 


Cys 


Ser 


Gly 


Gly 


Val 


Cys 


Arg 


Val 


Val 


Pro 


Glu 


Thr 


Gin 


Gly 








580 










585 










590 






Phe 


Tyr 


Leu 

595 


Ser 


Tyr 


Ser 


Thr 


Asn 
600 


Leu 

















20 



25 



(2) INFORMATION FOR SEQ ID HO: 371 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



30 (vi) ORIGINAL SOURCE: 

(A) ORGAl^JXSM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAT'lE/KEY: misc_feature 
35 (B) LOCATION 1...252 

(Xi) SEQUENCE DESCRIPTION: SEQ ID IIO:371 

Ara Gly Ser Ser Ser Gly He Ser Ala Arg Gly Arg Asp Met Arg Ser 
40 1 5 10 15 

Leu Phe Leu Ser Ala Leu Arg Ser Ser Ser Leu His Gly Ser Glu Arg 

20 25 30 

Arg Ser Arg He Ser Ser Ser Val Val Met Ser He Arg Gin Lys He 
35 40 45 

45 Arg Leu Phe His Leu Ser Val Cys Ala Gin Thr His Asp His Leu He 
50 55 60 

Glu He His Leu Val Cys He Glu Phe Gly Ala He Asp Thr Asp Glu 
65 70 75 80 

Phe Arg Leu Ser Ser His Ala Tyr Thr Thr Ser Pro Thr His Thr Gly 
50 85 90 95 

Ala He His His Asn Cys He Glu Arg Ser Tyr Gly Arg Tyr Leu Val 

100 105 HQ 

Thr Phe Gly Gin Glu Arg Asn Glu Leu His His His Ser Arg Pro Asp 
115 120 125 

55 Arg Asn Ala Glu Val Tyr Arg Phe Pro Phe Asp Asn Ala Phe His Ser 
130 135 140 

He Arg Tyr Glu Ala Phe Arg Pro He Arg Pro He Val Cys His Asp 
145 150 155 160 

Asp His Phe He Ala He Gly Ser His Leu Phe Phe Lys Asp Asn Gin 
60 165 170 175 

He Phe Ser Ser Gly Ser Gin Tyr Asp Asn Tyr Thr Val Ala Cys Phe 

180 185 190 

Val Glu Ser Leu His Asp Arg Glu Gin Arg Ser His Thr His Thr Ala 
195 200 205 

65 Ser Gly Thr Asn His Cys Ala Asp Leu Phe Asp Met Arg Thr Leu Ser 
210 215 220 

Gin Arg Thr Tyr His He Arg Asp He Val Ala Asp Phe Glu Phe Gly 
225 230 235 240 

Gin Phe Leu Gly Arg Phe Ala His Gly Leu Asn His 
70 245 250 

(2) INFORMATION FOR SEQ ID NO: 372 

(i) SEQUENCE CHARACTERISTICS: 
75 (A) LENGTH: 790 amino acids 



SUBSTITUTE SHEET (Ride 26) (RO/AU) 
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• 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) nOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIHAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) MAIIE/KEY: misc^f eature 

(B) LOCATIOH 1 . . .790 

(xi) SEQUENCE DESCRI PTIOI I : SEQ ID MO: 372 



Leu 


Leu 


Phe 


Ser 


Ser 


Pro 


Leu 


Pro 


Arg 


Met 


Asp 


Arg 


Pro 


Lys 


Pro 


Ser 


1 








5 










10 










15 




Tyr 


He 


Val 


Arg 


He 


Ala 


Ala 


He 


Leu 


Cys 


Leu 


Phe 


Val 


Gly 


Arg 


Pro 








20 










25 










30 






Leu 


Phe 


Ala 


Gin 


Ser 


Tyr 


Val 


Asp 


Tyr 


Val 


Asp 


Pro 


Leu 


He 


Gly 


Thr 






35 










40 










45 








Leu 


Ser 


Ser 


Phe 


Glu 


Leu 


Ser 


Ala 


Gly 


Asn 


Thr 


Tyr 


Pro 


Val 


He 


Gly 




50 










55 










60 








Leu 


Pro 


Trp 


Gly 


Met 


Asn 


Ser 


Trp 


Thr 


Pro 


Met 


Thr 


Gly 


Val 


Pro 


Gly 


65 










70 










75 










8 0 


Asp 


Gly 


Trp 


Gin 


Tyr 


Thr 


Tyr 


Ser 


Ala 


His 


Lys 


He 


Arg 


Gly 


Phe 


Lys 










85 










90 










95 




Gin 


Thr 


His 


Gin 


Pro 


Ser 


Pro 


Trp 


He 


Asn 


Asp 


Tyr 


Gly 


Gin 


Phe 


Ser 








100 










105 










110 






Leu 


Leu 


Pro 


Leu 


Thr 


Ala 


Pro 


Gin 


Lys 


Pro 


Ser 


Ser 


Asn 


Asp 


Ser 


He 






115 










120 










125 








Ala 


Leu 


Thr 


Lys 


Trp 


Cys 


Lys 


Gin 


Leu 


Phe 


Ser 


Asp 


Glu 


Gin 


Thr 


Ser 




130 










135 










140 










Trp 


Phe 


Ser 


His 


Lys 


Ala 


Glu 


Thr 


Ala 


Thr 


Pro 


Tyr 


Tyr 


Tyr 


Ser 


Val 


145 










150 










155 










160 


Tyr 


Leu 


Ala 


Asp 


Tyr 


Asp 


Thr 


Arg 


Val 


Glu 


Met 


Ala 


Pro 


Thr 


Glu 


Arg 










165 










170 










175 




Ala 


Ala 


He 


Phe 


Arg 


He 


Arg 


Tyr 


Ser 


Gly 


Asn 


Thr 


Glu 


Ser 


Gly 


Ser 








180 










185 










190 






Gly 


Arg 


Trp 


Leu 


Arg 


Leu 


Asp 


Ala 


Phe 


Thr 


Gly 


Gly 


Ser 


Glu 


He 


Ser 






195 










200 










205 








He 


Val 


Asp 


Pro 


His 


Thr 


Val 


Val 


Gly 


He 


Ser 


Arg 


Lys 


Asn 


Ser 


Gly 




210 










215 










220 










Gly 


Val 


Pro 


Ala 


Asn 


Phe 


Ala 


Cys 


Tyr 


Phe 


He 


Leu 


Gin 


Ser 


Asp 


Thr 


225 










230 










235 










240 


Pro 


Met 


Ala 


Asp 


Val 


Leu 


Leu 


Glu 


Thr 


Asp 


Thr 


Gly 


Lys 


Ser 


Asp 


Glu 










245 










250 










255 




Gly 


Thr 


Arg 


Ala 


Trp 


Ala 


Ala 


Cys 


Arg 


Phe 


Asp 


Ser 


Gin 


Glu 


Val 


Thr 








260 










265 










270 






Val 


Arg 


Val 


Ala 


Ser 


Ser 


Phe 


He 


Ser 


Val 


Glu 


Gin 


Ala 


Glu 


Arg 


Asn 






275 










280 










285 








Leu 


Ala 


Glu 


Val 


Lys 


Gly 


Gin 


Ser 


Phe 


Asp 


Arg 


He 


Arg 


Leu 


Ala 


Gly 




290 










295 










300 










Arg 


Glu 


Ala 


Trp 


Asn 


Lys 


Val 


Leu 


Gly 


Arg 


He 


His 


Val 


Glu 


Gly 


Gly 


305 










310 










315 










320 


Thr 


hys 


Asp 


Glu 


Arg 


Thr 


Thr 


Phe 


Tyr 


Ser 


Ala 


Leu 


Tyr 


Arg 


Cys 


Leu 










325 










330 










335 




Leu 


Phe 


Pro 


Arg 


Arg 


Phe 


Tyr 


Glu 


Glu 


Asp 


Ala 


Ser 


Gly 


Asn 


Phe 


Val 








340 










345 










350 






His 


Tyr 


Ser 


Pro 


Tyr 


Asn 


Gly 


Glu 


Val 


Leu 


Pro 


Gly 


Tyr 


Leu 


Tyr 


Thr 






355 










360 










365 








Asp 


Thr 


Gly 


Phe 


Trp 


Asp 


Thr 


Phe 


Arg 


Ala 


Leu 


Phe 


Pro 


Leu 


Leu 


Asn 




370 










375 










380 










Leu 


Leu 


Tyr 


Pro 


Asp 


Glu 


Asn 


He 


Lys 


He 


Gin 


Glu 


Gly 


Leu 


Leu 


Asn 


385 










390 










395 








400 


Val 


Tyr 


Arg 


Glu 


Ser 


Gly 


Phe 


Phe 


Pro 


Glu 


Trp 


Ala 


Ser 


Pro 


Gly 


His 










405 










410 










415 




Arg 


Asp 


Cys 


Met 


He 


Gly 


Asn 


Asn 


Ser 


Ala 


Ser 


Val 


Leu 


Ala 


Asp 


Ala 








420 










425 










430 






Tyr 


Leu 


Lys 


Gly 


Val 


Arg 


Val 


Glu 


Asp 


Thr 


Arg 


Thr 


Leu 


Met 


Asn 


Gly 






435 










440 










445 






Leu 


Leu 


His 


Ala 


Thr 


Lys 


Ala 


Val 


His 


Pro 


Lys 


He 


Ser 


Ser 


Thr 


Gly 




450 










455 










460 








Arg 


Lys 


Gly 


Trp 


Glu 


Trp 


Tyr 


Asn 


Ser 


Leu 


Gly 


Tyr 


Val 


Pro 


Ala 


Asp 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I_> 





PCT/AU98/01023 



WO 99/29870 



304/ 490 



4 65 



470 



415 



480 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



Ala 


Gly 


He 


Asp 


Glu 


Ser 


Ala 


Ala 


Arg 


Thr 


Leu 


Glu 


Tyr 


Ala 


Tyr 


Asn 










4 85 










4 90 










4 95 




Asp 


Trp 


Cys 


He 


Leu 


Arg 


Leu 


Gly 


Arg 


Thr 


Leu 


Gl y 


Trp 


Asp 


Arg 


Ala 








500 










505 










510 






Ala 


Leu 


Asp 


Thr 


Leu 


Ala 


His 


Arg 


Ser 


Met 


Asn 


Tyr 


Arg 


His 


Leu 


Phe 






515 










520 










52 5 








Asp 


Pro 


Glu 


Thr 


Lys 


Leu 


Met 


Arg 


Gly 


Arg 


Asn 


Gin 


Asp 


Gly 


Ser 


Phe 




530 










535 










54 0 










Arg 


Thr 


Pro 


phe 


Ser 


Pro 


Phe 


Lys 


Trp 


Gly 


Asp 


Val 


Phe 


Thr 


Glu 


Gly 


545 










550 










555 










560 


Asn 


Ala 


Trp 


His 


Tyr 


Thr 


Trp 


Ser 


Val 


Phe 


His 


Asp 


Val 


Gin 


Gly 


Leu 










565 










570 










575 




lie 


Asp 


Leu 


Met 


Gly 


Gly Asp 


Arg 


Pro 


Phe 


Val 


Ser 


Met 


Leu 


Asp 


Ser 








580 










585 










590 






Val 


Phe 


Asn 


Thr 


Pro 


Pro 


Met 


Phe 


Asp 


Glu 


Ser 


Tyr 


Tyr 


Gly 


Phe 


Val 






595 










600 










605 








He 


His 


Glu 


lie 


Arg 


Glu 


Met 


Gin 


He 


Ala 


Asp 


I let 


Gly 


Asn 


Tyr 


Ala 




610 










615 










620 










His 


Gly 


Asn 


Gin 


Pro 


He 


Gin 


His 


Met 


He 


Tyr 


Leu 


Tyr 


Asn 


His 


Ala 


626 










630 










635 










64 0 


Gly 


His 


Pro 


Trp 


Lys 


Ala 


Gin 


Glu 


Arg 


Leu 


Arg 


Glu 


Val 


Met 


Gly 


Arg 










64 5 










650 










655 




Leu 


Tyr 


Arg 


Pro 


Thr 


Pro 


Asp 


Gly 


Tyr 


Cys 


Gly 


Asp 


Glu 


Asp 


Asn 


Gly 








660 










665 










670 






Gin 


Thr 


Ser 


Ala 


Trp 


Tyr 


Val 


Phe 


Ser 


Ala 


Leu 


Gly 


Phe 


Tyr 


Pro 


Val 






675 










680 










685 








Thr 


Pro 


Ala 


Thr 


Asp 


Gin 


Tyr 


Val 


Leu 


Gly 


Ser 


Pro 


He 


Phe 


Ser 


Lys 




690 










695 










7 00 










Val 


He 


Leu 


Ser 


Phe 


Pro 


Asp 


Gly 


His 


Lys 


Thr 


Val 


Leu 


His 


Ala 


Pro 


705 










710 










715 










720 


Ala 


Asn 


Ser 


Ala 


Asp 


Thr 


Pro 


Tyr 


He 


Arg 


Ser 


He 


Ser 


Val 


Glu 


Gly 










725 










730 










735 




Lys 


Glu 


Trp 


Ser 


Cys 


Asn 


Tyr 


Leu 


Thr 


His 


Glu 


Gin 


Leu 


Arg 


Ser 


Ser 








740 










745 










750 






Ala 


Ser 


He 


Gin 


Trp 


Met 


Met 


Asp 


Thr 


Lys 


Pro 


Asn 


Tyr 


Asn 


Arg 


Gly 






755 










760 










765 








Met 


Lye 


Glu 


Ser 


Asp 


Arg 


Pro 


Tyr 


Ser 


Phe 


Ser 


Thr 


Glu 


Gin 


Gin 


Arg 




770 










775 










780 










Arg 


Ala 


Asn 


His 


Ser 


Asn 






















785 










790 






















(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 37 3 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAI'IISM: Porphyromonas gingiva lis 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1 . . . 286 



(::i) SEQUENCE DESCRIPTION: SEQ ID NO: 37 3 



He 


Cys 


Gly 


Ser 


Lys 


Met 


Asn 


Leu 


Ser 


Gly 


Leu 


Gin 


Ser 


Phe 


Thr 


Met 


1 








5 










10 










15 




Met 


Lys 


Ser 


Met 


Arg 


Ser 


Val 


Leu 


Leu 


Leu 


Leu 


f he 


Pro 


Leu 


Ser 


Leu 








20 










25 










30 






He 


Thr 


Ala 


Leu 


Gly 


cys 


Ser 


Asn 


Asn 


Lys 


Ala 


Ala 


Glu 


Ser 


Lys 


Ser 






35 










40 










45 








Val 


Ser 


Phe 


Asp 


Ser 


Ala 


Tyr 


Leu 


Glu 


Arg 


Tyr 


He 


Pro 


Leu 


Arg 


Ala 




50 










55 










60 










Asp 


He 


Asp 


Thr 


Pro 


Ser 


Leu 


His 


Val 


Met 


He 


Ser 


Tyr 


Val 


Tyr 


Pro 


65 










70 










75 










80 


Ser 


Gly 


Asp 


Asp 


Met 


Leu 


Thr 


Glu 


He 


Phe 


Asn 


Gly 


Leu 


Leu 


Phe 


Gly 










85 










90 










95 




Asp 


Ser 


Leu 


Met 


Asp 


Ser 


Ser 


Ser 


Pro 


Glu 


Asn 


Ala 


Met 


Glu 


Gly 


Tyr 



SUBSTITUTE 



SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1J_> 



wo 99/29870 



PCT/AU98/01023 



305/490 











100 










105 










110 








Al a 




Me t 
115 


Leu 


Gly 


Glu 


sp 


Tyr 
12 0 


Arg 


Car- 

OS r 


sn 


Ac 

sn 


Ala 
125 


Glu 


Ala 


sn 




Leu 


Gin 


Gly 


L*^u 


ro 


Ser 


Asp 


Leu 


Leu 


MSp 


1 yr 


He 


Tyr 


ys 


Gin 


Glu 


5 




130 










135 










140 












Asn 


Thr 


He 


Ala 


Tyr 


Cys 


Asp 


Thr 


Gly 


Leu 


He 


Ser 


Thr 


Arg 


He 


Asn 




14S 










150 










155 










160 




Thr 


Tyr 


Thr 


Tyr 


Glu 
165 


Gly 


Gly 


Ala 


His 


Thr 
170 


Glu 


Asn 


Thr 


Val 


Arg 
175 


Phe 


10 


Ala 


Asn 


He 


Leu 
180 


Arg 


Thr 


Thr 


Gly 


Lys 
185 


Val 


Leu 


Glu 


Glu 


Arg 
1 90 


Asp 


He 




Phe 


Lys 


He 
195 


A^p 


Tyr 


Ala 


Glu 


Arg 

200 


Leu 


Ser 


Ala 


Leu 


He 
205 


He 


Gly 


Gin 




Leu 


Val 


His 


Asp 


Phe 


Gly 


Lys 


Thr 


Thr 


Pro 


Ala 


Glu 


Leu 


Asp 


Ala 


He 


15 




210 










215 










220 












Gly 


Phe 


Phe 


Asn 


Ala 


Glu 


Glu 


He 


Gin 


Pro 


Asn 


Gly Asn 


Plie 


Met 


He 




225 










230 










235 










240 




Asp 


Asp 


Lys 


Gly 


Leu 
245 


Thr 


Tyr 


cys 


Phe 


Asn 
250 


Glu 


Tyr 


Gin 


He 


Ala 
255 


Ala 


20 


Tyr 


Ala 


Arg 


Gly 
260 


Ala 


Val 


Tyr 


Val 


Arg 
265 


Leu 


Gly 


Tyr 


Asp 


Val 
270 


Leu 


Ala 




Pro 


Leu 


Leu 

275 


Arg 


Asp 


Asp 


Ser 


Pro 
280 


Leu 


Lys 


Arg 


Tyr 


Leu 
285 


Pro 







25 (2) lUFORHATIOn FOR SEQ ID HO: 37 4 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 37 8 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 



35 



45 



50 



55 



60 



65 



70 



75 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAt-IISM: Porphyromonas gingival is 



(ix) FEATURE: 
40 (A) HAME/KEY: misc_feature 

(B) LOCATIOll 1 ... 378 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 4 



Gly 


He 


He 


Ser 


Val 


Leu 


Ser 


His 


Val 


Val 


Asp 


Arg 


Pro 


Gin 


Trp 


Gly 


1 








5 










10 










15 




Ala 


Ser 


Pro 


Glu 


Ala 


Ala 


Gly 


Thr 


His 


Ser 


Val 


Tyr 


Ser 


He 


Leu 


His 








20 










25 










30 






Pro 


Ser 


Ala 


Gly 


He 


He 


Arg 


He 


Arg 


Ser 


Met 


Gly 


He 


He 


Ser 


Ala 






35 










40 










45 








Cys 


Arg 


He 


Ala 


He 


Leu 


Ala 


Gly 


Lys 


His 


Pro 


Arg 


Ser 


Gly 


Leu 


Ser 




50 










55 










60 










Arg 


Ala 


Asn 


Val 


Gly 


He 


Leu 


Ser 


Tyr 


Asn 


Pro 


Glu 


Asn 


Thr 


Pro 


Glu 


65 










70 










75 










80 


Lys 


Lys 


Arg 


Lys 


Leu 


Gin 


Glu 


Lys 


Asn 


Val 


Phe 


Leu 


Gin 


He 


Arg 


Leu 










85 










90 










95" 




Arg 


Gin 


Ser 


Phe 


Asn 


Asn 


Leu 


He 


Pro 


Ser 


Leu 


Pro 


Phe 


Arg 


He 


Asp 








100 










105 










110 






Asn 


Thr 


Lys 


Lys 


He 


Thr 


Glu 


Met 


Lys 


Lys 


Thr 


Thr 


Leu 


Thr 


Gly 


Ser 






115 










120 










125 








He 


Cys 


Ala 


Leu 


Leu 


Leu 


Phe 


Leu 


Gly 


Leu 


Ser 


Ala 


Asn 


Ala 


Gin 


Ser 




130 










135 










140 










Lys 


Leu 


Lys 


He 


Lys 


Ser 


He 


Glu 


Ala 


Ala 


Thr 


Thr 


Phe 


Ser 


Ser 


Ala 


145 










150 










155 










160 


Thr 


Ala 


Gly 


Asn 


Gly 


Phe 


Gly 


Gly 


Asn 


He 


Phe 


Gly 


Met 


Asp 


Met 


Ser 










165 










170 










175 




He 


Arg 


Met 


Arg 


Val 


His 


His 


Ser 


He 


Leu 


Pro 


Glu 


Gly 


Leu 


Asp 


Phe 








180 










185 










190 






Ser 


Val 


Gly 


He 


His 


Glu 


Arg 


Arg 


Ala 


His 


Trp 


Glu 


Glu 


Ala 


Gly 


Ser 






195 










200 










205 








Pro 


Lys 


Leu 


Met 


Tyr 


Thr 


Asn 


Val 


Pro 


Ser 


He 


He 


Gly 


He 


Val 


Glu 




210 










215 










220 










Lys 


Val 


He 


Val 


Phe 


Glu 


Asp 


Ala 


Glu 


Asp 


Phe 


Phe 


Asp 


Lys 


Lys 


Ala 


225 










230 










235 










240 


Leu 


Gly 


Arg 


Phe 


Leu 


He 


Ser 


Leu 


Gly 


He 


Ser 


Tyr 


Thr 


Lys 


His 


Leu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID- <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



306/490 



10 



15 



20 



25 



35 



40 



45 



50 



55 



60 



65 



70 



75 











2*i 5 










2 50 










O R c: 
^ D 3 




Giy 


Ala 


Tyr 


Trp 


Gl V 


Tro 


Thr 


Asn 


Asp 


Al a 


His 


He 


Leu 
eu 


Phe 


Se r 


Pro 








260 










2 65 
















He 


Pro 


LVE 


Ser 


Lys 


Val 


His 


I yr 


Asp 


Thr 


1 yr 


Thr 


Arg 


r\± a 


ox y 


Ser 






275 










280 










285 






Asp 


Leu 


Val 


Leu 


Gin 


Ser 


Glu 


Asp 


Val 


Ala 


Thr 


Val 


Ser 


Asn 


Gly 


Phe 




290 










295 










300 








Ser 


Pro 


Gly 


lie 


Gly 


Leu 


Lys 


Ser 


Ser 


He 


Trp 


Trp 


Lys 


Met 


Pro 


He 


305 










310 










315 








320 


Lys 


Ser 


Lys 


Tyr 


Asp 


Phe 


Arg 


Leu 


Gly 


Phe 


Ser 


Leu 


Gly 


Tyr 


Glu 


Tyr 










325 










330 










335 




Leu 


Asn 


Leu 


Leu 


Tyr 


Pro 


Tyr 


Arg 


Asn 


Phe 


Lys 


Leu 


Asp 


Gly 


Asn 


Lys 








340 










345 










350 




Pro 


Leu 


Ser 


Ala 


Leu 


Ser 


Pro 


Arg 


Met 


Asn 


His 


He 


Gly 


His 


Val 


Gly 






355 










360 










365 






Phe 


Asn 


Phe 


Thr 


Val 


Gly 


Leu 


Trp 


Thr 


Asn 
















370 










375 




















(2) 


I M FORMAT I on 


FOR 


SEQ 


ID NO: 37 5 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LEllGTH: 1269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) IIOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

30 (vi) ORIGINAL SOURCE: 

(A) ORGAliISM: Porphyromonas gingival is 



(ix) FEATURE: 

(A) NAI IE/KEY: misc_f eature 

(B) LOCATION 1...1269 

(Ki) SEQUENCE DESCRIPTION: SEQ ID 110:37 5 



Val 


Lys 


Cys 


Met 


Gly 


Lys 


Tyr 


Lys 


Arg 


Ala 


Lys 


Tyr 


Arg 


Tyr 


Trp 


Leu 


1 








5 










10 










15 




Phe 


Pro 


Phe 


Cys 


Ser 


Asp 


Tyr 


Tyr 


Thr 


Phe 


Glu 


Gly 


Val 


Thr 


Phe 


Leu 








20 










25 










30 






Cys 


Ala 


Ser 


Asp 


Asp 


Met 


Thr 


Thr 


Lys 


Lys 


Pro 


Gin 


Ala 


He 


Leu 


Asp 






35 










40 










45 






Leu 


Glu 


Lys 


Ala 


Tyr 


Asn 


He 


Glu 


He 


Pro 


Asp 


Leu 


Ser 


Ser 


Gin 


Glu 




50 










55 








60 










Gly 


He 


Ser 


Trp 


Ser 


Val 


Asn 


Arg 


Tyr 


Phe 


Lys 


Gin 


Asp 


Ser 


Ser 


Gly 


65 










70 










75 










80 


Ala 


Val 


Val 


Glu 


Leu 


Cys 


Leu 


Arg 


Glu 


Cys 


Gin 


He 


Glu 


Ser 


Met 


Thr 










85 










90 










95 




Trp 


Leu 


He 


Asp 


Phe 


Pro 


Ala 


Leu 


Lys 


Lys 


Leu 


Asp 


Leu 


Ser 


Tyr 


Asn 








100 










105 










110 




Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Thr 


Lys 






115 










120 










125 






Leu 


Arg 


Leu 


Arg 


Ser 


Asn 


Gin 


He 


Arg 


Lys 


Leu 


Glu 


Gly 


Leu 


Asp 


Ser 




130 










135 










140 






Leu 


Thr 


Ser 


Leu 


Thr 


Lys 


Leu 


Ser 


Leu 


Ser 


Asp 


Asn 


Gin 


He 


Ser 


Lys 


145 










150 










155 










160 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Ala 


Glu 


Leu 


Tyr 


Leu 


Leu 










165 










170 










175 




Asp 


Asn 


Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 








180 










185 








190 






Ala 


Thr 


Leu 


Glu 


Leu 


Ser 


Gly 


Asn 


Gin 


He 


Arg 


Lys 


Leu 


Glu 


Gly 


Leu 






195 










200 










205 






Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Ala 


Thr 


Leu 


Glu 


Leu 


Ser 


Gly 


Asn 


Gin 


He 




210 










215 










220 








Arg 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Thr 


Lys 


Leu 


Arg 


225 










230 










235 








240 


Leu 


Arg 


Ser 


Asn 


Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 










2*35 










250 








255 




Ser 


Leu 


Ala 


Thr 


Leu 


Glu 


Leu 


Ser 


Gly 


Asn 


Gin 


He 


Arg 


Lys 


Leu 


Glu 








260 










265 










270 






Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Ala 


Thr 


Leu 


Glu 


Leu 


Ser 


Gly 


Asn 






275 










280 










285 






Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Ser 


Ser 


Leu 


Thr 


Lys 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929B70A1_L> 



wo 99/29870 



PCT/AU98/01023 



10 



30 



307/490 

290 295 300 

Leu Arg Leu Arg Ser Asn Gin lie Ser Lys Leu Glu Gly Leu Glu Arg 
305 310 315 320 

L-u Thr Ser Leu Thr Lys Leu Ser Leu Ser Asp Asn Gin lie Ser Lys 

325 330 335 

Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Glu Leu Tyr Leu Leu 

340 345 350 

Asp Asn Gin lie Arg Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu 

355 360 365 

Thr Lys Leu Arg Leu Arg Ser Asn Gin lie Ser Lys Leu Glu Gly Leu 

370 375 380 

Asp Ser Leu Thr Ser Leu Thr Lys Leu Ser Leu Ser Asp Asn Gin He 
385 390 395 400 

Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Glu Leu Tyr 
15 405 410 415 

Leu Leu Asp Asn Gin He Arg Lys Leu Glu Gly Leu Asp Gly Leu Ala 

420 425 430 

Ser Leu Thr Arg Leu Ser Leu Arg Arg Asn Gin He Ser Lys Leu Glu 
435 440 445 

ZU Gly Leu Asp Arg Leu Lys Val Leu Arg Lys Leu Asp Val Ser Gly Asn 
4SQ 455 460 

Asp He Gin Ser He Asp Asp He Lys Leu Leu Ala Pro He Leu Glu 
465 470 475 480 

Gin Thr Leu Glu Lys Leu Arg He His Asp Asn Pro Phe Val Ala Ser 
25 485 4 90 ^95 

Ser Gly Leu He Leu Ser Pro Tyr Asp Asn His Leu Pro Glu He Lys 

500 505 510 

Ala Leu Leu Glu Lys Glu Lys Glu Lys Gin Lys Lys Thr Ser Val Glu 

515 520 525 

Tyr His Pro Phe Cys Lys Val Met Leu Leu Gly Asn His Ser Ser Gly 

530 535 540 

Lys Thr Thr Phe Leu Ser Gin Tyr Asp Thr Asn Tyr Thr Tyr Gin Lys 
545 550 555 560 

Asn Thr His Val Leu Ser He His Arg Ser Asn Asn Pro Asn Ala He 

565 570 575 

Phe Tyr Asp Phe Gly Gly Gin Asp Tyr Tyr His Gly He Tyr Gin Ala 

580 585 
Phe Phe Thr Thr Gin Ser Leu Tyr Leu Leu Phe Trp Asp Ala Lys Lys 

595 600 605 

Asp Arg Asn Phe Val Ser Val Asp Asp Lys Glu Tyr Gin Thr Leu Asn 

610 615 620 

Phe Asn Arg Pro Tyr Trp Leu Gly Gin He Ala Tyr Ala Cys Asn Arg 
625 630 635 640 

Cys Met Ser Val Gly Gly Asn Pro Asp Gly Lys Asp Thr Pro Gin Thr 

645 650 655 

Thr Asp Asp Thr He He He Gin Thr His Ala Asp Glu Thr Gly Ala 

660 665 670 

Lys Gin Gin Thr Leu Gly Cys Ala Ala Glu Asn Gly Val Leu Glu Glu 

675 680 685 

He Tyr Val Ser Leu Glu Pro Lys Ala Asn Ser Ala Val His Ala Leu 

690 695 700 

Asn Tyr Leu Asn Glu Arg Val Arg Glu Val Val Ala Ser Arg Ser Lys 
'fOb 710 715 720 

Ser He Gin He Thr Glu Lys Asp Lys Gly Leu Tyr Glu Ala Leu Pro 

725 730 735 

Thr He Ala Gly Asp Asn Lys His He Pro He Ser Leu Glu Ala Leu 

"^40 7^5 750 

Ala Ala Gin Leu Asn Lys Gly Arg Ala Glu Asn Asp Leu Tyr Thr He 

755 760 765 

Glu Tyr Leu Gin Thr Glu Leu Asn Gin Leu Ser Leu Arg Gly Glu Val 

770 775 780 

Leu Tyr Tyr Arg Glu Asn Glu Lys Leu Asn Asn Tyr Val Trp Leu Asp 
785 790 795 800 

Pro Ala Ala Phe Val Gin Met He His Gly Glu He Leu Gin Lys Asp 

805 810 815 

Asn He Asn Arg Gly Thr Val Pro Lys Asp He Phe Glu Cys Lys Leu 

820 825 830 

His Asn Leu Ser Ser Gly Ser He Phe Glu Glu Asp Gly Gin Asn Gly 

635 840 845 

Asn Met He Leu Gin Leu Leu Leu Glu Glu Leu He Val Tyr Glu Asp 

850 855 860 

Lys Asp Cys Tyr Val He Pro Gly Tyr Leu Pro Leu His Ser Asp Asp 
865 870 875 880 

Glu Ala Tyr Lys Trp Leu Thr Leu Gly Phe Glu Arg Pro Asn Phe Val 
885 890 895 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



40 



50 



55 



60 



65 



70 



75 



BNSDOCID: <WO 9929870Al_ 



wo 99/29870 



PCT/AU98/01023 



308/ 490 



10 



15 



20 



60 



70 



Leu 


Lys 


Phe 


Glu 


Arg 


Phe 


He 


Pro 


Phe 


Gl V 


Leu 


He 


Asn 


Gl n 


He 


He 








900 










905 










910 






Ala 


Tyr 


Tyr 


Gly 


Arg 


Glu 


Glu 


Gly 


Ala 


Leu 


Lys 


Arg 


Tyr 


Trp 


rg 


sp 






915 










920 










925 








Gin 


Val 


He 


Phe 


Thr 


Ala 


Gly 


Arg 


Glu 


Met 


Asp 


Arg 


Gin 


Thr 


L u 


Glu 




930 










935 










940 










Gin 


Glu 


Glu 


Glu 


Lys 


Glu 


Gly 


Leu 


Pro 


Lys 


Thr 


Asn 


Ala 


Glu 


Asp 


ryr 


945 










950 










955 








960 


Gin 


He 


Trp 


He 


Lys 


Leu 


Asp 


Phe 


Thr 


Asp 


Leu 


Ala 


He 


Ser 


Val 


Phe 










965 










97 0 










97 5 




He 


Lys 


Glu 


Gin 


Arg 


Lys 


Thr 


Ser 


Ala 


Lys 


Asp 


Met 


Gin 


Arg 


Lys 


Glu 








980 










985 










990 




Ala 


Thr 


He 


Leu 


Ser 


Asp 


Met 


Leu 


Asp 


Met 


Tyr 


Trp 


Asn 


Asn 


He 


Pro 






995 










1000 








1005 






Pro 


Arg 


Glu 


Gin 


He 


Gly 


Asp 


Lys 


Asp 


Thr 


Glu 


Gin 


Thr 


Arg 


Ser 


Thr 




1010 








1015 








1020 








He 


Arg 


Glu 


Thr 


Asn 


Arg 


Lys 


Lys 


Arg 


Pro 


He 


Gin 


Asp 


Leu 


Tyr 


Leu 


1025 








1030 








103 5 








] 


Ser 


Cys 


Ala 


Gin 


Ala 


Asp 


Lys 


Asp 


Leu 


Thr 


Glu 


Ser 


His 


Tyr 


He 


His 










1045 








1050 








1055 


Leu 


Gly 


Thr 


Leu 


Asp 


Asp 


Glu 


Ser 


Lys 


Thr 


Thr 


Ala 


Arg 


He 


Ala 


Ala 








1060 








1065 








1070 




Tyr 


Pro 


Leu 


Lys 


Asn 


Gly 


Val 


He 


Asp 


Lys 


Glu 


Arg 


Val 


Arg 


Glu 


Val 






1075 








1080 








1085 






Ser 


Thr 


Arg 


Fro 


Tyr 


i^y^ 


His 


Leu 


Ser 


Val 


Asn 


Lys 


Asn 


Leu 


Ala 


Thr 




1090 








1095 








1100 








Ala 


Lys 


Gin 


He 


Phe 


He 


Ser 


Tyr 


Ser 


Lys 


Glu 


Asp 


Gin 


Thr 


Glu 


Leu 



1040 



25 

1105 1110 1115 1120 

Glu Thr Cys Leu Gin Phe Phe Lys Pro Leu Glu Lys Asn Gly Gin He 
30 1125 1130 1135 

Glu He Tyr Tyr Asp Lys Leu Thr Lys Phe Glu Thr Pro He His Pro 

1140 1145 1150 

Glu He Arg Lys Arg He Val Glu Ala Asp Cys He He Ala Leu He 
1155 1160 1165 

35 Ser Gin Arg Tyr Leu Ala Thr Asp Tyr He Leu Asp His Glu Leu Pro 
1170 1175 1180 

Val Phe Arg Glu Tyr Asn Lys Thr He Val Pro He Leu He Lys Pro 
1185 1190 1195 1200 

Cys Thr Phe Glu Asp Asp Glu Phe Leu Arg Glu Lys Tyr Phe Ala Gin 
40 1205 1210 1215 

Lys Ala Gin He He Asn Leu Gly Lys Glu Gly Lys Thr He Lys Ala 

1220 1225 1230 

Tyr Asp Ser He Thr Ala Ser Ala His Arg Asp Glu Asn Trp Val Ala 
1235 1240 1245 

45 Val Val Arg Glu Phe Lys Glu Lys He Leu Arg He Thr Lys Gin Glu 
1250 1255 1260 

Val Asn Thr Asp Glu 
1265 

50 (2) INFORMATION FOR SEQ ID MO: 37 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 amino acids 

(B) TYPE: amino acid 
55 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 



(ix) FEATURE: 
65 (A) NAME/KEY: misc_feature 

{B) LOCATION 1 . . .231 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 6 



Lys Phe Met Met Lys Lys Ala Phe Val Phe Val Leu Leu Val Cys Leu 

15 10 15 

Phe Ser Ser Phe Ser Ser Ser Ala Gin Thr Thr Thr Asn Ser Ser Arg 

20 25 30 

Ser Tyr Phe Thr Gly Arg He Glu Lys Val Ser Leu Asn Leu Gly Val 
75 35 40 45 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929B70A1_I_> 



wo 99/29870 



PCT/AU98/01023 



309/490 



45 



50 



55 



60 



65 



70 



75 



Pro 


Pro 


Vai 


Ser 


Thr 


Glu 


Val 


Trp 


Gly 


Met 


Thr 


His 


Asp 


Ala 


Asn 


Gly 




50 










55 










60 










Leu 


Pro 


Phe 


Glu 


He 


Pro 


He 


Ser 


Phe 


Ser 


Arg 


Phe 


Asn 


Ser 


Gin 


Gly 


65 










70 










75 










80 


Asp 


lie 


Ala 


Thr 


Thr 


Tyr 


Tyr 


He 


Ala 


Asn 


Ser 


Glu 


Ala 


Thr 


Leu 


Asn 










85 










90 










95 




Glu 


Trp 


Cys 


Asp 


Tyr 


Ala 


His 


Pro 


Gly 


Gly 


He 


Val 


Arg 


Val 


Glu 


Gly 








100 










105 










110 






Arg 


Phe 


Trp 


Lys 


Met 


Thr 


Tyr 


Asn 


He 


Pro 


Thr 


Tyr 


Asn 


Ala 


Val 


Cys 






115 










120 










125 








Thr 


Arg 


He 


Thr 


Phe 


Glu 


Aen 


Gin 


Glu 


He 


Glu 


Gly 


Thr 


He 


Val 


Leu 




130 










135 










140 










lie 


Pro 


Ly^ 


Pro 


Lys 


Val 


Ser 


Leu 


Pro 


His 


Val 


Ser 


Glu 


Ser 


Val 


Pro 


145 










150 










155 










160 


Cys 


He 


Arg 


Thr 


Glu 


Ala 


Gly 


Arg 


Glu 


Phe 


He 


Leu 


Cys 


Glu 


Glu 


Asp 










165 










170 










175 




Asp 


Thr 


Phe 


Val 


Ser 


Hi^ 


Asp 


Gly 


Asn 


Glu 


Val 


Thr 


He 


Gly 


Gly 


Lys 








180 










185 










190 






Pro 


Phe 


Leu 


Leu 


Asn 


Thr 


Asn 


Val 


Lys 


He 


Val 


Gly 


Asp 


Vai 


Ser 


Gin 






195 










200 










205 








hys 


Tyr 


Ala 


Val 


Gly 


Val 


Gly 


Glu 


He 


Arg 


Phe 


Leu 


Gin 


He 


Cys 


Ala 




210 










215 










220 










Gin 


Thr 


Val 


Ser 


Gin 


Gin 


Lys 




















225 










230 






















(2) 


It I FORI 1 AT I Oil 


FOR 


SEQ 


ID HO; 37 7 

















10 



15 



20 



25 



( i ] SEQUEMCE CHARACTER I ST ICS : 

(A) LENGTH: 563 amino acids 
30 {B) TYPE: amino acid 

(D) TOPOLOGy: linear 

(ii.) MOLECULE TYPE: protein 

35 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAJIISM; Porphyromonas gingivalis 

40 (ixj FEATURE: 

(A) rWIE/KEY: misc_feature 
{B) LOCATION 1 . . . 563 



(>;i) SEQUENCE DESCRIPTION: SEQ ID HO: 377 



Ser 


Leu 


He 


Asn 


Asn 


Ser 


Arg 


Asp 


Trp 


Arg 


Ala 


Pro 


Val 


Arg 


Leu 


Ser 


1 








5 










10 










15 




Leu 


Lys 


Lys 


Lys 


Thr 


Lys 


Thr 


Met 


Lys 


Thr 


Lys 


Val 


Leu 


Arg 


Lys 


Phe 








20 










25 










30 






Vai 


Val 


Ala 


Ala 


Phe 


Ala 


Val 


Ala 


Thr 


Leu 


cys 


Pro 


Leu 


Ala 


Gin 


Ala 






35 










40 








^5 








Gin 


Thr 


Met 


Gly 


Gly 


Asp 


Asp 


Val 


Lys 


Val 


Val 


Gin 


Tyr 


Asn 


Gin 


Glu 




50 










55 










60 










Lys 


Leu 


Val 


Gin 


Thr 


Arg 


Met 


Ser 


Val 


Ala 


Asp 


Asn 


Gly 


Trp 


He 


Tyr 


65 










70 










75 










80 


Val 


Met 


Thr 


His 


Ser 


Gly Tyr 


Asp 


Thr 


Gly 


Asn 


Ser 


Asn 


Val 


Lys 


He 










85 










90 










95 




Phe 


Arg 


Ser 


Lys 


Asp 


Gin 


Gly 


Ala 


Thr 


Tyr 


Gin 


Lys 


Leu 


Arg 


Asp 


Trp 








100 










105 










110 






Asp 


Pro 


Ser 


Asp 


Asp 


Tyr 


Gin 


Phe 


Gin 


Asp 


Phe 


Asp 


He 


Val 


Val 


Thr 






115 










120 










125 








Gly 


Lys 


Asn 


Glu 


Ser 


Asp 


He 


Lys 


He 


Trp 


Ser 


Val 


Glu 


Leu 


Met 


Asn 




130 










135 










I'JO 










Lys 


Pro 


Gly 


Gly 


Tyr 


Lys 


Ser 


Arg 


Val 


Ala 


Val 


Phe 


Ser 


Arg 


Asp 


Ala 


145 










150 










155 










160 


Asn 


Ala 


Gin 


Asn 


Ala 


Lys 


Leu 


Val 


Tyr 


Lys 


Glu 


Asp 


Phe 


Ser 


Asn 


Val 










165 










170 










175 




Gin 


Leu 


Tyr 


Asp 


Val 


Asp 


He 


Ala 


Ser 


Asn 


Tyr 


Arg 


Ser 


Pro 


Ser 


Ser 








180 










185 










190 






Leu 


Asn 


Asn 


Gly 


Gly 


Asn 


Pro 


Phe 


Ala 


Leu 


Ala 


Phe 


Ala 


Tyr 


Thr 


Gly 






195 










200 










205 








Phe 


Asn 


Asn 


Thr 


His 


Lys 


He 


Ser 


Phe 


Val 


Asp 


Tyr 


Val 


Phe 


Ser 


Leu 




210 










215 








220 










Asn 


Gly 


Gly 


Gin 


Asn 


Phe 


Asn 


Lys 


Asn 


Leu 


Leu 


Phe 


Ser 


Gin 


Asp Gly 


225 










230 










235 










240 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870AlJ_> 
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310/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



70 



75 



Glu 


Lys 


Lys 


He 


Asp 


Lys 


Val 


Asp 


Leu 


Ser 


Leu 


Gly 


Ser 


Thr 


Ser 


Glu 










245 










250 










255 




Ser 


Met 


Gly 


His 


Asn 


Ala 


Trp 


Pro 


Leu 


Met 


Gly 


Val 


Val 


Phe 


Glu 


Met 








260 










265 










270 






Asn 


Lys 


Gin 


Gly 


Gly 


Lys 


Ser 


Asp 


He 


Gly 


Phe 


Leu 


Ser 


Asn 


Phe 


Val 






275 










280 










285 








Asp 


Asn 


Asp 


Pro 


Glu 


Phe 


Gin 


Trp 


Ser 


Gly 


Pro 


He 


Lys 


Val 


Ser 


Glu 




290 










295 










300 










Ser 


Asp 


I 'let 


Ser 


Phe 


Ser 


Pro 


Lys 


He 


Gin 


Met 


Leu 


Leu 


Asp 


Glu 


Asp 


305 










310 










315 










320 


Asn 


Asn 


Tlir 


He 


Asn 


Gly 


Glu 


Ser 


Cys 


His 


Asn 


Phe 


Met 


He 


Thr 


Tyr 










325 










330 










335 




Ser 


Asp 


Tyr 


Asp 


Ser 


Glu 


Tyr 


Ser 


Asp 


Trp 


Asp 


He 


Arg 


Tyr 


Val 


Tyr 








340 










345 










350 






Pro 


Lvs 


Lys 


Ser 


Phe 


Lys 


Tyr 


Glu 


Lys 


Gly 


Lys 


Thr 


Pro 


Thr 


Met 


Asp 






355 










360 










365 








Asp 


Leu 


Val 


Glu 


Ala 


Phe 


Leu 


Thr 


Ala 


Ser 


Tyr 


Gin 


Ser 


Glu 


Thr 


Asn 




37 0 










375 










380 










Ser 


Gly 


Leu 


Gly 


Tyr 


Asp 


Lys 


Asn 


Ala 


Asn 


His 


Tyr 


Leu 


He 


Thr 


Tyr 


385 










390 










395 










4 00 


Ala 


Lys 


Lys 


Gl u 


Glu 


Asn 


Gly 


Thr 


Asn 


Thr 


Leu 


Lys 


Tyr 


Arg 


Trp 


Ala 










405 










410 










415 




Asn 


Tyr 


Asp 


Lys 


He 


His 


Asn 


Lys 


Asp 


Leu 


Trp 


Ser 


Asp 


Thr 


Phe 


Thr 








420 










4 25 










4 30 






Tyr 


Thr 


Ser 


Ser 


Ala 


Asn 


Ala 


Leu 


Tyr 


Thr 


Pro 


Gin 


Val 


Asp 


He 


Asn 






435 










440 










4 45 








Pro 


Thr 


Lys 


Gly 


Leu 


Val 


Cys 


Trp 


Ser 


Trp 


Val 


Glu 


Tyr 


Leu 


Pro 


Gly 




450 










4 55 










460 










Lys 


Arg 


He 


Val 


Trp 


Ser 


Asp 


Thr 


Gin 


Trp 


Thr 


His 


Ala 


Asn 


Gly 


Val 


465 










470 










475 










480 


Glu 


Asp 


He 


Val 


Met 


Gin 


Glu 


Gly 


Ser 


Met 


Lys 


Leu 


Tyr 


Pro 


Asn 


Pro 










485 










4 90 










4 95 




Ala 


Gin 


Glu 


Tyr 


Ala 


Val 


He 


Ser 


Leu 


Pro 


Thr 


Ala 


Ala 


Asn 


Cys 


Lys 








500 










505 










510 






Ala 


Val 


Val 


T^-r 


Asp 


Met 


Gin 


Gly 


Arg 


Val 


Val 


Ala 


Glu 


Ala 


Ser 


Phe 






515 










520 










525 








Ser 


Gly 


Asn 


Glu 


Tyr 


Arg 


Leu 


Asn 


Val 


Gin 


His 


Leu 


Ala 


Lys 


Gly 


Thr 




530 










535 










540 










Tyr 


He 


Leu 


Lys 


Val 


Val 


Ser 


Asp 


Thr 


Glu 


Arg 


Phe 


Val 


Glu 


Lys 


Leu 


545 










550 










555 










560 


He 


Val 


Glu 





























(2) INFORMATION FOR SEQ ID MO: 37 8 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 786 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 
60 {B) LOCATION 1...7e6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 8 



65 



He 


Val Tyr 


Leu 


cys 


His 


Cys 


Met 


Asn 


His 


Arg 


Arg 


Ser 


Lys 


Thr 


Met 


1 






5 










10 










15 




Leu 


Thr He 


Arg 
20 


Asn 


Phe 


Leu 


Leu 


Phe 
25 


Cys 


Cys 


Leu 


Ser 


Leu 
30 


He 


Ala 


Phe 


Ala Ala 
35 


Asp 


Ala 


Gin 


Ser 


Ser 
40 


Val 


Ser 


Ser 


Gly 


Arg 
45 


Arg 


Leu 


Thr 


Glu 


Tyr Val 
50 


Asn 


Pro 


Phe 


He 
55 


Gly 


Thr 


Ala 


Asn 


Tyr 
60 


Gly 


Thr 


Thr 


Asn 


Pro 


Gly Ala 


Val 


Leu 


Pro 


Asn 


Gly 


Leu 


Met 


Ser 


Val 


Thr 


Pro 


Phe 


Asn 


65 








70 










75 










80 


Val 


Ser Gly 


Ser 


Thr 
85 


Glu 


Asn 


Arg 


Phe 


Asp 
90 


Lys 


Asp 


Ser 


Arg 


Trp 
95 


Trp 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Ser 


Ala 


Pro 


Tyr 


Ser 


Ala 


Asp 


Asn 


Ser 


Tyr 


Cys 


lie 


Gly 


Phe 


Ser 


His 








100 










105 










110 






Val 


Asn 


Leu 


Ser 


Gly 


Val 


Gly 


Cys 


Pro 


Glu 


Leu 


Ser 


Gly 


He 


Leu 


Leu 






115 










120 










125 








Met 


Ala 


Thr 


Ser 


Gly 


Thr 


Phe 


Asp 


Pro 


Asp 


Tyr 


cys 


Cys 


Tyr 


Gly 


Ser 




130 










135 










140 










Ser 


Leu 


Ser 


Arg 


Glu 


Tyr 


Ala 


Arg 


Pro 


Gly 


Glu 


Tyr 


Lys 


Ala 


Val 


Leu 


145 










150 










155 










160 


Asp 


Lys 


Tyr 


Gly 


He 


Asp 


Ala 


Ala 


Val 


Thr 


Val 


Thr 


Glu 


Arg 


Thr 


Ala 






165 










170 










175 




Leu 


Thr 


Glu 


Phe 


Ala 


Phe 


Pro 


Glu 


Gly 


Glu 


Gly 


His 


He 


Leu 


Leu 


Asn 








180 










185 










190 






Leu 


Gly 


Gin 


Ala 


Leu 


Ser 


Asn 


Glu 


Ser 


Gly 


Ala 


Ser 


Val 


Arg 


Phe 


Leu 






195 










200 










205 








Asn 


Asp 


Ser 


Thr 


Val 


Val 


Gly 


Ser 


Arg 


Leu 


Met 


Gly 


Thr 


Phe 


Cys 


Tyr 




210 










215 










220 










Asn 


Pro 


Gin 


Ala 


Val 


Phe 


Arg 


Gin 


Tyr 


Phe 


Val 


Leu 


Gin 


Val 


Ser 


Arg 












230 










235 










240 


Arg 


Pro 


He 


Ser 


Ala 


Gly 


Tyr 


Trp 


Lys 


Lys 


Gin 


Pro 


Pro 


Met 


Thr 


Val 










245 










250 










255 




Glu 


Ala 


Gin 


Trp 


Asp 


Ser 


Thr 


Ala 


Gly 


Lys 


Tyr 


Lys 


Gin 


Tyr 


Asp 


Gly 








260 










265 










270 






Tyr 


Lys 


Arg 


Glu 


Met 


Ser 


Gly Asp 


Asp 


He 


Gly 


Val 


Arg 


Phe 


Ser 


Phe 






275 










280 










285 








Asn 


Cys 


Asp 


Gin 


Gly 


Glu 


Lys 


He 


Tyr 


Val 


Arg 


Ser 


Ala 


Val 


Ser 


Phe 




290 










2 95 










300 










Val 


Ser 


Glu 


Ala 


Asn 


Ala 


Leu 


Tyr 


Asn 


Leu 


Glu 


Ala 


Glu 


Gin 


Glu 


Glu 


305 










310 










315 










320 


Val 


Phe 


Ly^ 


Ser 


Val 


Gly 


Gly 


Asn 


Pro 


Ala 


Lys 


Ala 


Phe 


Ser 


Ala 


He 










325 










330 










335 




Arg 


Ser 


Arg 


Ala 


He 


Glu 


Arg 


Trp 


Glu 


Glu 


Ala 


Leu 


Gly 


Thr 


Val 


Glu 








340 










345 










350 






Val 


Glu 


Gly 


Gly 


Thr 


Pro 


Asp 


Glu 


Lys 


Thr 


lie 


Phe 


Tyr 


Thr 


Ala 


Leu 






355 










360 










365 








Tyr 


His 


Leu 


Leu 


He 


His 


Pro 


Asn 


He 


Leu 


Gin 


Asp 


Ala 


Asn 


Gly 


Glu 


370 










375 










380 










Tyr 


Pro 


t'let 


Met 


Gly 


Ser 


Gly 


Lys 


Thr 


Gly 


Asn 


Thr 


Ala 


His 


Asp 


Arg 


385 










390 










395 










400 


Tyr 


Thr 


Val 


Phe 


Ser 


Leu 


Trp 


Asp 


Thr 


Tyr 


Arg 


Asn 


Val 


His 


Pro 


Leu 










405 










410 










415 




Leu 


Cys 


Leu 


Leu 


Tyr 


Pro 


Glu 


Lys 


Gin 


Leu 


Asp 


Met 


Val 


Arg 


Thr 


Leu 








420 










425 










430 






He 


Asp 


Met 


Tyr 


Arg 


Glu 


Ser 


Gly 


Trp 


Leu 


Pro 


Arg 


Trp 


Glu 


Leu 


Tyr 






435 










440 










445 








Gly 


Gin 


Glu 


Thr 


Leu 


Thr 


Met 


Glu 


Gly 


Asp 


Pro 


Ser 


Leu 


He 


Val 


He 




450 










455 










460 










Asn 


Asp 


Thr 


Trp 


Gin 


Arg 


Gly 


Leu 


Arg 


Ala 


Phe 


Asp 


Thr 


Ala 


Thr 


Ala 


465 










470 










475 










480 


Tyr 


Glu 


Ala 


Met 


Lys 


Lys 


Asn 


Ala 


Ser 


Ssr 


Ala 


Gly 


Ala 


Thr 


His 


Pro 










485 










490 










4 95 




He 


Arg 


Pro 


Asp 


Asn 


Asp 


Asp 


Tyr 


Leu 


Thr 


Leu 


Gly 


Phe 


Val 


Pro 


Leu 








500 










505 










510 






Arg 


Glu 


Gin 


Tyr 


Asp 


Asn 


Ser 


Val 


Ser 


His 


Ala 


Leu 


Glu 


Tyr 


Tyr 


Leu 






515 










520 










525 








Ala 


Asp 


Trp 


Asn 


Leu 


Ser 


Arg 


Phe 


Ala 


His 


Ala 


Leu 


Gly 


His 


Lys 


Glu 




530 










535 










540 










Asp 


Ala 


Ala 


Leu 


Phe 


Gly 


Lys 


Arg 


Ser 


Leu 


Gly 


Tyr 


Arg 


His 


Tyr 


Tyr 


545 










550 










555 










560 


Asn 


Lys 


Glu 


Tyr 


Gly 


Met 


Leu 


Cys 


Pro 


Leu 


Leu 


Pro 


Asp 


Gly 


Ser 


Phe 










565 










57 0 










575 




Leu 


Thr 


Pro 


Phe 


Asp 


Pro 


Lys 


Gin 


Gly 


Glu 


Asn 


Phe 


Glu 


Pro 


Asn 


Pro 








580 










585 










590 






Gly 


Phe 


His 


Glu 


Gly 


Ser 


Ala 


Tyr 


Asn 


Tyr 


Ala 


Phe 


Phe 


Val 


Pro 


His 






595 










600 










605 








Asp 


He 


Gin 


Gly 


Leu 


Ala 


Arg 


Leu 


Met 


Gly 


Gly 


Ala 


Lys 


Val 


Phe 


Ser 




610 










615 










620 










Glu 


Arg 


Leu 


Gin 


Lys 


Val 


Phe 


Asp 


Glu 


Gly 


Tyr 


Tyr 


Asp 


Pro 


Thr 


Asn 


625 










630 










635 










640 


Glu 


Pro 


Asp 


He 


Ala 


Tyr 


Pro 


Tyr 


Leu 


Phe 


Ser 


Tyr 


Phe 


Pro 


Lys 


Glu 










645 










650 










655 




Ala 


Trp 


Arg 


Thr 


Gin 


Lys 


Leu 


Thr 


Arg 


Glu 


Leu 


He 


Asp 


Lys 


His 


Phe 








660 










665 










670 






Cys 


Asn 


Ala 


Pro 


Asn 


Gly 


Leu 


Pro 


Gly Asn 


Asp 


Asp 


Ala 


Gly 


Thr 


Met 






675 










680 










685 








Ser 


Ala 


Trp 


Leu 


Val 


Tyr 


Ser 


Met 


Leu 


Gly 


Phe 


Tyr 


Pro 


Asp 


Cys 


Pro 
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490 



690 



695 



700 



5 
10 
15 
20 
25 
30 
35 
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70 
75 



Gly 


Ser 


Pro 


Thr 


Tyr 


Thr 


Leu 


Thr 


Ser 


Pro 


Val 


Phe 


Pro 


Arg 


Val 


Arg 


7 05 










710 










715 










720 


He 


Arg 


Leu 


Asn 


Pro 


Gin 


Tyr 


Tyr 


Pro 


Gin 


Gly 


Glu 


Leu 


He 


He 


Thr 










725 










730 










735 




Thr 


Asn 


Thr 


Glu 


Asn 


Gin 


Pro 


Thr 


Asp 


Ser 


He 


Tyr 


He 


His 


Thr 


Val 








140 










745 










750 






Ser 


Leu 


Gly 


Asn 


Lys 


Thr 


Leu 


Pro 


His 


Gly 


Thr 


Arg 


His 


He 


Ser 


His 






755 










760 










765 








Ala 


Asp 


Leu 


Val 


Arg 


Cys 


Gly 


His 


Leu 


Arg 


Tyr 


Glu 


Leu 


Ser 


Asn 


Arg 




770 










775 










780 










Pro 


Arg 































(2) IK FORMATION FOR SEQ ID NO: 37 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromon as gingival! s 

(ix) FEATURE: 

(A) NAflE/KEY: miEc_feature 

(B) LOCATION 1 ... 814 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379 



Pro 


Arg 


Phe 


His 


Pro 


Gly 


Gly 


Thr 


His 


Ala 


Gin 


Leu 


Cys 


Arg 


Asn 


Gly 


1 








5 










10 










15 




Arg 


Arg 


Asn 


Gin 


Lys 


Ser 


Asn 


Gin 


Ser 


Ser 


Cys 


Phe 


Gly 


Gly 


Gly 


Thr 








20 










25 










30 






Ala 


Pro 


Arg 


Phe 


Phe 


He 


Met 


Cys 


Lys 


He 


Arg 


Phe 


Ser 


Leu 


Leu 


Gin 






35 










40 










45 








Ala 


Leu 


Val 


Val 


Cys 


Leu 


Leu 


Phe 


Thr 


Ser 


Phe 


Ser 


Leu 


Gin 


Ala 


Gin 




50 










55 










60 










Glu 


Glu 


Gly 


He 


Trp 


Asn 


Thr 


Leu 


Leu 


Ala 


He 


His 


Lys 


Thr 


Glu 


Lys 


65 










70 










75 










80 


Ala 


Val 


Giu 


Thr 


Pro 


Lys 


Lys 


Val 


Phe 


Ala 


Val 


Ala 


Asn 


Gly 


Val 


Leu 










85 










90 










95 




Tyr 


Ser 


Val 


Gly 


Lys 


Glu 


Ala 


Pro 


His 


Glu 


Ala 


Lys 


He 


Phe 


Asp 


Arg 








100 










105 










110 




He 


Ser 


Gly 


Leu 


Ser 


Asp 


Thr 


Ser 


Val 


Ser 


Ser 


He 


Ala 


Tyr 


Ser 


Glu 






115 










120 










125 








Gin 


Leu 


Lys 


Ser 


Leu 


Val 


He 


Tyr 


Tyr 


Ala 


Ser 


Gly 


Asn 


He 


Asp 


He 




130 










135 










140 








Leu 


Asp 


Glu 


Ala 


Gly 


Arg 


Val 


Thr 


Asn 


Val 


Pro 


Ala 


Leu 


Lys 


Asp 


Asn 


145 










150 










155 










160 


He 


Asp 


Leu 


He 


Asp 


Lys 


Thr 


Leu 


Asn 


Arg 


Leu 


Leu 


He 


Val 


Gly 


Asn 










165 










170 










175 




Arg 


Ala 


Tyr 


Leu 


Ala 


Gly 


Gly 


Phe 


Gly 


Leu 


Ser 


Val 


Leu 


Asp 


Val 


Ala 








180 










185 










190 






Glu 


Ala 


Arg 


He 


Pro 


Ala 


Thr 


Tyr 


Ala 


Lys 


Gly 


Thr 


Lys 


Val 


Thr 


Asp 






195 










200 










205 








Val 


Ala 


Lys 


Leu 


Asp 


Asn 


Asp 


Arg 


Leu 


Leu 


Met 


Leu 


Lys 


Glu 


Gly 


Gin 




210 










215 










220 










Leu 


Phe 


He 


Gly 


Lys 


Glu 


Thr 


Asp 


Asn 


Leu 


Gin 


Asp 


Pro 


Ala 


Ala 


Trp 


225 










230 










235 










240 


Thr 


Ala 


Leu 


Ser 


Leu 


Asn 


Leu 


Pro 


Met 


Gly 


Ser 


Val 


Thr 


Gly 


Leu 


Gly 










245 










250 








255 




He 


Val 


Gly 


Glu 


Asp 


He 


Cys 


Phe 


Leu 


Leu 


Ala 


Asp 


Gly 


Arg 


Val 


Tyr 








260 










265 










270 






Val 


Ala 


Ala 


Asn 


Gin 


Ser 


Phe 


Glu 


Pro 


Glu 


Leu 


Leu 


Leu 


Ser 


Ser 


Ser 






275 










280 










285 








Ala 


Asp 


Ser 


Arg 


Leu 


Tyr 


Val 


Thr 


Asp 


Arg 


Gly 


Leu 


Phe 


He 


Cys 


Ala 




290 










295 






300 








Glu 


Asn 


Arg 


He 


Tyr 


Phe 


He 


Glu 


Lys 


Gly 


Arg 


Lys 


Thr 


Thr 


Gin 


Phe 


305 










310 










315 










320 


Pro 


He 


Ala 


Asp 


Val 


Leu 


Gly 


Val 


Gly 


Ala 


Met 


Asn 


Glu 


Ser 


Asn 


Thr 
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325 330 335 

Ala Tyr lie Ala Leu Gly Glu Glu Gly Leu Ala Ser Leu Leu Leu Ala 

340 345 350 

Glu Gly Ser Thr Ala Glu Ala Met Pro Val Ala Phe Asp Gly Pro Gly 

355 360 365 

Asp Asn Asp Phe Tyr Glu Met Arg Phe Ser His Gly Arg Leu Tyr Ala 

370 375 380 

Ala Ser Gly Leu Trp Gly Thr Asn Leu Met Gly His Ala Gly Met Val 
385 390 395 400 

Lys Leu Tyr Asp Gly Asn Arg Trp Thr Asn Phe Asp Lys Lys Thr Val 

405 410 415 

Gin Glu Gin Leu Gly Gly Gly Phe Ser Phe Asn Asp Ala He Asp He 

420 425 A30 

Ala Val Ser Asn Gly Asp Pro Asp His Phe Phe Val Gly Thr Trp Gly 

435 440 445 

Asn Gly Leu Phe Glu Phe Lys Asp Gly Lys Ala He Ala Arg Tyr Ser 

450 455 460 

Gly Asn Glu Thr Ala He Ala Glu Cys Asn Pro Gly Asp Ala Arg Val 
465 470 475 480 

Lys Ala He Ala Phe Asp Asn Lys Gly Asn Leu Trp Gly Thr Leu Gly 

485 490 495 

Ala Val Gly Lys Asn He Phe Met Tyr Asp Pro Gin Ser Ser Thr Trp 

500 505 510 

His Ser Phe Ser Tyr Pro Asp Val Ala Asn Leu Ala Ser Phe Gly Asn 

515 520 525 

Met He He Leu Pro Asn Gly Asp Lys Trp Val Asn He Leu His Arg 

530 535 540 

Ser Gly Gly Ser Thr Arg Lys Gly Val Leu He Phe Asn Asp Arg Gly 
545 550 555 560 

Thr Pro Glu Thr Thr Ser Asp Asp Ser His Leu Tyr Val Glu Gin Phe 

565 570 575 

Val Asn Arg Leu Gly Ala Ala He Gly His Lys Thr He Tyr Ala Met 

580 585 590 

Ala Val Asp His Asn Gly Ser Val Trp Met Gly Ser Asp He Gly He 

595 600 605 

Phe Gly Val Tyr Asn Ala Ala Gly Val Leu Ser Ser Thr Ser Thr Pro 

610 615 620 

He Ala Val Arg Pro Val Gly Gly Glu Glu Pro Asn Leu Tyr Tyr Val 
625 630 635 640 

Leu Asp Lys Val Thr Val Thr Asp He Val Val Asp Lys Leu Asn His 

645 650 655 

Lys Trp Val Ala Thr Gin Gly Thr Gly Leu Tyr Leu Leu Ser Glu Asp 

660 665 670 

Cys Ser Lys He Leu Ala Gin Phe Thr Val Glu Asn Ser Pro Leu Leu 

675 680 685 

Ser Asn Asn He Leu Ser Leu Ala Leu Asn Asp Asp Asn Gly Leu Leu 

690 695 700 

Tyr He Gly Thr Ala Asp Gly Leu Met Thr Phe Gin Thr Gly Thr Gly 
705 710 715 720 

Ser Gly Ser Ala Ser Glu Leu Asp Gly Val Tyr Val Tyr Pro Asn Pro 

725 730 735 

Leu Arg Pro Glu Tyr Pro Asp Gly Val Thr He Ala Gly Leu Gin Ala 

740 745 750 

Gly Cys Ser Val Lys He Thr Asp Thr Thr Gly Arg Leu Leu Tyr Gin 

755 760 765 

Thr Glu Ser Val Thr Thr Glu Val Lys Trp Asn Ala Arg Gly Ala Asp 

770 775 780 

Gly Asn Arg Val Ala Ser Gly Val Tyr Ala Val Ala Val Tyr Asp Pro 
785 790 795 800 

Val Ser Lys Lys Ser Lys Leu He Arg Phe Ala Val He Arg 
805 810 

(2) in FORMATION FOR SEQ ID 1 JO: 38 0 

(i) SEQUEUCE CHARACTERISTICS: 

(A) LENGTH: 1162 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

{A) ORGANISM: Porphyromonas gingival is 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 
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(ix) FEATURE: 

(A) NAf-1E/KEY: misc_feature 

(B) LOCATION I ... 1162 

(xi) SEQUENCE DESCRIPTIOM: SEQ ID HO: 380 



Ala 


He 


Ser 


Gin 


Met 


Lys 


Arg 


He 


Leu 


Pro 


He 


Val 


Ala 


Phe 


Leu 


Ser 


1 








5 










10 










15 




Leu 


Phe 


Leu 


Ala 


Leu 


Ala 


Leu 


Pro 


Ala 


Lys 


Ala 


Gin 


Arg 


Ala 


Met 


Gly 








20 










25 










30 






Lys 




Ala 


Asp 


Arg 


Ser 


Leu 


Met 


Ala 


Ser 


Gly 


His 


Trp 


Val 


Lys 


He 






35 










4 0 










4 5 








Arg 


Val 


Asp 


Ala 


Ser 


Gly 


Val 


Tyr 


Arg 


Leu 


Thr 


A£-p 


Glu 


Gin 


Leu 


Arg 




50 










55 










60 










Ala 


Asn 


Gly 


Phe 


Ser 


Asp 


Pro 


Ser 


Lys 


Val 


Gly 


Val 


Phe 


Gly 


Tyr 


Gly 


65 










70 










75 










8 0 


Gly 


Gly 


Val 


Leu 


Pro 


Glu 


Asp 


Leu 


Ser 


Arg 


He 


Thr 


Thr 


Asp 


Asp 


Leu 










85 










90 










95 




Pro 


Pro 


Val 


Pro 


Val 


Leu 


Arg 


Gin 


Gly 


Asn 


Ala 


Leu 


Tyr 


Phe 


Tyr 


Ala 








100 










105 










110 






Val 


Gly 


Pro 


Val 


Thr 


Trp 


Phe 


Tyr 


Asn 


Pro 


Ala 


L-j's 


Thr 


Thr 


Met 


Glu 






115 










120 










125 








His 


Thr 


Val 


Asn 


Thr 


Tyr 


Ser 


Thr 


Mis 


Gly 


Tyr 


Tyr 


Phe 


Leu 


Ser 


Asp 




130 










135 










140 










Ala 


Ala 


Gly 


Ala 


Pro 


Leu 


Gin 


Met 


Ser 


Gin 


Tyr 


Thr 


Gly 


Gly 


Gly 


Ala 


145 










150 










155 










160 


Ser 


Ala 


Glu 


Ala 


Leu 


He 


Asp 


Tyr 


Tyr 


Asp 


Glu 


Leu 


Met 


Leu 


His 


Glu 










165 










17 0 










175 




Gin 


Glu 


Leu 


Tyr 


Ser 


Pro 


Lys 


Glu 


Ser 


Gly 


Arg 


Asp 


Leu 


Tyr 


Gly 


Glu 








180 










185 










190 






Ser 


Phe 


Ser 


Ala 


Val 


Asn 


Thr 


Arg 


Thr 


Val 


Lys 


Phe 


Pro 


Leu 


Arg 


Gly 






195 










200 










205 








Asn 


Thr 


Arg 


Ser 


Ser 


Gly 


Glu 


Leu 


Gly 


Thr 


Val 


Phe 


Ser 


Tyr 


He 


Ala 




210 










215 










220 










Lys 


Ala 


Arg 


Ser 


Ala 


Gly 


Gly 


Gly 


Arg 


Glu 


Met 


Ser 


Leu 


Ser 


Ala 


Asn 


225 










230 










235 










240 


Gly 


He 


Leu 


He 


Phe 


Ser 


Asp 


Pro 


Phe 


Ser 


Met 


Thr 


Ser 


Asn 


Glu 


Val 










245 










250 










255 




Ser 


Asn 


Ser 


Tyr 


Leu 


Ala 


Gly 


Lys 


Lys 


Arg 


Arg 


Leu 


Tyr 


His 


Ser 


Thr 








260 










265 










270 






Pro 


t let 


Asn 


Ser 


Leu 


Val 


Asn 


Glu 


Leu 


Arg 


Leu 


Asp 


Ala 


Asn 


Tyr 


Ser 






275 










280 










285 








Met 


Thr 


Gly 


Asp 


Ala 


Val 


Asn 


Leu 


Asp 


Phe 


He 


Glu 


Val 


Ala 


Thr 


Gin 




2 90 










295 










300 










Asn 


Asp 


Leu 


Arg 


Tyr 


A~p 


Gly 


Ala 


Pro 


Met 


His 


He 


Arg 


Arg 


Phe 


Ser 


305 










310 










315 










320 


Asn 


Leu 


Pro 


Val 


Leu 


Gly 


Gly 


Glu 


Ser 


Cys 


Arg 


Phe 


Val 


He 


Ser 


Glu 










325 










330 










335 




Val 


Pro 


Glu 


Ser 


Leu 


Val 


Val 


Leu 


Gin 


Ala 


Asn 


Ser 


Ser 


Leu 


Thr 


Ala 








34 0 










345 










350 






Ser 


Leu 


Val 


Pro 


Val 


Lys 


Thr 


Val 


Gly 


Asp 


Lys 


Thr 


He 


Glu 


Phe 


Val 






355 










360 










365 








Ala 


Pro 


Pro 


Lys 


Gly 


Gin 


Asp 


Arg 


Arg 


Thr 


He 


Asn 


Thr 


Phe 


Tyr 


Ala 




370 










375 










380 










Val 


Asp 


Leu 


Ser 


Gin 


Ala 


Ser 


Ala 


Pro 


Glu 


He 


Leu 


Gly 


Ala 


Val 


Pro 


385 










390 










395 










400 


Asn 


Gin 


Asn 


Leu 


His 


Gly 


Glu 


Glu 


He 


Pro 


Asp 


Leu 


He 


He 


Val 


Ser 










405 










410 










415 




Thr 


Gin 


Ala 


Leu 


Leu 


Leu 


Glu 


Ala 


Asp 


Arg 


Leu 


Ala 


Thr 


Tyr 


Arg 


Arg 








420 










425 










430 






Glu 


Lys 


Asn 


Gly 


Leu 


Lys 


Val 


Leu 


Val 


Val 


Leu 


Gin 


Glu 


Gin 


Val 


Phe 






435 










440 










445 








Asn 


Glu 


Phe 


Ser 


Gly 


Gly 


Thr 


Pro 


Asp 


Ala 


Thr 


Ala 


Tyr 


Arg 


Leu 


Phe 




450 










455 










4 60 










Ala 


Lys 


Met 


Phe 


Tyr 


Asp 


Arg 


Trp 


Lys 


Ala 


Asn 


Ala 


Pro 


Val 


Gly 


Glu 


465 










470 










475 










430 


Thr 


Phe 


Pro 


Met 


Gin 


Met 


Leu 


Leu 


Phe 


Gly 


Asp 


Gly 


Ala 


His 


Asp 


Asn 










485 










490 










4 95 




Arg 


Lys 


Val 


Ser 


Val 


Ala 


Trp 


Gin 


Lys 


Pro 


Tyr 


Leu 


Gin 


Gin 


Thr 


Glu 








500 










505 










510 






Phe 


Leu 


Leu 


Thr 


Phe 


Gin 


Ala 


Val 


Asn 


Ser 


Thr 


Asn 


Val 


Asn 


Ser 


Tyr 






515 










520 










525 






Val 


Thr 


Asp 


Asp 


Tyr 


Phe 


Gly 


Leu 


Leu 


Asp 


Asp 


Gin 


Pro 


Ala 


Ser 


Val 




530 










535 








540 
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Asn 


He 


Gly 




Arg 


Asn 


Tvr 




Met 


Ala 


Val 


Glv 


Arg 


Phe 


Pro 


Val 


54 5 










550 










555 










560 


Arg 


Thr 


Pro 


Ala 


Glu 


Ala 


Arg 


He 


Ala 


Val 


Asp 


Lvs 


Thr 


He 


Arg 


Tvr 










565 










570 










57 5 




Glu 


Glu 


Asp 


Arg 


Glu 


Ser 


Gly 


Ala 


Trp 


Arg 


He 


Arg 


Ala 


Cvs 


Phe 


Ala 








580 










585 










590 






Ala 


ASD 


Asn 


Gl V 


Asp 


Lys 


His 


Ala 


Thr 


Glu 


Thr 


Ser 


Arg 


Leu 


He 


ASD 






595 










600 










605 








Thr 


Val 


Lys 


Arg 


Tyr 


Ala 


Pro 


Ala 


He 


Met 


Pro 


Val 


Arg 


Ala 


Phe 


Gin 




610 










615 










620 










Asp 


Val 


Tyr 


Pro 


His 


Val 


He 


Glu 


Asn 


Gly 


Leu 


His 


Ser 


He 


Pro 


Gly 


625 










630 










635 










640 


Ala 


Lys 


Lys 


Lys 


Met 


Leu 


Glu 


Thr 


Leu 


Gin 


Ser 


Gly 


He 


He 


Leu 


Leu 










645 










650 










655 




Asn 


Tyr 


Ala 


Gly 


His 


Gly 


Gly 


Pro 


Ala 


Gly 


Trp 


Ser 


Asp 


Glu 


His 


Leu 








660 










665 










670 






Leu 


Thr 


Leu 


Asn 


Asp 


He 


His 


Lys 


Phe 


Asn 


Tyr 


Lys 


His 


Met 


Pro 


He 






675 










680 










685 








Trp 


He 


Thr 


Ala 


Thr 


Cys 


Asp 


Phe 


Ala 


Asn 


Tyr 


Asp 


Ser 


Gin 


Thr 


Thr 




690 










695 










700 










3er 


Ala 


Gly 


Glu 


Glu 


Val 


Phe 


Leu 


His 


Glu 


Lys 


Ser 


Glv 


Thr 


Pro 


He 


705 










710 










715 










720 


Met 


Phe 


Ser 


Thr 


Thr 


Arg 


Val 


Val 


Tyr 


Asn 


Thr 


Gin 


Asn 


Glu 




He 










725 










730 










735 




Asn 


Gly 


Phe 


Met 


Leu 


Arg 


Arg 


Met 


Phe 


Glu 


Lv's 


Ala 


Lvs 


ASD 


Glv 


Arg 








740 










74 5 










750 






Tyr 


Arg 


Thr 


Met 


Gly 


Glu 


He 


He 


Arg 


Ser 


Ala 


Lvs 


Gin 


Glv 


Met 


Leu 






755 










760 










7 65 








Ser 


Thr 


Val 


Phe 


Pro 


Asp 


Ser 


He 


Asn 


Gin 


Leu 


Ser 


Phe 


Phe 


Leu 


Met 




770 










775 










780 










Gly 


Asp 


Pro 


Ser 


Val 


Arg 


Met 


Asn 


Leu 


Pro 


Thr 


His 


Lys 


Val 


Gin 


Leu 


785 










790 










7 95 










800 


Thr 


Ala 


He 


As n 


Gly 


Gin 


Asp 


Pro 


Glu 


Gly 


Gin 


Tyr 


Gly 


Thr 


He 


Met 










805 










810 










815 




Leu 


Lys 


Ser 


Leu 


Glu 


Arg 


Val 


Ala 


Leu 


Lys 


Gly 


Lys 


Val 


Thr 


Asp 


Glu 








820 










825 










830 






Lys 


Gly 


Thr 


Phe 


Asp 


Glu 


Thr 


Phe 


Ser 


Gl V 


Lvs 


Val 


Phe 


Leu 


Thr 


Val 






835 










84 0 










845 








Phe 


Asp 


Gly 


Arg 


Lys 


Lys 


Met 


Thr 


Ala 


Leu 


Glu 


Gl u 


Glu 


Glv 


Asn 


Asp 




850 










855 










860 










Leu 


Ser 


Leu 


Val 


Tyr 


Tyr 


Asp 


Tyr 


Pro 


Asn 


Val 


Met 


Tvr 


Ala 


Glv 


He 


865 










870 










875 










880 


Ala 


Glu 


Val 


Lys 


Asp 


Gly 


Leu 


Phe 


Glu 


Thr 


Ser 


Phe 


Ile 


Val 


Pro 


Lys 










885 










890 










895 




Asp 


Val 


Asn 


Tyr 


Ser 


Glu 


His 


Glu 


Gly 


Arg 


He 


Asn 


Leu 


Tyr 


Ala 


Tvr 

* y j- 








900 










905 










910 






Asn 


Glu 


Ser 


Thr 


Lys 


Ala 


Glu 


Ala 


Met 


Glv 


Val 


AsD 


Phe 


Ser 


He 


Arg 






915 










920 










925 








Val 


Gin 


Pro 


Gly 


He 


Pro 


Asp 


Glu 


Val 


Thr 


Glu 


Asp 


Asn 


Thr 


Pro 


Pro 




930 










935 










940 










Glu 


He 


He 


Ser 


Cys 


Phe 


Leu 


Asn 


Asp 


Ser 


Thr 


Phe 


Arg 


Ser 


Gly 


Asp 


945 










950 










955 










960 


Glu 


Val 


Asn 


Pro 


Thr 


Pro 


Leu 


Phe 


Met 


Ala 


Glu 


Val 


Phe 


Asp 


Leu 


Asn 










965 










970 










975 




Gly 


He 


Asn 


He 


Thr 


Gly 


Ser 


Gly Val 


Gly 


His 


Asp 


He 


Thr 


Leu 


Cys 








980 










985 










990 






He 


Asp 


Gly 


Arg 


Ala 


Asp 


Leu 


Thr 


Tyr 


Asn 


Leu 


Asn 


Ala 


Tyr 


Phe 


Thr 






995 










1000 








1005 






Ser 


Ser 


Ala 


Thr 


Asp 


Ala 


Gly 


Val 


Gly 


Thr 


He 


Leu 


Phe 


Met 


He 


Pro 




1010 








1015 








1020 








Ala 


Leu 


Ala 


Glu 


Gly 


Asp 


His 


Thr 


Ala 


Arg 


Leu 


Thr 


Val 


Trp 


Asp 


He 


1025 








1030 








1035 








1 


Phe 


Asn 


Asn 


Ala 


Val 


His 


His 


Asp 


Phe 


Ser 


Phe 


Arg 


Val 


Val 


Asp 


Gly 










1045 








1050 








1055 


He 


Ala 


Pro 


Asp 


Val 


Ala 


Asp 


Val 


He 


Leu 


Phe 


Pro 


Asn 


Pro 


Val 


Arg 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 

1060 1065 1070 

Glu Ser Ala Thr Phe Arg He Phe His Asn Arg Pro Gly Ser Asp Leu 

1075 1080 1085 

Asn Val Ala Val Glu He Tyr Asp Phe Thr Gly Arg Leu Val Asn Ser 
70 1090 1095 1100 

Leu Pro Val Lys Thr Tyr Ser Ser Ser Tyr Gly Glu Pro He Glu He 
1105 1110 1115 1120 

Lys Trp Asp Leu Thr Ser Lys Tyr Gly Val Lys He Gly Asn Gly Phe 
1125 1130 1135 

75 Tyr Leu Tyr Arg Cys Val Val Asn Ser Pro Gly Gly Gin Thr Ala Ser 
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1140 1145 
Met Ala Lys Lys Met lie Val Val Gly Gin 
1155 1160 



1150 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



(2) INFORMATION FOR SEQ ID NO: 381 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 973 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(E) LOCATION 1. . . 973 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381 



Phe 


Tyr 


Thr 


Gly 


He 


Asp 


Leu 


His 


He 


Glu 


Ser 


Gin 


Met 


Lys 


Lys 


Leu 


1 








5 










10 










15 




Phe 


Pro 


Leu 


Leu 


Leu 


Leu 


He 


Leu 


Ser 


He 


Leu 


Val 


Gly 


Cys 


Gly 


Lys 








20 










25 










30 






Lys 


Glu 


Lys 


His 


Ser 


Val 


Thr 


Glu 


He 


Ala 


Arg 


Glu 


Lys 


Lys 


Arg 


He 






35 










40 










45 








Thr 


Ala 


Leu 


Leu 


Tyr 


Glu 


Lys 


Glu 


Leu 


Pro 


Thr 


Asp 


Ser 


Val 


Lys 


Gin 




50 










55 










60 










Leu 


Tyr 


Glu 


Asn 


Ser 


Val 


Gin 


Asn 


Lys 


Asn 


Leu 


Val 


Gly 


Gin 


Met 


Leu 


65 










7 0 










75 










80 


Phe 


Ala 


He 


Glu 


Val 


Gly 


Lys 


Arg 


Met 


Arg 


Asn 


Met 


Ser 


Gin 


Tyr 


Thr 










8 5 










90 










95 




Asp 


rti a 


Met 


Leu 


Tyr 


His 


Gin 


Lys 


Gly 


Leu 


Asn 


Ala 


Ala 


Leu 


Asn 


Leu 








100 










105 










110 






Arg 


Asp 


Thr 


He 


Va 1 


Ala 


r\± a 


Li J, n 


r\X a 


Trp 


Hen 

rtsn 


• 

His 


Leu 


Gly 


1 nr 


Asp 






115 










120 










125 








Ser 


Arg 


Arg 


He 


Gly 


Ala 


Leu 


Ala 


Glu 


Ala 


Ser 


Asp 


Tyr 


His 


Tyr 


Lys 




130 










135 










140 










Ala 


Leu 


Ser 


Leu 


He 


Glu 


Ser 


Phe 


Ser 


Gly 


Asn 


Gin 


Asn 


Arg 


Pro 


Ala 


145 










150 










155 










160 


He 


Lys 


Ala 


Arg 


Ser 


Ala 


Ala 


Leu 


Asn 


Gly 


He 


Gly 


Asn 


He 


Asn 


Leu 










165 










170 










175 




Glu 


Leu 


Gly 


Tyr 


His 


Asp 


Glu 


Ala 


Glu 


Lys 


Asn 


Phe 


Leu 


Lys 


Ala 


Leu 








180 










185 










190 






Gin 


Gly 


Glu 


Lys 


Glu 


Leu 


Asp 


Ser 


Pro 


Leu 


Gly 


Gin 


Ala 


He 


Asn 


Tyr 






195 










200 










205 








Ala 


Asn 


Leu 


Gly 


Arg 


He 


Tyr 


Arg 


Gin 


Arg 


Lys 


Glu 


Tyr Asp 


Lys 


Ala 




210 










215 










220 










Arg 


Thr 


Tyr 


Phe 


Leu 


Leu 


Ser 


Leu 


Glu 


Gin 


Asn 


Asn 


Met 


Ala 


Glu 


Asn 


225 










230 










235 










240 


Leu 


Met 


Gly 


He 


Gly 


Leu 


Cys 


Ser 


He 


Asn 


Leu 


Gly 


Glu 


Val 


Asp 


Glu 










245 










250 










255 




Glu 


Lys 


Gly 


Asp 


Tyr 


Gin 


Lys 


Ala 


Leu 


Gin 


Glu 


Tyr 


Ala 


Thr 


Ala 


Tyr 








260 










265 










270 






Lys 


Leu 


Met 


Glu 


Gin 


Leu 


Ser 


Asp 


Arg 


Trp 


His 


Trp 


Leu 


Asn 


Ser 


Cys 






275 










280 










285 








He 


Pro 


Met 


Ala 


Arg 


He 


Asn 


Leu 


Lys 


Gin 


Gly 


Asn 


Glu 


Arg 


Leu 


Tyr 




290 










295 










300 










Gin 


His 


Phe 


He 


Ser 


Leu 


Ala 


Glu 


Gly 


Thr 


Ala 


Lys 


Glu 


He 


Asn 


Ser 


305 










310 










315 










320 


Thr 


Ser 


His 


Leu 


He 


Glu 


He 


Tyr 


Asn 


Leu 


Gin 


Tyr 


Glu 


Asn 


Leu 


Glu 










325 










330 










335 




Arg 


Lys 


Lys 


Glu 


Tyr 


Lys 


Gin 


Ala 


Leu 


Glu 


Ala 


Phe 


Cys 


Leu 


Ser 


Lys 








340 










345 










350 






Thr 


Leu 


Ser 


Asp 


Ser 


Met 


Ser 


He 


Ala 


His 


Lys 


Val 


Ser 


Ser 


He 


Gin 






355 










360 










365 








Glu 


Thr 


Arg 


Phe 


Asn 


Tyr 


Glu 


Arg 


Asn 


Lys 


Ser 


Gin 


Lys 


Glu 


Leu 


Glu 




370 










375 










380 










Glu 


He 


Gin 


Gin 


Val 


Ser 


Lys 


Ala 


Lys 


Gin 


Glu 


Lys 


Ser 


Lys 


Phe 


He 


385 










390 










395 








4 00 


Leu 


Leu 


Ser 


Thr 


Leu 


Phe 


Ala 


Leu 


Phe 


He 


Ser 


He 


Leu 


Leu 


He 


Ser 
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405 410 Alb 

Val Leu I'hr Tyr Aia Tyr Arg Gin Gly Lys Lys His Asn Lys Leu lie 

420 425 430 

Lys Glu Thr Asp Lys Leu Arg Ser Gly Phe Phe Thr Gly He Thr His 
5 -'135 440 44 5 

Glu Phe Ar-g Thr Pro He Thr Val He Gin Gly Leu Asn Glu Lys Ilet 

4 50 4 55 4 60 

Ser Ser Ser Pro Asp Leu Gin Ala Ser Asp Arg Thr Glu Leu His Lys 
4 65 470 475 4 80 

10 He lie Asp Arg Gin Ser Ser His Met Leu Asn Leu Val Asn Gin Leu 

485 490 495 

Leu Asp He Cys Lys He Arg Ser Gly Val Ser Thr Pro Glu Trp Arg 

500 505 510 

Asn Gly Asp lie Vai Ser Phe Val Gin He Leu He Asp Ser Phe Aia 
15 515 520 525 

Pro Tyr Ala Gin Aia Gin Asp He Thr Leu Glu Leu Gin Pro Glu Ser 

530 535 54 0 

Lys Pro lie Val Vai Asp Phe Val Pro Ser Tyr Leu Gin Lys He He 
54 5 550 555 560 

20 Ser Asn Leu Leu Ser Asn Ala He Lys Tyr Ser Leu Ala Gly Gly Arg 

565 570 575 

Val Val He Ser Leu Ala Lys Thr Lys Asn Glu Lys Asn Leu He He 

590 585 590 

Arg Val Aia Asp Asn Gly He Gly He Asp Lys Thr Asp Gin Ala His 
25 595 600 605 

He Phe Asp He Phe Tyr Arg Gly Gii. Ser Ala Thr Glu Lys His Gly 

610 615 620 

Ser Gly Val Gly Leu Ser Phe Thr Asn He Leu Val Glu Asn Leu Arg 
625 630 635 640 

30 Gly Thr He Lys Val Glu Ser Gin Pro Gly Lys Gly Ser Aia Phe Thr 

645 650 655 

He Ser He Pro Thr Gin Asn Gin Ser Ser Ser Ala Glu He Leu Pro 

660 665 670 

Trp Leu Pro Ser Ser Asp Asp He Val Met Pro Val His He Ala Pro 
35 675 680 685 

Asp Asp Ser Fro Thr Ser Pro Met Val Ala Ala Leu Asn His Arg Phe 

690 695 700 

Glu Asp Glu Arg Pro Thr He Leu Leu Val Glu Asp Asn Lys Asp He 
705 710 715 720 

40 Asn Leu Leu Val Lys Leu Leu Leu Cys Asp Arg Tyr Asn Val Leu Ser 

725 730 735 

Ala Ala Asn Gly Lys Glu Gly He Ala Leu Ala Thr Glu His He Pro 

740 745 750 

Asp He He He Thr Asp He Met Met Pro He Met Asp Gly He Glu 
45 755 760 765 

liet Thr He Arg Met Lys Gin Ser Pro Leu Leu Cys His He Pro He 

770 775 780 

Val Aia Leu Thr Ala Lys Ser Thr Glu Gin Asp Arg Leu Glu Gly He 
785 790 795 800 

50 Lys Ser Gly Vai Val Ser Tyr Leu Cys Lys Pro Phe Ser Pro Glu Glu 

805 810 815 

Leu Leu Met Arg He Glu Gin Leu Leu Lys Asp Arg Glu Leu Leu Lys 

820 825 830 

Lys Phe Tyr Met Gin Lys Leu Met Leu Asp Arg Lys Pro Glu Glu Glu 
55 835 840 845 

Pro Gin Pro He Asp Asp Ser Ser Met Gin Phe Leu Leu Ala Ala Lys 

850 855 860 

Asp Ala Val Ser Gly Gly He Lys Gin Asn Pro Asp Phe Ser Ala Gin 
8^5 870 875 800 

60 Asp Leu Ala Glu L'ys Met Cys Met Ser Pro Ser Gin Leu Asn Arg Lys 

8 85 8 90 8 95 

Leu Thr Ser Val Val Gly Cys Ser Thr He Gly Tyr He Gin Gin He 

900 905 910 

Lys He Lys Leu Ala Cys Lys Leu Leu Ala Asp Glu Ser Lys Asn He 
65 915 920 925 

Ser Asp He Ser He Glu Ala Gly Phe Ser Asp Pro Ala Tyr Phe Ser 

930 935 94 0 

Arg Thr Phe Lys Arg Tyr Met Asn Cys Ser Pro Ser Gin Tyr Arg Gin 
945 950 955 Q^n 

70 Lys Leu Leu Ala Met Pro Gly Ser Asp Lys Glu Thr Val 

965 97 0 

(2) IllFORMATIOM FOR SEQ ID MO: 382 

75 (i) SEQUEIJCE CHARACTERISTICS: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_L> 



• 

wo 99/29870 
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318/ 490 



(A) LENGTH: 563 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (ii) IIOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) OR I GI HAL SOURCE: 
10 (A) ORGAMISM: Porphyromonas gingival is 

(i;0 FEATURE: 

(A) MAI IE/ KEY: miocf eature 

(B) LOCATIOM 1 . . . 563 

15 

(xi) SEQUENCE DESCRIPTIOJI : SEQ ID 110:382 





Hi5 


His 


Lys 


Thr 


Tyr 


Gin 


Thr 


Met 


Lys 


Lys 


Leu 


Leu 


Gin 


Ala 


Lys 


Ala 


20 


1 








5 










10 










15 




Leu 


He 


I.eu 


Ala 


Leu 


Gly 


Leu 


Phe 


Gin 


Leu 


Pro 


Ala 


He 


Ala 


Gin 


Thr 










20 










25 










30 








Gin 


Met 


Gin 


Ala 


Asp 


Arg 


Thr 


Asn 


Gly 


Gin 


Phe 


Ala 


Thr 


Glu 


Glu 


Met 








35 










40 










45 








25 


Gl n 


Arg 


Ala 


Phe 


Gin 


Glu 


Thr 


Asn 


Pro 


Pro 


Ala 


Gly 


Pro 


Val 


Arg 


Ala 




50 










55 










60 












li- 


Ala 


Glu 


Tyr 


Glu 


Arg 


Ser 


Ala 


Ala 


Val 


Leu 


Val 


Arg 


Tyr 


Pro 


Phe 




es 










70 










7 5 










80 




Gly 


He 


Pro 


Met 


Glu 


Leu 


He 


Lys 


Glu 


Leu 


Ala 


Lys 


Asn 


Asp 


Lys 


Val 


30 










85 










90 










95 




He 


Thr 


He 


Val 


Ala 


Ser 


Glu 


Ser 


Gin 


Lys 


Asn 


Thr 


Val 


He 


Thr 


Gin 










100 










105 










110 








Tyr 


Thr 


Gin 


Ser 


Gly 


Val 


Asn 


Leu 


Ser 


Asn 


Cys 


Asp 


Phe 


He 


He 


Ala 








115 










120 










125 








35 


Lys 


Thr 


Asp 


Ser 


Tyr 


Trp 


Thr 


Arg 


Asp 


Tyr 


Thr 


Gly 


Trp 


Phe 


Ala 


Met 




130 










135 










140 












Tyr 


Asp 


Thr 


Asn 


Lys 


Val 


Gly 


Leu 


Val 


Asp 


Phe 


He 


Tyr 


Asn 


Arg 


Pro 




145 










150 










155 










160 




Arg 


Pro 


7\sn 


Asp 


Asp 


Glu 


Phe 


Pro 


Lys 


Tyr 


Glu 


Ala 


Gin 


Tyr 


Leu 


Gly 


40 










165 










170 










175 




He 


Glu 


[let 


Phe 


Gly 


Met 


Lys 


Leu 


Lys 


Gin 


Thr 


Gly 


Gly 


Asn 


Tyr 


Met 










180 










185 










190 








Thr 


Asp 


OXy 


Tyr 


Gly 


Ser 


Ala 


Val 


Gin 


Ser 


His 


He 


Ala 


Tyr 


Thr 


Glu 








195 










200 










205 








45 


Asn 


Ser 


Ser 


Leu 


Ser 


Gin 


Ala 


Gin 


Val 


Asn 


Gin 


Lys 


Met 


Lys 


Asp 


Tyr 




210 










215 










220 












Leu 


Gly 


He 


Thr 


His 


His 


Asp 


Val 


Val 


Gin 


Asp 


Pro 


Asn 


Gly 


Glu 


Tyr 




225 










230 










235 










240 




He 


Asn 


His 


Val 


Asp 


Cys 


Trp 


Gly 


Lys 


Tyr 


Leu 


Ala 


Pro 


Asn 


Lys 


He 


50 










245 










250 










255 




Leu 


He 


Arg 


Lys 


Val 


Pro 


Asp 


Asn 


His 


Pro 


Gin 


His 


Gin 


Ala 


Leu 


Glu 










260 










265 










270 








Asp 


Ilet 


Ala 


Ala 


Tyr 


Phe 


Ala 


Ala 


Gin 


Thr 


Cys 


Ala 


Trp 


Gly 


Thr 


Lys 








275 










280 










28-5 








55 


Tyr 


Glu 


Val 


Tyr 


Arg 


Ala 


Leu 


Ala 


Thr 


Asn 


Glu 


Gin 


Pro 


Tyr 


Thr 


Asn 




290 










2 95 










300 












Ser 


Leu 


He 


Leu 


Asn 


Asn 


Arg 


Val 


Phe 


Val 


Pro 


Val 


Asn 


Gly 


Pro 


Ala 




305 










310 










315 










320 




S-r 


Val 


Asp 


Asn 


Asp 


Ala 


Leu 


Asn 


Val 


Tyr 


Lys 


Thr 


Ala 


Met 


Pro 


Gly 


60 










325 










330 










335 




Tyr 


Glu 


He 


He 


Gly 


Val 


Lys 


Gly 


Ala 


Ser 


Gly 


Thr 


Pro 


Trp 


Leu 


Gly 










3-4 0 










345 










350 








Thr 


Asp 


Ala 


Leu 


His 


Cys 


Arg 


Thr 


His 


Glu 


Val 


Ala 


Asp 


Lys 


Gly 


Tyr 








355 










360 










365 








65 


Leu 


Tyr 


He 


Lys 


His 


Tyr 


Pro 


He 


Leu 


Gly 


Glu 


Gin 


Ala 


Gly 


Pro 


Asp 




37 0 










375 










380 










Tyr 


Lys 


He 


Glu 


Ala 


Asp 


Val 


Val 


Ser 


Cys 


Ala 


Asn 


Ala 


Thr 


He 


Ser 




385 










390 










395 










400 




Pro 


Val 


Gin 


Cys 


Tyr 


Tyr 


Arg 


He 


Asn 


Gly 


Ser 


Gly 


Ser 


Phe 


Lys 


Ala 


70 










AOb 










410 










415 




Ala 


Asp 


!1et 


Thr 


Met 


Glu 


Ser 


Thr 


Gly 


His 


Tyr 


Thr 


Tyr 


Ser 


Phe 


Thr 










420 










425 










430 








Gly 


Leu 


Asn 


Lys 


Asn 


Asp 


Lys 


Val 


Glu 


Tyr 


Tyr 


He 


Ser 


Ala 


Ala 


Asp 








■^35 










440 










445 








75 


Asn 


Ser 


Gly 


Arg 


Lys 


Glu 


Thr 


Tyr 


Pro 


Phe 


Ile 


Gly 


Glu 


Pro 


Asp 


Pro 




450 










455 










460 











SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1 J_> 




10 



15 



40 



45 



50 



55 



60 



65 



70 



75 



WO 99/29870 PCT/AU98/01023 

319/490 





Lys 


Phc- 


Thr 


Cys 


Met 


Asn 


Glu Thr 


Asn 


Thr 


Cys 


Thr 


Val 


Thr 


Gly 


465 










470 








475 










480 


ALa 


Ala 


Lys 


Ala 


Leu 


Arg 


Ala 


Trp Phe 


Asn 


Ala 


Gly 


Arg 


Ser 


Glu 


Leu 










^85 








4 90 










4 95 




Ala 


Val 


Ser 


Val 


Ser 


Leu 


Asn 


He Ala 


Gly 


Thr 


Tyr 


Arg 


He 


Lys 


Leu 








500 








505 










510 






Tyr 


Pisn 


Thr 


Ala 


Gly 


Gl u 


Glu 


Val Ala 


Ala 


I let 


Thr 


Lys 


Glu 


Leu 


Val 






515 










520 








525 








Ala 


Gly 


Thr 


Ser 


Val 


Phe 


Ser 


Met Asp 


Val 


Tyr 


Ser 


Gin 


Ala 


Pro 


Gly 




530 










535 








540 










Thr 


Tyr 


Val 


Leu 


Val 


Val 


Glu 


Gly Asn 


'31 y 


He 


Arg 


Glu 


Thr 


Met 


Lys 


545 










550 








555 










560 


He 


Leu 


Lys 



























(2) IMFORMATICM FOR SEQ ID MO: 383 



(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 437 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

25 (iii) HYPOTHETICAL: YES 

( V i ) OR T G H I AL SOU RCE : 

(A) ORGAIIISM: Por phyromonas gingiva lis 

30 {!:<) FEATURE: 

(A) NAJ'IE/KEY: inisc_f eature 

(B) LOCATION 1 ... 437 



35 





(xi ] 


1 SEQUEMCE DESCRIPTION: SEQ ID HO: 38 3 










Thr 


Thr 


Asn 


Arg 


Lys 


Pro 


Asn 


Thr 


Asn 


Met 


Lys 


Leu 


Ser 


Ser 


Lys 


Lys 


1 








5 










10 










15 




He 


Leu 


Ala 


He 


He 


Ala 


Leu 


Leu 


Thr 


Met 


Gly 


His 


Ala 


Val 


Gin 


Ala 








20 










25 










30 






Gin 


Phe 


Val 


Pro 


Ala 


Pro 


Thr 


Thr 


Gly 


He 


Arg 


Met 


Ser 


Val 


Thr 


Thr 






35 










40 










45 








Thr 


Lys 


Ala 


Val 


Gly 


Glu 


Lys 


He 


Glu 


Leu 


Leu 


Val 


His 


Ser 


He 


Glu 




50 










55 










60 










Lys 


Lys 


Gly 


He 


Trp 


He 


Asp 


Leu 


Asn 


Gly 


Asp 


Ala 


Thr 


Tyr 


Gin 


Gin 


65 










70 










75 










80 


Gly 


Glu 


Glu 


He 


Thr 


Val 


Phe 


Asp 


Glu 


Ala 


Tyr 


His 


Glu 


Tyr 


Thr 
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(2) Itf FORMAT I Oil FOR SEQ ID 110:384 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 316 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 

25 (ii) IIOLECULE TYPE: protein 

( iii ) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Pcrphyroinonas gingival if 

(ix) FEATURE: 

(A) NAME/ KEY: misc^feature 

(B) LOCATION 1 ... 318 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384 
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(2) II I FORMATION FOR SEQ ID HO: 38 5 

(1) SEQUEMCE CHARACTERISTICS: 

{A) LEMGTH: 4GI amino acids 
(B) TYPE: amino acid 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



15 (vi) ORIGIllAL SOURCE: 

(A) ORGAHISM: Porphyromonas gingivali^ 

(i>:) FEATURE: 

(A) n.AME/KEY: misc_f eature 
20 (B) LOCATIOH 1...4 61 

(Ml) SEQUEIJCE DESCRIPTION: SEQ ID JJO:385 
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(2) IMFORMATIOH FOR SEQ ID HO: 386 

10 (i) SEQUEIJCE CHARACTERISTICS: 

(A) LENGTH: 4 51 amino acids 
(P) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 (ii) IIOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIMAL SOURCE: 
20 (A) ORGAtlISM: Fo rphyromonas gingival is 

(ix) FEATURE: 

(A) MAI IE/ KEY: misc_feature 

(B) LOCATIOtl 1 ... 451 
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(:-:i) SEQUENCE DESCRIPTIOM: SEQ ID 110:386 
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(2) INFORMATIOM FOR SLQ ID MO: 387 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LEtlGTM: 195 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

( V i ) OR I G I M AL SOURCE : 

(A) ORGANISM: Porphyromonas gingival is 



(ix) FEATURE: 

(A) nAME/p:EY: miEC_f eature 

(B) LOCATIOH 1 ... 195 

30 (xi) SEQUENCE DESCRIPTIOM: SEQ ID 130:38 7 
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Ser 


Tyr 


Ala 


Leu 


Gly 


Leu 


Ser 


He 


Gly 


Asn 


Asn 


Phe 


1 


















10 










15 




Lys 


Ser 


Ser 


Gly 


He 


Asp 


Ser 


Val 


Val 


I'let 


Asp 


Asp 


Phe 


Met 


Gin 


Gly 








20 










25 










30 






Leu 


Ser 


Asp 


Val 


Leu 


Glu 


Glu 


Lys 


Ala 


Pro 


Gin 


Leu 


Ser 


Tyr 


Asp 


Glu 






35 










40 










45 






Ala 


Lys 


Arg 


Glu 


He 


Glu 


Ala 


Tyr 


Phe 


Met 


Asp 


Leu 


Gin 


Gin 


Lys 


Ala 




50 










55 










60 








Val 


Lys 


Leu 


Asn 


Lys 


Glu 


Ala 


Gly 


Glu 


Glu 


Phe 


Leu 


Lys 


He 


Asn 


Ala 


65 










70 










75 










80 


His 


Lys 


Glu 


Gly 


Val 


Thr 


Thr 


Leu 


Pro 


Ser 


Gly 


Leu 


Gin 


Tyr 


Glu 


Val 










85 










90 










95 




He 


Lys 


1 let 


Gly 


Glu 


Gly 


Pro 


Lys 


Pro 


Thr 


Leu 


Ser 


Asp 


Thr 


Va 1 


Thr 








100 










105 










110 






Cys 


His 


Tyr 


His 


Gly 


Thr 


Leu 


He 


Asn 


Gly 


He 


Val 


Phe 


Asp 


Ser 


Ser 






115 










120 










125 








Met 


Asp 


Arg 


Gly 


Gl u 


Pro 


Ala 


Ser 


Phe 


Pro 


Leu 


Arg 


Gly 


Val 


He 


Ala 




130 










135 










14 0 










Gly 


Trp 


Thr 


Glu 


He 


Leu 


Gin 


Leu 


Met 


Pro 


Val 


Gly 


Ser 


Lys 


Trp 


Lys 


145 










150 










155 










160 


Val 


Thr 


He 


Pro 


Ser 


Asp 


Leu 


Ala 


Tyr 


Gly 


Asp 


Arg 


Gly Ala 


Gly 


Glu 










165 










170 










175 




His 


He 


Lys 


Pro 


Gly 


Ser 


Thr 


Leu 


He 


Phe 


He 


He 


Glu 


Leu 


Leu 


Ser 








180 










185 










190 






He 


Asn 


Lys 
































195 




























(2) 


INFORMATIOM 


FOR 


SEQ 


ID NO:388 

















40 



45 



50 



55 



60 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 
(E) TYPE: amino acid 
(D) TOPOLOGY: linear 

65 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

70 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



75 



(ix) FEATURE: 

(A) NAi'lE/KEY: misc_f eatur- 

(B) LOCATION 1 . . . 27 3 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929a70Al_l_> 



wo 99/29870 



PCT/AU98/01023 



324/490 



SEQUEIJCE DESCRI PTIOII : SEQ ID IJO:388 

Met Lys Lys Ala Leu Leu He Giy Ala Ala Leu Leu Gly Ala Val Ser 
5 1 5 10 15 

Phe Ala Ser Ala Gin Ser Leu Ser Thr He Ly^ Val Gin Asn Asn Ser 

20 25 30 

Val Gin Gin Pro Arg Glu Glu Ala Thr He Gin Val Cys Gly Glu Leu 
3 5 4 0 4 5 

10 Ala Glu Gin Val Asp Cys He Gly Thr Gly Asn Ser Ala He He Ala 
50 55 60 

Ala Ala Ala Lys Phe Glu Ser Asp Asp Leu Glu Ser Tyr Val Gly Trp 
65 70 75 80 

Glu He net Ser Val Asp Phe Phe Pro Gly Tyr Lys Ala Cys Lys Tyr 
15 85 90 95 

Thr Ser Ala Val Trp Ala Asp Asp Met Thr He Leu Gly Gin Ser Glu 

100 105 110 

Asp Ser Asp Pro Glu Met Gin Thr He Asn Asn Leu Ala Leu Lys Thr 
H5 120 125 

20 Ser Val Lyp He Glu Ala Gly Lys Asn Tyr He Val Gly Tyr He Ala 
130 135 140 

Asn Thr Ala Gly Gly His Pro He Gly Cys Asp Gin Gly Pro Ala Val 
150 155 160 

Asp Gly Tyr Gly Asp Leu Val Ser He Ser Glu Asp Gly Gly Ala Thr 
25 165 170 175 

Phe Pro Tro Phe Glu Ser Leu His Gin Ala Val Pro Thr Leu Asn Tyr 

180 185 190 

Asn He Tyr Val Val Val His Leu Lys Lys Gly Glu Gly Val Glu Ala 
195 200 205 

30 Val Leu Thr Asn Asp Lys Ala Asn Ala Tyr Val Gin Asn Gly Val He 
210 215 220 

Tyr Val Ala Gly Ala Asn Gly Arg Gin Val Ser Leu Phe Asp Met Asn 
225 230 235 240 

Giy Lys Val Val Tyr Thr Gly Val Ser Glu Thr He Ala Ala Pro Gin 
35 245 250 255 

Lys Gly I let Tyr He Leu Arg Val Gly Ala Lys Ser He Lys Leu Ala 
260 265 270 

He 



40 



60 



65 



70 



75 



(2) IHFORIiATIOH FOR SEQ ID MO: 38 9 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH: 554 amino acids 
45 (E) TYPE: amino acid 

(D) TOPOLOGV: linear 

(ii) IIOLECULE TYPE: protein 

50 (iii) HYPOTHETICAL: YES 

( V i ) OR I G 1 1 1 AL S OU RCE : 

(A) ORGAMISM: Porphyromonas gingiva lis 

55 (ix) FEATURE: 

(A) rJAl'IE/KEY : mis cofeature 

(B) LOCATION 1 ... 554 



iMi) SEQUEHCE DESCRIPTIOII: SEQ ID HO:38£ 



I let 


Pro 


Arg 


He 


Met 


Lys 


Leu 


Lys 


He 


Ala 


Leu 


Arg 


Leu 


Leu 


Leu 


Ala 


1 








5 










10 










15 






Phe 


Ala 


He 


Val 


Leu 


Phe 


Ser 


Pro 


Leu 


Ala 


Lys 


Ala 


Gin 


Met 


Asp 








20 










25 










30 






He 


Gly 


Gly 


Asp 


Asp 


Val 


Leu 


He 


Glu 


Thr 


Met 


Ser 


Thr 


Leu 


Ser 


Gly 






35 










4 0 










4 5 








Tyr 


Ser 


Glu 


Asp 


Phe 


Tyr 


Tyr 


Lys 


Met 


Ala 


Val 


Al a 


Asp 


Asn 


Gly 


Trp 




50 










55 










60 






He 


Tyr 


Val 


Met 


Leu 


Asp 


Phe 


Ser 


Arg 


He 




Phe 


Asp 


Asp 


Val 


Arg 


65 










70 










75 










80 


Leu 


Tyr 


Arg 


Ser 


Lys 


Asp 


Gly 


Gly 


Ala 


Thr 


Tyr 


Gin 


Lys 


Leu 


Gly 


Ser 










85 










90 










95 




Leu 


Gly 


Ser 


Leu 


Val 


Pro 


Tyr 


Asp 


Phe 


Asp 


Val 


Ser 


His 


Cys 


Asp 


Phe 








100 










105 










110 






He 


Val 


Thr 


'31 y 


Lys 


Asp 


Glu 


Asp 


Asp 


He 


Asn 


Val 


Trp 


Thr 


Val 


Met 



BNSDOCID: <WO 9929870A1_L> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 




wo 99/29870 PCT/AU 98/0 1023 

325/490 

115 120 125 

Thr Ala Phe Glu Tyr Val Gly Gly Thr lie Giy Asn Gly Val Leu Leu 

130 135 140 

Met His Arg His Asp Ala Asp He Asn Asn Thr Glu Cys Val Tyr Lys 
5 I'JS 150 155 160 

Lys Asp Phe Pro Asn Asn Arg Leu hlet Gly Val Ala He Ala Ser Asn 

165 170 17 5 

Tyr Arg Ala Pro Ser Pro Tyr Gly Leu Giy Gly Asp Pro Phe Ala Leu 
180 185 190 

10 Ala Val Ala Val Ser Gly Ser Gly Ser Asp His Ser Phe Leu Asp Tyr 
195 200 205 

He Phe Ser Leu Asp Gly Gly Val His Phe Glu Gin Lys Arg lie Tyr 

210 215 220 

Thr Arg Pro Gin Lys Leu Thr He Asn Arg Val Asp Leu Ser Leu Gly 
15 225 230 235 240 

Ser Thr Ser Pro Ser Leu Gly Phe Asn Thr Trp Pro Leu Met GJy Va] 

245 250 255 

Val Phe Glu ilet Asn Lys Asn Leu Asp Gly Phe Asp He Gly Phe He 
260 265 270 

20 Ser Asn Phe Val Asp Tyr Asp Pro Arg Tyr Ala Trp Ser Glu Pro He 
275 280 2 85 

He He Glu Glu Asp Cys Gly Trp Thr Asp Plie Asn Pro Leu Gly Ala 

2 90 2 95 300 

Leu Ser He Glu He Gin Met Met Leu Asp Asp Asn Ser Asp Asn Thr 
25 305 310 315 320 

Val Gly Gly Glu Arg Ser His Asn Phe Leu He Thr Tyr Pro Gly His 

325 330 335 

Tyr Val Tyv Pro Lys Gin Ser Phe Asn Tyr Ser Pro Gly His Thr Pro 
340 345 350 

30 Thr Lys Lys Asp Leu Val Phe Lys His Cys He Gly He Pro Ala Leu 
355 360 365 

Ala Tyr Asp Lys Glu Gly Asp Arg Tyr Leu Thr Thr Phe Gin Asp His 

370 375 380 

Asn Leu Met Arg Tyr Arg Trp He Lys Tyr Asp Asp He Asn Ser Phe 
35 385 390 3 95 4 00 

Tyr Gly Trp Ser Trp Pro Tyr Val Tyr Ala Lys Glu Ala Lys Asp Lys 

405 410 415 

Lys Arg Arg Arg Pro Gin Val Ala Leu Asn Pro Thr Asn Gly Lys Ala 
420 425 430 

40 C-ys Trp Val Trp His Thr Arg Lys Ser Pro Tyr Asp Glu Thr Lys Pro 
435 440 445 

His Pro Thr Pro Val He He Lys His Phe Leu Trp Ser Asp Thr Glu 

450 455 460 

Trp Val His Ala Leu Asp Val Gly Asp Val Leu Gin Lys Glu Gly Ser 
45 465 470 475 480 

Met Lys Leu Tyr Pro Asn Pro Ala Lys Glu Tyr Val Leu He Asn Leu 

4 85 ^90 ^95 

Pro Lys Glu Gly Gly His Glu Ala Val Val Tyr Asp Met Gin Gly Arg 
500 505 510 

50 He Val Glu Lys Val Ser Phe Ser Gly Lys Glu Tyr Lvs Leu Asn Val 
515 520 525 

Gin Tyr Leu Ser Lys Gly Thr Tyr Met Leu Lys Val Val Ala Asp Thr 

530 535 540 

Glu Tyr Phe Val Glu Lys He He Val Glu 
55 545 550 

(2) INFORMATIOH FOR SEQ ID NO: 390 

{i) SEQUEMCE CHARACTERISTICS: 
60 (A) LEIJGTH: 550 arr.ino acids 

(B) TYPE: amino acid 
(D} TOPOLOGY: linear 



65 



70 



Hi) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIIIAL SOURCE: 

(A) ORGAIIISt'I: Porphyromonas gingival i£ 

(i;:) FEATURE: 

(A) HAME/KEY: rriisc_f eature 

(B) LOCATION 1 . . .550 



75 (;-:i) SEQUEMCE DESCRI PTIOI J : SEQ ID 110:390 



BNSDOCID: <WO 9929870A1_I_> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 




wo 99/29870 PCT/AU98/01023 

326/ 490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Met 


Lys 


Lsu 


Lys 


He 


Ala 


Leu 


Arg 


Leu 


Leu 


Leu 


Ala 


Thr 


Phe 


Ala 


He 


1 








5 










10 










15 




Val 


Leu 


Phe 


Ser 


Pro 


Leu 


Ala 


Lys 


Ala 


Gin 


Met 


Asp 


He 


Gly 


Gly 


Asp 








20 










25 










30 






A^p 


Val 


Leu 


He 


Glu 


Thr 


Met 


Ser 


Thr 


Leu 


Ser 


Gly 


Tyr 


Ser 


Glu 


Asp 






35 










40 










45 








Phe 


Tyr 


Tyr 


Lys 


Met 


Ala 


Val 


Ala 


Asp 


Asn 


Gly 


Trp 


He 


Tyr 


Val 


Met 




50 










55 










6 0 










Leu 


Asp 


Phe 


Ser 


Arg 


He 


Tyr 


Phe 


Asp 


Asp 


Val 


Arg 


Leu 


Tyr 


Arg 


Ser 


65 










70 










75 










80 


Lys 


Asp 


Gly 


Gly 


Ala 


Thr 


Tyr 


Gin 


Lys 


Leu 


Gly 


Ser 


Leu 


Gly 


Ser 


Leu 










85 










90 










95 




Val 


Pro 


Tyr 


Asp 


Phe 


Asp 


Val 


Ser 


His 


Cys 


Asp 


Phe 


He 


Val 


Thr 


Gly 








100 










105 










HO 






Lys 


Asp 


Glu 


Asp 


Asp 


He 


Asn 


Val 


Trp 


Thr 


Va 1 


Met 


Thr 


Ala 


Phe 


Glu 






115 










120 










125 








Tyr 


Val 


Gly 


Gly 


Thr 


He 


Gly 


Asn 


Gly 


Val 


Leu 


Leu 


Met 


His 


Arg 


His 




130 










135 










14 0 










Asp 


Ala 


Asp 


He 


Asn 


Asn 


Thr 


Glu 


Cys 


Val 


Tyr 


Lys 


Lys 


Asp 


Phe 


Pro 












150 










155 










160 


Asa 


Asn 


Arg 


Leu 


Met 


Gly 


Val 


Ala 


He 


Ala 


Ser 


Asn 


Tyr 


Arg 


Ala 


Pro 










165 










170 










175 




Ser 


Pro 


Tyr 


Gly 


Leu 


Gly 


Gly 


Asp 


Pro 


Phe 


Ala 


Leu 


Ala 


Val 


Ala 


Val 








180 










185 










190 






Ser 


Gly 


Ser 


Gly 


Ser 


Asp 


His 


Ser 


Phe 


Leu 


Asp 


Tyr 


He 


Phe 


Ser 


Leu 






195 










200 










205 








Asp 


Gly 


Gly 


Val 


His 


Phe 


Glu 


Gin 


Lys 


Arg 


He 


Tyr 


Thr 


Arg 


Pro 


Gin 




210 










215 










220 










Lys 


Leu 


Thr 


He 


Asn 


Arg 


Val 


Asp 


Leu 


Ser 


Leu 


Gly 


Ser 


Thr 


Ser 


Pro 


1 o 










230 










235 










240 


Ser 


Leu 


Gly 


Phe 


Asn 


Thr 


Trp 


Pro 


Leu 


Me t 


Gly 


Val 


Val 


Phe 


Glu 


Met 










245 










250 










255 




A^n 


Lys 


Asn 


Leu 


Asp 


Gly 


Phe 


Asp 


He 


Gly 


Phe 


He 


Ser 


Asn 


Phe 


Val 








260 










265 










270 






Asp 


Tyr 


Asp 


Pro 


Arg 


Tyr 


Ala 


Trp 


Ser 


Glu 


Pro 


He 


He 


He 


Glu 


Glu 






275 










280 










285 








Asp 


Cys 


Gly 


Trp 


Thr 


Asp 


Phe 


Asn 


Pro 


Leu 


Gly 


Ala 


Leu 


Ser 


He 


Glu 




290 










295 










300 










He 


Gin 


Met 


Met 


Leu 


Asp 


Asp 


Asn 


Ser 


Asp 


Asn 


Thr 


Val 


Gly 


Gly 


Glu 


305 










310 










315 










320 


Arg 


Ser 


His 


Asn 


Phe 


Leu 


He 


Thr 


Tyr 


Pro 


Gly 


His 


Tyr 


Val 


Tyr 


Pro 










325 










330 










335 




Lys 


Gin 


Ser 


Phe 


Asn 


Tyr 


Ser 


Pro 


Gly 


His 


Thr 


Pro 


Thr 


Lys 


Lys 


Asp 








340 










345 










350 






Leu 


Val 


Phe 


Lys 


His 


Cys 


He 


Gly 


He 


Pro 


Ala 


Leu 


Ala 


Tyr 


Asp 


Lys 






355 










360 










365 








Glu 


Gly 


Asp 


Arg 


Tyr 


Leu 


Thr 


Thr 


Phe 


Gin 


Asp 


His 


Asn 


Leu 


Met 


Arg 




37«? 










375 










38 0 










Tyr 


Arg 


Trp 


He 


Lys 


Tyr 


Asp 


Asp 


He 


Asn 


Ser 


Phe 


Tyr 


Gly 


Trp 


Ser 


385 










3 90 










3 95 










400 


Trp 


Pre 


Tyr 


Val 


Tyr 


Ala 


Lys 


Glu 


Ala 


Lys 


Asp 


Lys 


Lys 


Arg 


Arg 


Arg 










405 










410 










415 




Pro 


Gin 


Val 


Ala 


Leu 


Asn 


Pro 


Thr 


Asn 


Gly 


Ly- 


Ala 


Cys 


Trp 


Val 


Trp 








420 










425 










430 






His 


Thr 


Arg 


Ly^ 


Ser 


Pro 


Tyr 


Asp 


Glu 


Thr 


Lys 


Pro 


His 


Pro 


Thr 


Pro 






435 










440 










445 








Val 


He 


He 


Lys 


His 


Phe 


Leu 


Trp 


Ser 


Asp 


Thr 


Glu 


Trp 


Val 


His 


Ala 




4 50 










455 










4 60 










Leu 


Asp 


Val 


Gly 


Asp 


Val 


Leu 


Gin 


Lys 


Glu 


Gly 


Ser 


Met 


Lys 


Leu 


Tyr 


465 










470 










475 










4 80 


Pro 


Asn 


Pro 


Ala 


Lys 


Glu 


Tyr 


Val 


Leu 


He 


Asn 


Leu 


Pro 


Lys 


Glu 


Gly 










485 










4 90 










495 




Gly 


His 


Glu 


Ala 


Val 


Val 


Tyr 


Asp 


Met 


Gin 


Gly 


Arg 


He 


Val 


Glu 


Lys 








500 










505 










510 






Val 


Ser 


Plte 


Ser 


Gly 


Lys 


Glu 


Tyr 


Lys 


Leu 


Asn 


Val 


Gin 


Tyr 


Leu 


Ser 






515 










520 










525 








Lys 


Gly 


Thr 


Tyr 


Met 


Leu 


Lys 


Val 


Val 


Ala 


Asp 


Thr 


Glu 


Tyr 


Phe 


Val 




5 30 










535 










54 0 










Glu 


Lys 


He 


He 


Val 


Glu 






















b45 










550 























(2) IHFORMATIOn FOR SEQ ID IJO:391 



BNSDCX:iD: <WO 9929B70A1_L> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 




25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



WO 99/29870 PCT/AU98/01023 

327/490 



( i ) SEQUEr ICE CHARACTERl STICS : 

(A) LENGTH: 390 amino acids 
{B} TYPE: amino acid 
(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

10 (vi) ORIGIHAL SOURCE: 

(A) ORG AM ISM: Porphyromonas qingivaiis 

(ix) FEATURE: 

(A) tlAT-lE/KEY: misc_feature 
15 (B) LOCATIOII 1 . . .390 

(xi) SEQUEDCE DESCRIPTION: SEQ ID HO: 391 



20 



net 


Lys 


Arg 


Leu 


Leu 


Pro 


Phe 


Leu 


Leu 


Leu 


Ala 


Gly 


Leu 


Val 


Ala 


Val 


1 








c 










10 










15 




Gly 


A.~n 


Val 


Ser 
2 0 


Ala 


Gin 


Ser 


Pro 


Arg 


He 


Pro 


Gin 


Val 


Asp 


Val 


His 


Thr 


Ara 


IJe 


Ala 


Arg 


Asn 


Ala 


Arg 


25 
Tyr 


Arc 


Leu 


Asp 


Lys 


3 0 
He 


Ser 


Val 






36 










40 










45 








Pro 


Asp 


Ser 


Arg 


Gin 


He 


Phe 


Asp 


Tyr 


Phe 


Tyr 


Lys 


Glu 


Glu 


Thr 


He 




50 










55 










60 










Pro 


Thr 




He 


Gin 


Thr 


Thr 


Thr 


Gly 


Gly 


Ala 


He 


Thr 


Ser 


He 


Asp 


66 










70 










75 










80 


Ser 


Leu 


Phe 


Tyr 


Glu 


Asp 


Asp 


Arg 


Leu 


Val 


Gin 


Val 


Arg 


Tyr 


Phe 


Asp 










85 










90 










95 




Asn 


Asn 


Leu 


Glu 


Leu 


Lys 


Gin 


Ala 


Glu 


Lys 


Tyr 


Val 


Tyr 


Asp 


Gly 


Ser 








100 










105 










110 






Lys 


Leu 


Val 


Leu 


Arg 


Glu 


He 


Arg 


Lys 


Ser 


Pro 


Thr 


Asp 


Glu 


Thr 


Pro 






115 










120 










125 








He 


Lys 


Lys 


Val 


Ser 


Tyr 


His 


Tyr 


Leu 


Cys 


Gly 


Ser 


Asp 


Met 


Pro 


Phe 




130 










135 










140 










Glu 


He 


Thr 


Thr 


Glu 


Met 


Ser 


Asp 


Gly 


Tyr 


Phe 


Glu 


Ser 


His 


Thr 


Leu 


145 










150 










155 










160 




Tyr 


Leu 


Asn 


Gly 


Lys 


He 


Ala 


Arg 


He 


Asp 


He 


r-Iet 


Thr 


Gin 


Gin 










165 










170 










175 




Asn 


Pro 


Ser 


Ala 


Glu 


Leu 


He 


Glu 


Thr 


Gly 


Arg 


Met 


Val 


Tyr 


Glu 


Phe 








180 










185 










190 






Asp 


Ala 


Asn 


Asn 


Asp 


Ala 


Val 


Leu 


Leu 


Arg 


Asp 


Ser 


Val 


Phe 


Leu 


Pro 






195 










200 










205 








Leu 


Gin 


Asn 


Lys 


Trp 


Val 


Glu 


1 let 


Phe 


Thr 


His 


Arg 


Tyr 


Thr 


Tyr 


Asp 




210 










215 










22 ^" 










Asn 


Lys 


His 


Asn 




He 


Arg 


Trp 


Glu 


Gin 


Asp 


Glu 


Phe 


Gly 


Thr 


Leu 


IOC 










2 30 










235 










240 


Thr 


Leu 


Al a 


Asn 


Asn 


Phe 


Glu 


Tyr 


Asp 


Thr 


Thr 


He 


Pro 


Leu 


Ser 


Ser 










245 










250 










255 




Val 


Leu 


Phe 


Pro 


Thr 


His 


Glu 


Glu 


Phe 


Phe 


Arg 


Pro 


Leu 


Leu 


Pro 


Asn 








2 60 










265 








270 






The 


1 let 


Lys 


His 


Met 


Arg 


Thr 


Lys 


Gin 


Thr 


Tyr 


Phe 


Asn 


Asn 


Ser 


Gly 






275 










2 80 










285 








Glu 


Gly 


Leu 


Ser 


Glu 


Val 


Cys 


Asp 


Tyr 


Asn 


Tyr 


Phe 


Tyr 


Thr 


Asp 


bJet 




290 










295 










300 










Gin 


Gly 


Asn 


Ala 


Leu 


Thr 


Asp 


Val 


Ala 


Val 


Asn 


Glu 


Ser 


He 


Lys 


He 


305 










310 










315 










32 0 


Tyr 


Pre 


Arg 


Pro 


Ala 


Thr 


Asp 


Phe 


Leu 


Arg 


He 


Glu 


Gly 


Ser 


Gin 


Leu 










325 










330 










335 




Leu 


Arg 


Leu 


Ser 


Leu 


Phe 


Asp 


Met 


Asn 


Gly 


Lys 


Leu 


He 


Arg 


Ala 


Thr 








340 










345 










350 






Glu 


Leu 


Thr 


Gly 


Asp 


Leu 


Ala 


He 


He 


Gly 


Val 


Ala 


Ser 


Leu 


Pro 


Arg 






355 










360 










365 






Gly 


Thr 


Tyr 


He 


Ala 


Glu 


He 


Thr 


Ala 


Ala 


Asn 


Ser 


Lys 


Thr 


He 


Arg 




370 










375 










330 








Ala 


Lys 


Val 


Ser 


Leu 


Arg 






















385 










390 






















(2) 


1 H FORMAT 1 OH 


FOR 


SEQ 


ID HO:3?2 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 



PCT/AU98/01023 



328/490 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 

5 

(vi) ORIGIIJAL SOURCE: 

(A) ORGAMISll: Porphyromonas gingivalis 

FEATURE; 

10 (A) WAIIE/KEY: mi£c_f eature 

(B) LOCATION 1 . . . 411 



(xi) SEQUEtlCE DESCRI PTIOt i : SEQ ID HO: 3 92 



15 


Met 


Arg 


Gin 


His 


Leu 


Ser 


Leu 


Phe 


Pro 


Phe 


He 


Leu 


Phe 


Leu 


Leu 


Leu 




1 








5 










10 










1 5 






Ala 


Phe 


Ser 


Tvr 

X j 1- 


Val 


Glv 




Arg 


Thr 


Vol 


A 


Gin 


Thr 


Pro 




Gl 










20 










2 5 










3 0 






20 




Gl u 


Arg 




Val 


Val 


Val 


L'='U 


S'^-r 


L'='U 


Asp 




Ph'=' 


Arg 


P 

ro 


ASp 






3 5 










4 0 










4 5 








T -> • r- 


T h r 


MS [J 


rg 


Al a 


A - 
rg 


Thr 


Pro 


Ala 


L*^ u 


ASp 


Arg 


M'=' t 






•"Z 1 1 1 
'^1 U 






50 










5 5 
























Tell 


S^r 




S'^ £ 


T 1 


(t; 1 m 
'ji n 


Pro 


Cys 


r ne 


Pro 




jbeu 


1 n 1 


tr ne 


Pro 




o ^ 










' U 




















0 0 


ASTi 


H j 


T j" r 


o e r 


1 let 




Thr 


'jly 


Leu 


Tyr 


Pro 


Asp 


His 


His 


Gly 


He 












8 5 










90 










95 






V a 1 




A" u 


J3 i, U 




Val 


Asp 


Ser 


Leu 


Leu 




lie 


r ne 


Arg 


lie 


Ser 






























110 






30 




Arg 




Al a 


Va 1 


Glu 


1 nr 


Pro 




r ne 


Trp 






Glu 


Pro 


Val 






115 










12 0 










12 5 










1 rp 


Asn 


Thr 


Al a 


Al a 


A - 


Gin 




T 1 

J. 1 e 


Arg 


1 ni 


J. y 




Tyr 


tr ne 


Trp 






130 










135 










1 A f} 
±H\J 












Val 




S*^r 


1j1 u 


Thr 


Al a 


V a -L 


Asn 


o J. y 


Asn 


Arg 


Pro 


Trp 


Arg 


Trp 


Lys 


35 


14 5 










1 50 










1 










16''^ 


L.v« 
jj^ - 


Phe 


Ser 


Ser 


Thr 


Val 


Pro 


Phe 




Asp 


Arg 


Al a 


A<= 
sp 


S'^r 


Val 


11*= 












165 










17 0 










17 5 






Ala 




Leu 


Gl V 


Leu 


Pro 


Glu 




Glu 


Arg 


Pro 




Leu 


L«u 


M'=t 


I rp 










18 0 










185 










190 






40 


Tyr 


He 


Glu 


Glu 


Pro 


Asp 


Met 


lie 


Gly 


His 


Ser 


Gin 


Thr 


Pro 


Glu 


Ser 




195 










200 








205 










Pro 


Leu 


Thr 


Leu 


Ala 


Met 


Val 


Glu 


Arg 


Leu 


Asp 


Ser 


Val 


Val 


Gly 


Tyr 






210 










215 










220 












Phe 


Arg 


Lys 


Arg 


Leu 


Asp 


Ser 


Leu 


Pro 


He 


Ala 


Ala 


Gin 


Thr 


Asp 


Phe 


45 


225 










230 










235 










240 


He 


He 


Vol 


Ser 


Asp 


His 


Gly 


Met 


Ala 


Thr 


Tyr 


Glu 


Asn 


Glu 


Lys 


cys 












245 










2 50 










255 






Val 


Asn 


Leu 


Ser 


Hi- 


Tyr 


Leu 


Pro 


Ala 


Asp 


Ser 


Phe 


Leu 


Tyr 


Met 


Ala 










260 










265 










270 






50 


Thr 


Gly 


Alfi 


Phe 


Thr 


Hi5 


Leu 


Tyr 


Pro 


Lys 


Pro 


Ser 


Tyr 


Thr 


Glu 


Arg 






27 5 










280 










285 










Ala 


Tyr 


Glu 


He 


Leu 


Arg 


Ala 


He 


Pro 


His 


He 


Ser 


Val 


Tyr 


Arg 


Lys 






290 










295 










300 












Gly 


Glu 


Val 


Pro 


Lys 


Arg 


Leu 


Arg 


Cys 


Gly 


Thr 


Asn 


Pro 


Arg 


Leu 


Gly 


55 


305 










310 










315 










320 


Glu 


Leu 


Va 1 


Val 


He 


Pro 


Asp 


He 


Gly 


Ser 


Thr 


Val 


Phe 


Phe 


Ala 


He 












325 










330 










335 






Asn 


Glu 


Asp 


Val 


Arg 


Pro 


Gly 


Ala 


Ala 


His 


Gly 


Tyr 


Asp 


Asn 


Gin 


Ala 










340 










345 










350 






60 


Pro 


Glu 


Met 


Arg 


Ala 


Leu 


Leu 


Arg 


Ala 


Val 




Pro 


Asp 


Phe 


Arg 


Pro 






355 










360 










365 








Gly 


Ser 


Arg 


Val 


Glu 


Asn 


Leu 


Pro 


Asn 


He 


Thr 


He 


Tyr 


Pro 


Leu 


lie 






37 0 










37 5 










380 












Cys 


Arg 


Leu 


Leu 


Gly 


He 


Glu 


Pro 


Ala 


Pro 


Asn 


Asp 


Ala 


Asp 


Glu 


Thr 


65 


385 










3 90 










395 










4 00 


Leu 


Leu 


Asn 


Gly 


Leu 


He 


Arg 


Asp 


Lys 


Arg 


Pro 













4 05 410 



(2) IHEQRMATIOt>l FOR SEQ ID tlO: 393 

70 (i) SEQUEHCE CHARACTERISTICS: 

(A) LEHGTH: 24 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

75 Hi) MOLECULE TYPE: protein 



BNSDOCID: .:WO 9929870A1_I_> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 




10 



45 



60 



65 



70 



WO 99/29870 PCT/AU 98/0 1023 

329/490 



(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) PEATURE: 

(A) MAJ'lE/KEY: misc_feature 

(B) LOCATION 1 ... 246 

(::i) SEQUEIICE DESCRI PTIOIJ : SEQ ID IIO:393 



M^t Ly£ Val Gly Leu Phe lie Pro Cys Tyr Val Asn Ala Val Tyr Pro 
1 5 10 15 

lO Glu Val Gly lie Ala Thr Tyr Lys Leu Leu Lys Ser Leu Asp lie Asp 
20 25 30 

Val Asp Tyr Pro Met Asp Gin Thr Cys Cys Gly Gin Pro Met Ala Asn 

35 4 0 AS 

Ala Gly Phe Glu Gin Lys Ala Gin Lys Leu Ala Leu Arg Phe Glu Glu 
20 50 55 60 

Leu Phe Glu Ser Tyr Asp Val Val Val Gly Fro Ser Ala Ser Cys Val 
65 70 75 80 

Ala Phe Val hys Glu Asn Tyr Asp His He Leu Arg Fro Thr Gly hi is 
85 90 95 

25 Val Cys Lys Ser Ala Ala Lys Val Arg Asp He Cys Glu Phe Leu His 
100 105 110 

Asp Asp Leu Lys He Thr Ser Leu Pro Ser Ara Phe Ala His Lys Val 

11^^ 120 125 

Ser Leu His Asn Ser Cys His Gly Val Arg Glu Leu His Leu Ser Thr 
OU 130 135 140 

Pro Ser Glu Val His Arg Pro Tyr His Asn Lys Val Arg Arg Leu Leu 
150 155 160 

Glu Met Val Gin Gly He Glu Val Phe Glu Pro Lys Arg He Asp Glu 

170 175 
Cys Cys Gly Phe Gly Gly Met Tyr Ser Val Glu Glu Pro Glu Val Ser 

180 185 190 

Thr Cys Met Gly His Asp Lys Val Leu Asp His He Ser Thr Gly Ala 

195 200 205 

Glu Tyr He Thr Gly Pro Asp Ser Ser Cys Leu Met His Met Gin Gly 
40 210 215 220 

Val He Asp Arg Glu Lys Leu Pro He Lys Thr He His Ala Val Glu 
225 230 235 240 

He Leu Ala Ala Asn Leu 
2^5 



(2) IIIFORMATIOM FOR SEQ ID 1)0:394 



(i) SEQUEtJCE CHARACTERISTICS: 

(A) LENGTH: 24 6 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

55 (iii) HYPOTHETICAL: YES 

(vi) ORIGIIIAL SOURCE: 

(A) ORGAIIISM: Po rpliyromonas gingival i.= 



(ix) FEATURE: 

(A) MAIJE/KEY: misc^feature 

(B) LOCATION 1 . . .24 6 

(xi) SEQUEIICE DESCRIPTIOII : SEQ ID 110:394 



Met Asp He Val Ser Met Ala Asp Lys Ala Leu Val Val Glu Met Arg 
1 5 10 15 

Asp Val Thr Leu Cys Gin Glu Glu Asn Val He Phe Gin Asn Leu Asn 

20 25 30 

Leu Thr Leu Ser Ala Gly Asp Phe Val Tyr Leu He Gly Ser Val Gly 

35 4 0 Ab 

Ser Gly Lys Ser Thr Leu Leu Lys Ala Leu Tyr Ala Glu Val Pro He 

50 55 60 

Ser Ala Gly Tyr Ala Arg Val He Asp Tyr Asp Leu Ala Lys Leu Lys 
/O 65 7 0 7 5 8 0 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1J_> 




wo 99/29870 PCT/AU 98/0 1023 

330/490 



10 



15 



20 



25 



30 



50 



55 



60 



65 



70 



75 



Arg 


Lys 


Gin 


Leu 


Pro 


Tyr 


Leu 


Arg 


Arg 


Asn 


Leu 


Gly 


He 


Val 


Phe 


Gin 










85 










90 










95 




Asp 


Phe 


Gin 


Leu 


Leu 


Asn 


Gly 


Arg 


Thr 


Val 


Ala 


Glu 


Asn 


Leu 


Asp 


Phe 








100 










105 










110 






Val 


Leu 


Arg 


Ala 


Thr 


Asp 


Trp 


Lys 


Asn 


Arg 


Ala 


Asp 


Arg 


Glu 


Gin 


Arg 






115 










120 










125 








lie 


Glu 


Glu 


Val 


Leu 


Thr 


Arg 


Val 


Gly 


Met 


Ser 


Arg 


Lys 


Ala 


Tyr 


Lys 




130 










135 










140 










Arg 


Pro 


His 


Glu 


Leu 


Ser 


Gly 


Gly 


Glu 


Gin 


Gin 


Arg 


Val 


Gly 


He 


Ala 


145 










150 










155 










160 


Arg 


Ala 


Leu 


Leu 


Ala 


Lys 


Pro 


Ala 


Leu 


He 


Leu 


Ala 


Asp 


Glu 


Pro 


Thr 










165 










170 










175 




Gly 


Asn 


Leu 


Asp 


Ser 


Val 


Thr 


Gly 


Leu 


Gin 


He 


Ala 


Ser 


Leu 


Leu 


Tyr 








IBO 










185 










190 






Glu 


lie 


Ser 


hys 


Gin 


Gly 


Thr 


Ala 


Val 


Leu 


Met 


Ser 


Thr 


His 


Asn 


Ser 






195 










200 










205 








Ser 


Leu 


Leu 


Ser 


Mis 


Leu 


Pro 


Ala 


Arg 


Thr 


Leu 


Ala 


Val 


Arg 


Lys 


Asn 




210 










215 










22 0 










Gly 


A.-P 


Ala 


Ser 


Ser 


Leu 


Val 


Glu 


Leu 


Ser 


Ala 


Asp 


Ala 


Val 


Ser 


Arg 


225 










230 










235 










24 0 




Asn 


Thr 


Gl u 


lie 


Asp 























(2) IMFORIIATIOII FOR SEQ ID /JO: 3 95 

(i) SBQUEHCE CHARACTERISTICS: 

(A) LEIIGTH: 241 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii ) HYPOTHETICAL: YES 

35 (vi) ORIGINAL SOURCE: 

(A) ORGAMISI'I: Porphyromonas gingival is 

(ix) FEATURE: 

(A) iJAUE/KEY: misc_f eature 
40 { B ) LOCATION 1 ... 2 4 1 

(Mi) SEQUENCE DESCRIPTION: SEQ ID ]]0:395 



45 



Met 


Ala 


Asp 


Lys 


Ala 


Leu 


Val 


Val 


Glu 


Met 


Arg 


Asp 


Val 


Thr 


Leu 


Cys 


1 








5 










10 










15 




Gin 


Glu 


Glu 


Asn 


Val 


He 


Phe 


Gin 


Asn 


Leu 


Asn 


Leu 


Thr 


Leu 


Ser 


Ala 








20 










25 










30 






Gly 


Asp 


Phe 


Val 


Tyr 


Leu 


He 


Gly 


Ser 


Val 


Gly 


Ser 


Gly 


Lys 


Ser 


Thr 






35 










40 










45 








Leu 


Leu 


Lys 


Ala 


Leu 


Tyr 


Ala 


Glu 


Val 


Pro 


He 


Ser 


Ala 


Gly 


Tyr 


Ala 




50 










55 










60 










Arg 


Val 


He 


Asp 


Tyr 


Asp 


Leu 


Ala 


Lys 


Leu 


Lys 


Arg 


Lys 


Gl n 


Leu 


Pro 


65 










70 










75 










80 


Tyr 


Leu 


Arg 


Arg 


Asn 


Leu 


Gly 


He 


Val 


Phe 


Gin 


Asp 


Phe 


Gin 


Leu 


Leu 










85 










90 










95 




Asn 


Gly 


Arg 


Thr 


Val 


Ala 


Glu 


Asn 


Leu 


Asp 


Plie 


Val 


Leu 


Arq 


Ala 


Thr 








100 










105 










110 






Asp 


Trp 


Lys 


Asn 


Arg 


Ala 


Asp 


Arg 


Glu 


Gin 


Arg 


He 


Glu 


Glu 


Val 


Leu 






115 










120 










125 








Thr 


Arg 


Val 


Gly 


Met 


Ser 


Arg 


Lys 


Ala 


Tyr 


Lys 


Arg 


Pro 


His 


Glu 


Leu 




130 










135 










140 










Ser 


Gly 


Gly 


Glu 


Gin 


Gin 


Arg 


Val 


Gly 


He 


Ala 


Arg 


Ala 


Leu 


Leu 


Ala 


14 5 










150 










155 










160 


Lys 


Pro 


Ala 


Leu 


He 


Leu 


Ala 


Asp 


Glu 


Pro 


Thr 


'31y 


Asn 


Leu 


Asp 


Ser 










165 










170 










175 




Val 


Thr 


Gly 


Leu 


Gin 


He 


Ala 


Ser 


Leu 


Leu 


Tyr 


Glu 


He 


Ser 


Lys 


Gin 








180 










185 










150 






Gly 


Thr 


Ala 


Val 


Lgu 


Met 


Ser 


Thr 


His 


Asn 


Ser 


Ser 


Leu 


Leu 


Ser 


His 






195 










200 










205 








Leu 


Pro 


Ala 


Arg 


Thr 


Leu 


Ala 


Val 


Arg 


Lys 


Asn 


Gly 


Asp 


Ala 


Ser 


Ser 




210 










215 




















Leu 


Val 


Glu 


Leu 


Ser 


Ala 


Asp 


Ala 


Val 


Ser 


Arg 


Lys 


Asn 


Thr 


Glu 


He 


225 










230 










235 













Asp 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I_> 




10 



15 



60 



65 



70 



75 



WO 99/29870 PCT/AU98/01023 

331/490 



(2) INFORMATIO]! FOR SEQ ID MO: 396 

(i) SEQUEIJCE CHARACTERISTICS: 

(A) LEIIGTH: 232 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIilAL SOURCE: 

(A) ORGAN I SI I: Porphyromonas gingival is 

(iM) FEATURE: 

(A) IIA1-1E/KEY: mi 5c_ feature 

(B) LOCATIOM 1 . . .232 



20 (xi) SEQUEMCE DESCRIPTION: SEQ ID MO: 3 96 



25 



llet 


Arg 


Asp 


Val 


Thr 


Leu 


Cys 


Gin 


Glu 


Glu 


Asn 


Val 


He 


Phe 


Gin 


Asn 


1 








5 










10 










15 




Leu 


Asn 


Leu 


Thr 


Leu 


Ser 


Ala 


Gly 


Asp 


Phe 


Va 1 


Tyr 


Leu 


He 


Gly 


Ser 








20 










25 










30 




Val 


Gly 


Ser 


'31 y 


Lys 


Ser 


Thr 


Leu 


Leu 


Lys 


Ml 5 


Leu 


Tyr 


Ala 


Glu 


Va 1 






35 










AO 










A5 








Pro 


He 


Ser 


Ala 


Gly 


Tyr 


Ala 


Arg 


Val 


He 


Asp 


Tyr 


Asp 


Leu 


Ala 


Lys 




50 










55 










60 










Leu 


Lys 


Arg 


Lys 


Gin 


Leu 


Pro 


Tyr 


Leu 


Arg 


Arg 


Asn 


Leu 


Gly 


He 


Val 


65 










70 










75 










80 


Phe 


Gin 


Asp 


Phe 


Gin 


Leu 


Leu 


Asn 


Gly 


Arg 


Thr 


Val 


Ala 


Glu 


Asn 


Leu 










85 










90 










95 




Asp 


Phe 


Val 


Leu 


Arg 


Ala 


Thr 


Asp 


Trp 


Lys 


Asn 


Arg 


Ala 


Asp 


Arg 


Glu 








100 










105 










110 






Gin 


Arg 


He 


Glu 


Glu 


Val 


Leu 


Thr 


Arg 


Val 


Gly 


Itet 


Ser 


Arg 


Lys 


Ala 






115 










120 










125 








Tyr 


Lys 


Arg 


Pro 


His 


Glu 


Leu 


Ser 


Gly 


Gly 


Glu 


Gin 


Gin 


Arg 


Val 


Gly 




130 










135 










140 






lie 


Ala 


Arg 


Ala 


Leu 


Leu 


Ala 


Lys 


Pro 


Ala 


Leu 


He 


Leu 


Ala 


Asp 


Glu 


1^5 










150 










155 










160 


Pro 


Thr 


Gly 


Asn 


Leu 


Asp 


Ser 


Val 


Thr 


Gly 


Leu 


Gin 


He 


Ala 


Ser 


Leu 










165 










170 










175 




Leu 


Tyr 


Glu 


He 


Ser 


Lys 


Gin 


Gly 


Thr 


Ala 


Val 


Leu 


Met 


Ser 


Thr 


His 








180 










185 










190 






Asn 


Ser 


Ser 


Leu 


Leu 


Ser 


His 


Leu 


Pro 


Ala 


Arg 


Thr 


Leu 


Ala 


Val 


Arg 






195 










200 








205 






Lys 


Asn 


Gly 


Asp 


Ala 


Ser 


Ser 


Leu 


Val 


Gl u 


Leu 


Ser 


Ala 


Asp 


Ala 


Val 




210 










215 










2 20 










Ser 


Arg 


Lys 


Asn 


Thr 


Glu 


He 


Asp 


















225 










230 






















(2) 


INFORMATION 


FOR 


SEQ 


ID NO:397 

















30 



35 



40 



45 



50 



55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



Hi} MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1 . . .219 

(;:i) SEQUENCE DESCRIPTION: SEQ ID NO: 3 97 

Met He Glu He Ser Asn Leu Thr Lys Val Phe Arg Thr Glu Glu He 

1 5 10 15 

Glu Thr Val Ala Leu Asp Gly Val Ser Leu Lys Val Asp Lys Gly Glu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_t_> 




wo 99/29870 PCT/AU98/01023 

332/490 



10 



15 



20 



25 



35 



40 



55 



60 



65 



70 



75 









2 0 










25 










30 






Phe 


li- 


Ala 

35 


He 


Met 


Gly 


Pro 


Ser 
40 


Gly 


Cys 


'3ly 


Lys 


Ser 
45 


Thr 


Leu 


Leu 


Asm 


ne 
50 


Leu 


Gly 


Leu 


Leu 


Asp 
55 


Asn 


Pro 


Thr 


Ser 


Gly 

60 


He 


Tyr 


Lys 


Leu 


Asp 


Gly 


Ala 


Glu 


Val 


Gly 


Asn 


Leu 


Arcj 


Glu 


Lys 


Asp 


Arg 


Thr 


Ala 


Val 


66 










7 0 










15 










80 


Arg 


Lyt- 


Gly 


A^n 


He 
8 5 


Gly 


Phe 


Val 


Phe 


Gin 
90 


Ser 


Phe 


Asn 


Leu 


He 
95 


Glu 


Glu 


Met 


T Ill- 


Val 
100 


Ser 


Glu 


A^n 


Val 


Glu 
105 


Leu 


Pro 


Leu 


Val 


Tyr 
110 


Leu 


Gly 


Val 


Ly- 


Ala 
115 


Ser 


Glu 


Arg 


Lys 


Glu 
120 


Arg 


Val 


Glu 


Glu 


Ala 
125 


Leu 


Arg 


Lys 


Met 


Ser 
130 


He 


Ser 


Hi^ 


Arg 


Ala 
135 


Gly 


His 


Phe 


Pro 


Asn 
lAO 


Gin 


Leu 


Ser 


Gly 


Gly 


Gi n 


Gin 


Gin 


Arg 


Val 


Ala 


He 


Ala 


Arg 


Ala 


Val 


Val 


Ala 


Asn 


Pro 


145 










150 










155 










160 


Ly.- 


Leu 


He 


Leu 


Ala 
165 


Asp 


Glu 


Pro 


Thr 


Gly 
17 0 


Asn 


Leu 


Asp 


Ser 


Lys 
175 


Asn 


Gly 


Ala 


Asp 


Val 
180 


Met 


Glu 


Leu 


Leu 


Arg 
185 


Gly 


Leu 


Asn 


Arg 


Glu 
190 


Gly 


Ala 


Thr 


He 


Val 
195 


Met 


Val 


Thr 


His 


Ser 

200 


Glu 


His 


Asp 


Ala 


Arg 
2 05 


Ser 


Ala 


Gly 


Arg 


He- 
210 


He 


Asn 


Leu 


Phe 


Asp 
215 


Gly 


Lys 


He 


Arg 













(2) IIIFORMATIOII FOR SEQ TD IIO:598 



(i) SEQL'EHCE CHARACTERISTICS: 
30 (A) LEHGTH: 595 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGII-iAL SOURCE: 

(A) ORGANISM: Porphyrorr.onas gingiva lis 

(ix) FEATURE: 

(A) IIAME/KEY: misc_f eature 

(B) LOCATION 1 ... 595 



45 {::i) SEQUEMCE DESCRI PTIOl J : SEQ ID HO: 398 



50 



Met 


Lys 


Glu 


Phe 


Phe 


Lys 


net 


Phe 


Phe 


Ala 


Ser 


He 


Leu 


Gly 


Val 


He 


1 


















10 










15 




Thr 


Ala 


Gly 


He 


He 


Leu 


Phe 


cys 


He 


Phe 


Leu 


Phe 


He 


Phe 


Phe 


Gly 








20 










25 










30 






He 


Val 


Ala 


Gly 


He 


Ala 


Ser 


Lys 


Ala 


Thr 


Gly 


Gly 


Thr 


He 


Pro 


Lys 






35 










40 










45 






He 


Glu 


Ala 


Asn 


Ser 


He 


Leu 


His 


He 


Unk 


A^n 


Ser 


Ser 


Phe 


Pro 


Glu 




50 




















60 










He 


Val 


Ser 


Ala 


Asn 


Pro 


Trp 


Ser 


Met 


Leu 


Thr 


Gly 


Lys 


Asp 


Gl u 


Ser 


65 










70 










75 








80 


Val 


Ser 


Leu 


Ser 


Gin 


Ala 


Val 


Glu 


Ala 


He 


Gly 


Gin 


Ala 


Lys 


Asn 


Asn 










85 










90 














Pro 


Asn 


He 


Thr 


Gly 


He 


Phe 


Leu 


Asp 


Leu 


Asp 


Asn 


Leu 


Ser 


Val 


Gly 








100 










105 










110 






Net 


Ala 


Ser 


Ala 


Gl u 


Glu 


Leu 


Arg 


Arg 


Ala 


Leu 


Gin 


Asp 


Phe 


Lys 


Met 






115 










120 










125 








Ser 


Gly 


Lys 


Phe 


Val 


Val 


Ser 


Tyr 


Ala 


Asp 


Arg 


Tyr 


Thr 


Gin 


Lys 


Gly 




130 










135 










140 










Tyr 


Tyr 


Leu 


Ser 


Ser 


He 


Ala 


Asp 


Lys 


Leu 


Tyr 


Leu 


Asn 


Pro 


Lys 


Gly 


145 










150 










155 










160 


Met 


Leu 


Gly 


Leu 


He 


Gly 


He 


Ala 


Thr 


Gin 


Thr 


J let 


Phe 


Tyr 


Lys 


Asp 










165 










170 










175 




Ala 


Leu 


Asp 


Lys 


Phe 


Gly 


Val 


Lys 


Met 


Glu 


He 


Phe 


Lys 


Val 


Gly 


Thr 








180 










185 










190 






Tyr 


Lys 


Ala 


Ala 


Val 


Glu 


Pro 


Phe 


Met 


Leu 


Asn 


Arg 


Met 


Ser 


Asp 


Ala 






195 










200 








205 








Asn 


Arg 


Glu 


Gl n 


He 


Thr 


Thr 


Tyr 


He 


As n 


Gly 


Leu 


Trp 


Asp 


Lys 


He 




210 










215 










220 








Thr 


Ser 


Asp 


He 


Ala 


Glu 


Ser 


Arg 


Lys 


Thr 


Ala 


Met 


Asp 


Ser 


Val 


Lys 
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22S 230 235 2A0 

1-let Phe Ala Arp Lys Gly Glu Met Phe Gly Leu Ala Glu hys Ala Val 

2Ab 250 265 

Glu I-let Lys Leu Val Asp Glu Leu Ala Tyr Arg The Asp Val Glu Lys 
5 2bO 265 270 

Glu Leu Lys Lys Met Ser Gin Arg Gly Glu Lys Asp Glu Lou Arg Phe 

275 280 285 

Val Ser Leu Ser Gin Val Leu Ala Asn Gly Pro Met Asn Lys Thr Lys 
290 295 300 

10 Gly Ser Arg He Ala Val Leu Phe Ala Glu Gly Glu lie Thr Glu Glu 
305 310 315 320 

He He Lys Lys Pro Phe Asp Thr Asp Gly Ser Ser He Thr Gin Glu 

325 330 335 

Leu Ala Lys Glu He Lys Ala Ala Ala Asp Asp Asp Asp He Lys Ala 
15 340 3-J5 350 

Val Val Leu Arg Val Asn Ser Pro Gly Gly Ser Ala Phe Thr Ser Glu 

355 360 355 

Gin He Trp Lys Gin Val Ala Asp Leu Lys Ala Lys Lys Pro He Val 

375 380 
ZU Val Ser Met Gly Asp Val Ala Ala Ser Gly Gly Tyr Tyr He Ala Cys 
385 390 395 ^00 

Ala Ala Asn Ser He Val Ala Glu His Thr Thr Leu Thr Gly Ser He 

4 05 410 415 

Gly He Phe Gly Met Phe Pro Asn Phe Ala Gly Val Ala Lys Lys He 

4 20 ^25 4 30 

Gly Val Asn Met Asp Val Val Gin Thr Ser Lys Tyr Ala Asp Leu Gly 

435 440 
Asn Thr Phe Ala Pro Met Thr Val Glu Asp Arg Ala Leu He Gin Ara 

4 50 4 55 4 60 

Tyr He Glu Gin Gly Tyr Asp Leu Phe Leu Thr Arg Val Ser Glu Gly 
465 470 475 480 

Arg Asn Arg Thr Lys Ala Gin He Asp Ser He Ala Gin Gly Arg Val 

4 85 4 90 4 95 

Trp Leu Gly Asp Lys Ala Leu Ala Leu Gly Leu Val Asp Glu Leu Gly 

500 505 510 

Gly Leu Asp Thr Ala He Lys Arg Ala Ala Lys Leu Ala Gin Leu Gly 

515 520 525 

Gly Asn Tyr Ser He Glu Tyr Gly Lys Thr Lys Arg Asn Phe Phe Glu 
530 535 540 

4U Glu Leu Leu Ser Ser Ser Ala Ala Asp Met Lys Ser Ala He Leu Ser 
5^5 550 555 560 

Thr He Leu Ser Asp Pro Glu He Glu Val Leu Arg Glu Leu Arg Ser 

565 570 575 

Met Pro Pro Arg Pro Ser Gly He Gin Ala Arg Leu Pro Tyr Tyr Phe 
45 580 585 590 

Met Pro Tyr 
5 95 



30 



35 



50 



55 



(2) INFORMATION FOR SEQ ID IIO:399 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 589 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

60 (vi) ORIGINAL SOURCE: 

{A) ORGANISM: Porphyromonas gingivalis 

(iM) FEATURE: 

(A) NAME/FXY: inisc_f eatur e 
o5 (B) LOCATION 1 . . . 58 9 

(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 3 99 

Met Phe Phe Ala Ser He Leu Gly Val He Thr Ala Gly He He L-u 

70 15 10 15 

Phe Cys He Phe Leu Phe He Phe Phe Gly He Val Ala Gly He Ala 
20 25 30 

Ser Lys Ala Thr Gly Gly Thr He Pro Lys He Glu Ala Asn Ser 11° 
7c 35 40 45 

/O Leu His He Unk Asn Ser Ser Phe Pro Glu He Val Ser Ala Asn Pro 
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50 55 60 

Trp Ser I let Leu Thr Gly Lys Asp Glu Ser Val Ser Leu Ser Gin Ala 

70 75 80 

Val Glu Ala lie Gly Gin Ala Lys Asn Asn Pro Asn He Thr Gly He 
5 85 90 95 

Phe Leu Asp Leu Asp Asn Leu Ser Val Gly Met Ala Ser Ala Glu Glu 

100 105 110 

Leu Arg Arg Ala Leu Gin Asp Phe Lys Met Ser Gly Lys Phe Val Val 
115 120 125 

10 Ser Tyr Ala Asp Arg Tyr Thr Gin Lys Gly Tyr Tyr Leu Ser Ser He 
130 135 140 

Ala Asp Lys Leu Tyr Leu Asn Pro Lys Gly llet Leu Gly Leu He Gly 
I'iS 150 155 160 

He Ala Thr Gin Thr Met Phe Tyr Lys Asp Ala Leu Asp Lys Phe Gly 
15 165 170 175 

Val Lys Ilet Glu He Phe Lys Val Gly Thr Tyr Lys Ala Ala Val Glu 

100 185 190 

Pro Phe Met Leu Asn Arg Met Ser Asp Ala Asn Arg Glu Gin He Thr 
195 200 205 

20 Thr Tyr He Asn Gly Leu Trp Asp Lys He Thr Ser Asp He Ala Glu 
210 215 220 

Ser Arg Lys Thr Ala Met Asp Ser Val Lys Met Phe Ala Asp Lys Gly 
225 230 235 2^0 

Glu Met Phe Gly Leu Ala Glu Lys Ala Val Glu Met Lys Leu Val Asp 
25 245 250 255 

Glu Leu Ala Tyr Arg Thr Asp Val Glu Lys Glu Leu Lys Lys Met Ser 

260 265 270 

Gin Arg Gly Glu Lys Asp Glu Leu Arg Phe Val Ser Leu Ser Gin Val 

275 280 285 

Leu Ala Asn Gly Pro Met Asn Lys Thr Lys Gly Ser Arg He Ala Val 

290 295 300 

Leu Phe Ala Glu Gly Glu He Thr Glu Glu He He Lys Lys Pro Phe 
305 310 315 320 

Asp Thr Asp Gly Ser Ser He Thr Gin Glu Leu Ala Lys Glu He Lys 
35 325 330 335 

Ala Ala Ala Asp Asp Asp Asp He Lys Ala Val Val Leu Arg Val Asn 

340 345 350 

Ser Pro Gly Gly Ser Ala Phe Thr Ser Glu Gin He Trp Lys Gin Val 
355 360 36b 

40 Ala Asp Leu Lys Ala Lys Lys Pro He Val Val Ser Met Gly Asp Val 
370 375 380 

Ala Ala Ser Gly Gly Tyr Tyr He Ala Cys Ala Ala Asn Ser He Val 
385 390 395 400 

Ala Glu His Thr Thr Leu Thr Gly Ser He Gly He Phe Gly Met Phe 
45 405 410 415 

Pro Asn Phe Ala Gly Val Ala Lys Lys He Gly Val Asn Met Asp Val 

^20 425 430 

Val Gin Thr Ser Lys Tyr Ala Asp Leu Gly Asn Thr Phe Ala Pro Met 

435 440 445 

Thr \'al Glu Asp Arg Ala Leu He Gin Arg Tyr He Glu Gin Gly Tyr 

450 455 460 

Asp Leu Phe Leu Thr Arg Val Ser Glu Gly Arg Asn Arg Thr Lys Ala 
465 470 475 480 

Gin He Asp Ser He Ala Gin Gly Arg Val Trp Leu Gly Asp Lys Ala 
55 485 490 495 

Leu Ala Leu Gly Leu Val Asp Glu Leu Gly Gly Leu Asp Thr Ala He 

500 505 510 

Lys Arg Ala Ala Lys Leu Ala Gin Leu Gl^' Gly Asn Tyr Ser He Glu 

515 520 525 

Tyr Gly Lys Thr Lys Arg Asn Phe Phe Glu Glu Leu Leu Ser Ser Ser 

^30 535 540 

Ala Ala Asp Met Lys Ser Ala He Leu Ser Thr He Leu Ser Asp Pro 
^'IS- 550 555 560 

Glu He Glu Val Leu Arg Glu Leu Arg Ser Met Pro Pro Arg Pro Ser 

565 570 575 

Gly He Gin Ala Arg Leu Pro Tyr Tyr Phe Met Pro Tyr 
580 585 



50 



60 



65 



70 



75 



(2) inFORMATlOM FOR SEQ ID MO: 400 

;i) SEQUEUCE CHARACTERISTICS: 

(A)' LEIJGTH: 313 amino acids 
{B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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wo 99/29870 PCT/AU98/01023 

335/490 



(ii) MOLECULE TiTE: protein 

(iii) HYPOTHETICAL: YES 

5 (vi) ORIGItlAL SOURCE: 

(A) ORGAMISIi: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) IJAllE/KEY: misc_feature 
10 (B) LOCATIOII 1...313 

(xi) SEQUENCE DESCRIPTIOH: SEQ ID MO: 4 00 



15 



65 



Ifet 


Arg 


Ala 


Asn 


He 


Trp 


Gin 


He 


Lou 


Ser 


Val 


Ser 


Val 


Leu 


Phe 


Phe 


1 








5 










10 










15 




Phe 


Gly 


Thr 


Ala 


He 


Gly 


Gin 


Ala 


Gin 


Ser 


Arg 


Asn 


Arg 


Thr 


Tyr 


Glu 








20 










25 










3 0 






Ala 


Tyr 


Val 


Lys 


Gin 


Tyr 


Ala 


Asp 


Glu 


Ala 


He 


Arg 


Gin 


Met 


Ser 


Arg 






35 










4 0 










45 








Tyr 


Asn 


He 


Pro 


Ala 


Ser 


He 


Thr 


He 


Ala 


Gin 


Aia 


Leu 


Val 


Glu 


Thr 














55 










60 










Gly 


Ala 


Gly 


Ala 


Ser 


Thr 


Leu 


Ala 


Ser 


Val 


His 


Asn 


Asn 


His 


Phe 


Gly 


65 










7 0 










75 










0 0 


He 


hys 


C'v'^ 


His 


Lys 


Ser 


Trp 


Thr 


Gly 


Lys 


Arg 


Thr 


Tyr 


Arg 


Thr 


Asp 










85 










90 










95 




Asp 


Ala 


Pro 


Asn 


Glu 


Cys 


Phe 


Arg 




Tyr 


Ser 


Ala 


Ala 


Arg 


Glu 


Ser 








100 










105 










110 






T/i- 


Glu 


Asp 


His 


Ser 


A.rg 


Phe 


Leu 


Leu 


Gin 


Pro 


Arg 


Tyr 


Arg 


Pre 


Leu 






115 










120 










125 








r-he 


Lyi? 


Leu 


Asp 


Arg 


Glu 


Asp 


Tyr 


Arg 


Gly 


Trp 


Ala 


Thr 


Gly 


Leu 


Gin 




130 










135 










1^0 










f\ia 




Gly 


Tyr 


Ala 


Thr 


Asn 


Arg 


Gly 


Tyr 


7\la 


Asn 


Leu 


Leu 


He 


Lys 












150 










155 










160 


I-Iet 


Val 


Glu 


Leu 


T:i*r 


Glu 


Leu 


Tyr 


Ala 


Leu 


Asp 


Arg 


Glu 


Lys 


Tyr 


Pro 










165 










170 










175 




Ser 


Trp 


Phe 


His 


Lys 


Ser 


Tyr 


Pro 


Gly 


Ser 


Asn 


Lys 


Lys 


Ser 


His 


Gin 








180 










185 










190 






Thr 


Thr 


Ly^ 


Gin 


L;,'s 


Gin 


Ser 


G 1 y 


Leu 


Lys 


His 


Glu 


Ala 


Tyr 


Phe 


Ser 






195 










200 










205 








Tyr 


Gly 


Leu 


Leu 


Tyr 


He 


He 


Ala 


Lys 


Gin 


Gly 


Asp 


Thr 


Phe 


Asp 


Ser 




210 










215 










220 










Leu 


Ala 


Glu 


Glu 


Phe 


Asp 


Met 


Arg 


Ala 


Ser 


Lys 


Leu 


Ala 


Lys 


Tyr 


Asn 


225 










230 










235 










2-10 


Asp 


Ala 


Pro 


Val 


Asp 


Phe 


Pro 


He 


Glu 


Lys 


Gly 


Asp 


Val 


He 


Tyr 


Leu 










245 










250 










255 




Gl u 


Ly^ 


hys 


His 


Ala 


Cys 


Ser 


He 


Ser 


Lys 


His 


Thr 


Gin 


His 


Val 


Val 








260 










265 










270 






Alg 


Val 


Gly 


Asp 


Ser 


Met 


His 


Ser 


He 


Ser 


Gin 


Arg 


Tyr 


Gly 


He 


Arg 






275 










280 










285 








Met 


Lys 


Asn 


Leu 


Tyr 


Lys 


Leu 


Asn 


Asp 


Lys 


Asp 


Gly 


Glu 


Tyr 


He 


Pro 




2 90 










295 










300 










Gl n 


Glu 


Gly 


Asp 


He 


Leu 


Arg 


Leu 


Arg 
















305 










310 























20 



25 



30 



35 



40 



45 



50 



55 (2 1 IHFORMATIOH FOR SEQ ID MO:^101 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 amino acids 

(B) TYPE: amino acid 
60 (D) TOPOLOGY: linear 

(iil MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



( V 1 ) OR I G II I AL SOU RCE : 

(A) ORGANISM: Porphyromonas gingiva lis 



(ix) FEATURE: 
70 (A) NAME/ KEY: misc_feature 

(B) LOCATION 1 . . .523 



75 



(:-:i) SEQUENCE DESCRIPTION: SEQ ID NO: 4 01 
Met Asp Gly Arg Arg Tyr Ser Asp Gly Leu His Gin Ala He Glu Aia 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_I_> 




75 
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336/490 



1 5 10 15 

Lys Giu His Val Ly^ Val Glu Ala Ala Thi' Gin Thr Phe Ala Thr lie 

1*0 25 30 

Thr Leu Gin Asn Tyr Phe Arq Met Tyr His Lys Leu Ala Gly Met Thr 
5 35 40 45 

Gly Thr Ala Glu Thr Glu Ala Gly Glu Leu Trp Asp lie Tyr Lys Leu 

50 55 60 

Asp Val Val Val He Pro Thr Asn Lys Pro lie Ala Arg Lys Asp Met 
65 10 75 80 

10 A^n Asp Arg He Tyr Lys Thr Ala Arg Glu Lys Tyr Ala Ala Val He 

85 90 95 

Glu Glu He Val Arg Leu Val Glu Glu Gly Arg Pro Val Leu Val Gly 

100 105 110 

Thr Thr Ser Val Glu He Ser Glu Leu Leu Ser Arg Met Leu Arg Leu 
15 115 120 125 

Arg Gly He Gin His Asn Val Leu Asn Ala Lys Leu His Gin Lys Glu 

130 135 140 

Ala Glu He Val Ala Gin Ala Gly Gin Lys Gly Thr Val Thr He Ala 
145 150 155 160 

20 Thr Asn Met Ala Gly Arg Gly Thr Asp He Lys Leu Ser Ala Glu Val 

165 170 17 5 

Lys Ly- Ala Gly Gly Leu Ala He He Gly Thr Glu Arg His Glu Ser 

100 185 190 

Arg Arg Val Asp Arg Gin Leu Arg Gly Arg Ser Gly Arg Gin Gly Asp 
25 ' " 1 95 200 205 

Pro Gly Ser Ser He Phe T'yr Val Ser Leu Glu Asp His Leu Met Ara 

210 215 220 

Leu Phe Ala Thr Glu Lys He Ala Ser Leu Met Asp Arg Leu Gly Phe 
225 230 235 ^ 240 

30 Lys Glu Gly Glu Val Leu Glu Asn Asn Met Leu Ser Lys Ser Val Glu 

245 250 255 

Arg Ala Gin Lys Lys Val Glu Glu Asn Asn Phe Gly He Arg Lys His 

260 265 270 

Leu Leu Glu Tyr Asp Asp Val Met Asn Ser Gin Arg Glu Val He Tyr 
35 275 280 285 

Thr Arg Arg Arg His Ala Leu >let Gly Glu Arq He Gly Met Asp Val 

290 295 300 

Leu Asn Thr He Tyr Asp Val Cys Lys Ala Leu He Asp Asn Tyr Ala 
305 310 315 320 

40 Glu Ala Asn Asp Phe Glu Gly Phe Lys Glu Asp Leu Met Arg Ala Leu 

325 330 335 

Ala He Glu Ser Pro He Thr Gin Glu He Phe Arg Gly Lys Lys Ala 

340 345 350 

Glu Glu Leu Thr Asp Met Leu Phe Asp Glu Ala Tyr Lys Ser Phe Gin 
45 355 360 365 

Arg Lys Met Asp Leu He Ala Glu Val Ala His Pro Val Val His Gin 

370 375 380 

Val Phe Glu Thr Gin Ala Ala Val Tyr Glu Arg He Leu He Pro He 
385 390 395 400 

50 Thr Asp Gly Lys Arg Val Tyr Asn He Gly Cys Asn Leu Arg Glu Ala 

4 05 410 ^415 

Asp Glu Thr Gin Gly Lys Ser He He Lys Glu Phe Glu Lys Ala He 

420 425 430 

Val Leu His Thr He Asp Glu Ser Trp Lys Glu His Leu Arg Glu Het 
55 435 440 445 

Asp Glu Leu Arg Asn Ser Val Gin Asn Ala Ser Tyr Glu Asn Lys Asp 

450 455 460 

Pro Leu Leu He Tyr Lys Leu Glu Ser Tyr Glu Leu Phe Arg Lys Met 
465 470 475 430 

60 Val Glu Ala Met Asn Arg Lys Thr Val Ala lie Leu Met Arg Ala Arg 

4 85 4 90 " 4 95 

He Pro Val Pro Glu Ala Pro Ser Gin Glu Glu Leu Glu His Arg Arg 

500 505 510 

Gin He Glu He Arg His Ala Thr Gin Gin Ara 
65 515 520 

(2) IHFORMATIOII FOR SBQ ID NO: 402 

(i) SEQUENCE CHARACTERISTICS: 
70 (A) LENGTH: 375 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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Uii) HYPOTHETICAL: YES 

(vi) ORIGIHAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival! s 

5 

(i::) E'EATURE: 

(A) IJAJ'IE/K'EY: misc_featur- 
{B) LOCATION 1 . . .37 5 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:402 



15 



Met A^-ri Phe Leu Lys Lys Glu Pro Phe Ly^ He Phe Ser Met He Tyr 
IS 10 15 

Leu Leu Leu Asp Thr He Thr Asn Arg Ala Gly Thr Glu Arg Ala Val 

20 25 30 

He Asn Leu Ala Asn Asn Leu His Ala Asn Gly His Arg Val Ser Leu 

35 ii 0 4 5 

Val Ser Val Cys Thr Lys Glu Gly Glu Pro Ser Phe Gin Val Glu Lys 
50 55 GO 

20 Gly He Glu Val His His Leu Gly He Arg Leu Tyr Gly Asn Ala Leu 
65 7 0 7 5 80 

Ala Arg Lys Thr Val Tyr Phie Lys Ala Tyr Arg Arg He Lys Ala Leu 

85 90 95 

Tyr Lys Lys Arg Glu Pro Val Leu Leu He Gly Thr Asn He Phe He 
^5 100 105 110 

A.^n Thr lie Leu Ser Gin He Ser Asn Arg Gly Arg He Phe Thr He 

115 12 0 125 

Gly Cys Glu His He Ser Tyr Asp He Ala Ara Pro He Thr Lys Arg 

130 135 140 

He Arg Gly Phe Leu Tyr Ser Gly Leu Asp Ala Val Val Ala Leu Thr 
1^5 150 155 160 

Lys Arg Asp Gin Gin Ser Phe Glu Ala He Leu Arg Gly Arg Ser Lys 

lC-5 170 175 

Ala Tyr Val He Pro Asn Gin Val Ser Phe Thr Thr Val Gin Ara Asp 

180 185 190 

Ala Thr Thr His Lys Gin Met Leu Ala He Gly Ara Leu Thr Tyr Gin 

195 200 205 

Lys Gly Phe Glu Phe Met He Glu Asp Ala Ser Arg Val Leu Arg Glu 

210 215 220 

Arg Pro Asp Trp Lys Leu He He Val Gly Asp Gly Glu Asn Glu Ser 
225 230 235 240 

Met Leu Arg Lys Glu He Ala Ser Arg Asn Met Glu Ser Gin He Glu 

2AS 250 255 

lie His Pro Ser Thr Pro Glu He Arg Lys Tyr Tyr Glu Ser Ser Ala 
45 260 265 270 

He Tyr Leu Met Thr Ser Arg Phe Glu Gly Leu Pro Met Val Leu Leu 

2'75 280 285 

Glu Ala Glu Ala Tyr Ala Leu Pro He He Ser Tyr Asp Cys Pro Thr 
„ 2?0 295 300 

OU Gl-y Pro Arg Glu Leu He Glu Asn Gly Arg Asn Gly Phe Leu Val Pro 
3''^^ 310 315 320 

Met Glu Ala His Glu Asp Phe Ala Asp Lys Leu Arg Leu Leu Met Asp 

325 330 335 

Asp Glu Thr Leu Arg Lys Lys Met Gly Gin Glu Ser Glu Leu Met Val 
^5 340 3^5 350 

Lys Ser Tyr Ser Pro Ala Asn He Tyr Glu Cys Trp Lys Lys Leu Phe 

355 360 365 

Val Glu He Gly Tyr Met Asn 
370 375 



30 



35 



40 



60 



65 



(2) INFORMATION FOR SEQ ID JJO:4 03 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

70 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAl-JISM: Forphyromonas gingival is 

75 (i>:} FEATURE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 



PCT/AU98/01023 



338/490 



70 



(A) NAtlE/KEV: misc_featufe 
{ B) LOCATIOn I . . . 362 

(xi} StlQUEIICE DESCRIPTION: SEQ ID 110:403 



Met 


He 


Tyr 


Leu 


Leu 


Leu 


Asp 


Thr 


He 


Thr 


Asn 


Arg 


Ala 


Gly 


Thr 


Glu 


1 








5 










10 










15 




Arg 


Ala 


Val 


He 


Asn 


Leu 


Ala 


Asn 


Asn 


Leu 


His 


Ala 


Asn 


Gly 


His 


Arg 








20 










25 










30 






Val 


Ser 


Leu 


Val 


Ser 


Val 


Cys 


Thr 


Lys 


Glu 


Gly 


Glu 


Pro 


Ser 


Phe 


Gin 






35 










40 










■15 








Val 


Glu 


Ly^ 


Gly 


He 


Glu 


Val 


His 


His 


Leu 


Gly 


He 


Arg 


Leu 


Tyr 


Gly 




50 










55 










60 










A^n 


Ala 


Leu 


Ala 


Arg 


Lys 


Thr 


Val 


Tyr 


Phe 


Lys 


Ala 


Tyr 


Arg 


Arg 


He 


65 










7 0 










75 










80 


hys 


Ala 


Leu 


Tyr 


Lys 


Lys 


Arg 


Glu 


Pro 


Val 


Leu 


Leu 


He 


Gly 


Thr 


Asn 








85 










90 










95 




He 


Phe 


He 


Asn 


Thr 


He 


Leu 


Ser 


Gin 


He 


Ser 


Asn 


Arg 


Gly 


Arg 


He 








100 










105 










110 






Phe 


Thr 


He 


Gl y 


cys 


Glu 


His 


He 


Ser 


Tyr 


Asp 


He 


Ala 


Arg 


Pro 


He 






115 










120 










125 








Thr 


Ly.-? 


Arg 


He 


Arg 


Gly 


Phe 


Leu 


Tyr 


Ser 


Gly 


Leu 


A^p 


Ala 


Val 


Va 1 




130 










135 










110 










Ala 


Leu 


Thr 


Lys 


Arg 


Asp 


Gin 


Gin 


Ser 


Phe 


Glu 


Ala 


He 


Leu 


Arg 


Gly 


14b 










150 










155 










160 


Arg 


Ser 


Lys 


Ala 


Tyr 


Val 


He 


Pro 


Asn 


Gin 


Val 


Ser 


Phe 


Thr 


Thr 


Val 










165 










1 7 0 










175 




Gin 


Arg 


Af p 


Ala 


Thr 


Thr 


His 


Lys 


Gin 


Met 


Leu 


Ala 


He 


'31 y 


Arg 


Leu 








180 










185 










190 






Thr 


Tyr 


Gin 


Lys 


Gly 


Phe 


Glu 


Phe 


Met 


He 


Glu 


Asp 


Ala 


Ser 


Arg 


Val 






195 










200 










205 








Leu 


Arg 


Glu 


Arg 


Pro 


Asp 


Trp 


hys 


Leu 


He 


He 


Val 


Gly 


Asp 


Gly 


Glu 




210 










215 










220 










Asn 


GJ u 


Ser 


Met 


Leu 


Arg 


Lys 


Glu 


He 


Ala 


Ser 


Arg 


Asn 


Met 


Glu 


Ser 


225 










230 










235 










240 


Gin 


He 


Glu 


He 


His 


Pro 


Ser 


Thr 


Pro 


Glu 


He 


Arg 


Lys 


Tyr 


Tyr 


Glu 










245 










2 50 










255 




Ser 


Ser 


Ala 


He 


Tyr 


Leu 


I-fet 


Thr 


Ser 


Arg 


Phe 


Glu 


Gly 


Leu 


Pro 


Met 








260 










265 










27 0 






Val 


Leu 


Leu 


Glu 


Ala 


Glu 


Ala 


Tyr 


Ala 


Leu 


Pro 


He 


He 


Ser 


Tyr 


Asp 






275 










280 










285 








Cys 


Pro 


Thr 


Gly 


Pro 


Arg 


Glu 


Leu 


He 


Glu 


Asn 


Gly 


Arg 


Asn 


Gly 


Phe 




290 










295 










300 










Leu 


Val 


Pro 


Met 


Glu 


Ala 


His 


Glu 


Asp 


Phe 


Ala 


Asp 


Lys 


Leu 


Arg 


Leu 


305 










310 










315 










320 


Leu 


Met 


Asp 


Asp 


Glu 


Thr 


Leu 


Arg 


Lys 


Lys 


Met 


Gly 


Gin 


Glu 


Ser 


Glu 










325 










330 










335 




Leu 


Met 


Val 


Ly^ 


Ser 


Tyr 


Ser 


Pro 


Ala 


Asn 


He 


Tyr 


Glu 


cys 


Trp 


Lys 








340 










345 










350 






Lys 


Leu 


Phe 


Val 


Glu 


He 


Gly 


Tyr 


Met 


AS!i 


















355 










360 



















10 



15 



20 



25 



30 



35 



40 



45 



50 



(2) IHFORMATlOrJ FOR SEQ ID HO:4Q4 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

60 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIMAL SOURCE: 
65 (A) ORGAIIISM: Porphyromonas gingival is 

(ir.) FEATURE: 

(A) IIAi IE/KEY: !ni sc_f ea t ure 

(B) LOCATION 1 , , . 64 0 



(xi) SEOUEHCE DESCRI PTIOII : SEQ ID 110:404 



Met Gly Lys He He Gly He Asp Leu Gly Thr Thr Asn Ser Cys Val 
15 10 15 

75 Ser Val Leu Glu Gly Asn Glu Pro He Val He Thr Asn Ser Glu Gly 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID; <WO 9929870A1_t_> 




wo 99/29870 PCT/AU98/01 023 

339/490 

20 25 30 

hys Arg Thr Thr Pro Ssr Val Val Ala Phe Val Asp Gly Gly Glu Arg 

35 ^0 45 

hys Val Gly Asp Pro Als Lys Arg Gin Ala lie Thr Asn Pro Thr Lys 
5 50 55 60 

Thr lie Tyr Ser lie Lys Arg Phe Met Gly Glu Thr Tyr Asp Gin Val 
65 70 75 00 

Ser Arg Glu Val Glu Arg Val Pro Phe Lys Val Val Arg Gly Asp Asn 
85 90 95 

10 Asn Thr Pro Arg Val Asp lie Asp Gly Arg Leu Tyr Thr Pro Gin Glu 
100 105 ' 110 

He Ser Ala I let He Leu Gin Lys Met Lys Lys Thr Ala Glu Asp Tyr 

115 120 125 

Leu Gly Gin Glu Val Thr Glu Ala Val He Thr Val Pro Ala Tyr Phe 
15 130 135 1^0 

Asn Asp Ala Gin Arg Gin Ala Thr Lys Glu Ala Gly Glu He Ala Gly 
1^5 150 155 160 

Leu Lys Val Arg Arg He Val Asn Glu Pro Thr Ala Ala Ser Leu Ala 
165 170 175 

20 Tyr Gly Leu Asp Lys Ser Asn Lys Asp Met Ly^ He Ala Val Phe Asp 
180 185 190 

Leu Gly Gly Gly Thr Phe Asp He Ser He Leu Glu Leu Gly Asp Gly 

195 200 205 

Val Phe Glu Val Lys Ser Thr Asn Gly Asp Thr His Leu Gly Gly Asp 
25 210 215 220 

Asp Phe Asp His Val He He Asp Trp Leu Ala Glu Glu Phe Lys Ser 
225 230 235 240 

Gin Glu Gly Val Asp Leu Arg Gin Asp Pro Met Ala Met Gin Ara Leu 
2*^5 250 255 

30 Lys Glu Ala Ala Glu Lys Ala Lys He Glu Leu Ser Ser Thr Ser Ser 
260 265 270 

Thr Glu He Asn Leu Pro Tyr He Met Pro Val Asn Gly He Pro Lys 

275 280 285 

His Leu Val Met Thr Leu Thr Arg Ala Lys Phe Glu Gin Leu Ala Asp 
35 290 295 300 

Arg Leu He Gin Ala Cys Val Ala Pro Cys Glu Thr Ala Leu Lys Asp 
305 310 315 320 

Ala Gly Met Ser Arg Gly Asp He Asp Glu Val He Leu Val Gly Gly 

325 330 335 

Ser Thr Arg He Pro Ala He Gin Glu He Val Glu Lys He Phe Gly 

340 345 350 

Lys Ala Pro Ser Lys Gly Val Asn Pro Asp Glu Val Val Ala Val Gly 

355 360 365 

Ala Ala He Gin Gly Gly Val Leu Thr Gly Glu Val Lys Asp Val Leu 
45 370 375 380 

Leu Leu Asp Val Thr Pro Leu Ser Leu Gly He Glu Thr Met Gly Gly 
385 390 395 400 

Val Met Thr Arg Leu He Asp Ala Asn Thr Thr He Pro Thr Lys Lys 
405 410 415 

5U Ser Glu He Phe Thr Thr Ala Val Asp Asn Gin Pro Ser Val Glu He 
•^2 0 4 25 4 30 

His Val Leu Gin Gly Glu Arg Ser Leu Ala Lys Asp Asn Lys Ser He 

4 35 4 40 445 

Gly Arg Phe Asn Leu Asp Gly He Ala Pro Ala Pro Arg Gin Thr Pro 
55 450 455 460 

Gin He Glu Val Thr Phe Asp He As-p Ala Asn Gly He Leu Asn Val 
470 475 480 

Thr Ala His Asp Lys Ala Thr Gly Lys Lys Gin Asn He Arg He Glu 
485 490 495 

Oil Ala Ser Ser Gly Leu Ser Asp Asp Glu He Lys Arg Met Lys Glu Glu 
500 505 510 

Ala Gin Ala Asn Ala Glu Ala Asp Lys Lys Glu Lys Glu Arg He Asp 

515 520 525 

Lys He Asn Gin Ala Asp Ser Met He Phe Gin Thr Glu Lys Gin Leu 
o5 530 535 540 

Lys Glu Leu Gly Asp Lys Phe Pro Ala Asp Lys Lys Ala Pro He Asp 
545 550 555 560 

Thr Ala Leu Asp Lys Leu Lys Glu Ala His Lys Ala Gin Asp Val Ala 

^^^^ ^"^^^ S"^- 

/U Ala He Asp Thr Ala Met Ala Glu Leu Gin Thr Ala Leu Ser Ala Ala 
580 535 590 

Gly Glu Glu Leu Tyr Lys Asn Ala Gly Ala Ala Gin Gly Gly Ala Gin 

595 600 605 

Pro Gly Pro Asp Phe Gly Gly Ala Gin Gly Pro Ser Ala Gly Asp Gin 
/O 610 615 620 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



40 



BNSDOCID; <WO 9929870A1_L> 




10 



wo 99/29870 PCT/AU98/01023 

340/490 

Pro Ser Asp Asp Lys Asn Val Thr Asp Val Asp Phe Glu Glu Val Lys 
630 635 640 

(2) 11]F0RI!AT10(J FOR SEQ ID HO: 4 05 

(i) SEQUEIJCE CHARACTERISTICS: 

(A) LEMGTM: 44? amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



15 (vi) ORIGINAL SOURCE: 

(A) ORGAIJISH; Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAJIE/KEY: misc_feature 
20 (B) LOCATIOIJ 1...44 9 

(xi) SEQUEMCE DESCRI PTIOI I : SEQ ID HO: 4 05 

Met Arg Tyr Asp Leu Ala lie lie Gly Gly Gly Pro Ala Gly Tyr Thr 
25 1 5 10 15 

Ala Ala Glu Arg Ala Ala Lys Gly Gly Leu Lys Thr Leu Leu II- Glu 

20 25 30 

Ly- Asn Ala Leu Gly Gly Val Cys Leu Asn Glu Gly Cys lie Pro Thr 

35 4 0 45 

Lys Thr Leu Leu Tyr Ser Ala Lys Val Leu His Gin He Ala Thr Ala 

SO 55 60 

Ser Lys Tyr Ala Val Ser Gly Thr Ala Asp Gly Leu Asp Leu Gly Lys 

*70 75 80 

Val He Ala Arg Lys Gly Lys He He Arg Lys Leu Thr Ala Gly He 

85 90 95 

Arg Ser Arg Leu Thr Glu Ala Gly Val Glu I!et Val Thr Ala Glu Ala 

100 105 110 

Thr Val Thr Gly Cys Asp Ala Asp Gly He He Gly He Thr Ala Gly 

115 120 125 

Glu Ala Gin Tyr Lys Ala Ala Asn Leu Leu Leu Cys Thr Gly Ser Glu 

130 135 140 

Thr Phe He Pro Pro He Pro Gly Val Glu Gin Thr Glu Tyr Trp Thr 
150 155 160 

Asn Arg Glu Ala Leu Gin Asn Lys Glu He Pro Thr Ser Leu Val He 

165 170 175 

He Gly Gly Gly Val He Gly Ilet Glu Phe Ala Ser Phe Phe Asn Gly 

IBO 185 190 

He Gly Thr Gin Val His Val Val Glu J let Leu Pro Glu He Leu Asn 

l-'S 2 00 205 

Gly He Asp Pro Glu His Ala Ala Met Leu Arg Ala His Tyr Glu Ly^ 

210 215 22 0 

Glu Gly He Lys Phe Tyr Leu Gly His Lys Val Thr Ser Val Arg Asn 
225 230 235 240 

Gly Ala Val Thr Val Glu Tyr Glu Gly Glu Ser Lys Glu He Glu Gly 

245 250 255 

Glu Arg He Leu Ilet Ser Val Gly Arg Arg Pro Val Leu Gin Gly Ph- 

260 265 270 

Glu Ser Leu Gly Leu Val Leu Ala Gly Lys Gly Val Lys Thr Asn Glu 

2'75 280 285 

Arg Met Gin Thr Ser Leu Pro Asn Val Tyr Ala Ala Gly Asp He Thr 

290 295 300 

Gly Phe Ser Leu Leu Ala His Thr Ala Val Arg Glu Ala Glu Val Ala 
305 310 315 320 

Val Asp Gin He Leu Gly Lys Thr Asp Glu Thr Met Ser Tyr Arq Ala 
, , 325 330 335 

Val Fro Gly Val Val Tyr Thr Asn Pro Glu Val Ala Gly Val Gly Glu 

340 345 350 

Thr Glu Glu Ser Leu Arg Lys Ala Gly Arg Ala Tyr Thr Val Arq Arc 

355 360 365 

Leu Pro Met Ala Phe Ser Gly Arg Phe Val Ala Glu Asn Glu Gin Glv 

370 375 380 

Asn Gly Glu Cys Lys Leu Leu Leu Asp Glu Glu Asn Arg Leu 11*=^ Gl" 
3*^^ 390 395 

Ala His Leu He Gly Asn Pro Ala Gly Glu Leu He Val Thr Ala Ala 
405 410 415 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



BNSDOCID: <WO 9929870A1_I_> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



341/490 



llet Ala He GIu Thr Gly Met Thr Asp Arg Gin He Glu Arg He He 

4 2 0 4 25 ']30 

Phe Pro His Pro Thr Val Gly Glu He Leu Lys Glu Thr Leu Ala Gly 
435 440 44b 

Gly 



(2) IIIFOPJIATIOI! FOR SEQ ID HO: 4 06 

(i) SEQUEIICE CHARACTERISTICS: 

(A) LEIIGTH: 941 amino acid^ 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi) ORIGIIIAL SOURCE: 

(A) ORGAIJISM: Po rphyromonas gingiva lis 

(ix) FEATURE: 

(A) MAIIE/KEY: misc feature 

(E) LOCATIOII 1 . . . 94^1 



(Mi) SEQUEIICE DESCRIPTIOIl : SEQ ID 110:406 



I let 


Gl u 


Leu 


Lys 


Arg 


Phe 


Leu 


Ser 


Leu 


'3ly 


Leu 


Leu 


Leu 


Val 


Gly 


Phe 


1 








5 










10 










15 




He 


Pro 


Met 


Lys 


Leu 


Ser 


Ala 


Gin 


Gin 


Ala 


Gin 


Pro 


Leu 


Pro 


Thr 


Asp 








20 










25 










30 






Pro 


Ala 


Val 


Arg 


Val 


Gly 


Lys 


Leu 


Asp 


Asn 


Gly 


Leu 


Thr 


Tyr 


Phe 


He 






35 










40 








45 






Arg 


His 


Asn 


Glu 


Asn 


Pro 


Lys 


Asp 


Arg 


Ala 


Asp 


Phe 


Phe 


He 


Ala 


Gin 




50 










55 










60 










Lys 


Val 


Gly 


Ser 


He 


Leu 


Glu 


Glu 


Asp 


Ser 


Gin 


Ser 


Gly 


Leu 


Ala 


His 


65 










70 










75 








80 


Phe 


Leu 


Glu 


His 


I let 


Ala 


Phe 


Asn 


Gly 


Thr 


Lys 


Asn 


Phe 


Pro 


Gly 


Lys 










85 










90 










95 




As n 


Leu 


He 


Asn 


Tyr 


Leu 


Glu 


Thr 


He 


Gly 


Val 


Arg 


Phe 


Gly 


Gin 


Asn 








100 










105 










110 






Leu 


Asn 


Ala 


Ser 


Thr 


Gly 


Phe 


Asp 


Lys 


Thr 


Glu 


Tyr 


Thr 


He 


Met 


Asp 






115 










120 










125 






Val 


Pro 


Thr 


Thr 


Arg 


Gin 


Gly 


He 


He 


Asp 


Ser 


Cys 


Leu 


Leu 


lie 


Leu 




130 










135 










140 










flis 


Asp 


Trp 


Ser 


Asn 


Asn 


He 


Thr 


Leu 


Asp 


Gly 


His 


Glu 


He 


Asp 


Glu 


145 










150 










155 










160 


Glu 


Arg 


Gly 


Val 


He 


Gin 


Glu 


Glu 


Trp 


Arg 


Ala 


Arg 


Arg 


Asp 


Ala 


Asn 










165 










170 










175 




Leu 


Arg 


Met 


Phe 


Giu 


Ala 


He 


Leu 


Ala 


Lys 


Ala 


Met 


Pro 


Gly Asn 


Lys 








ItJO 










185 










190 






Tyr 


Ala 


Glu 


Arg 


Met 


Pro 


He 


Gly 


Leu 


Met 


Asp 


Val 


Val 


Leu 


Asn 


Phe 






195 










200 










205 








Lys 


His 


Asp 


Gl u 


Leu 


Arg 


Asn 


Tyr 


Tyr 


Lys 


Lys 


Trp 


Tyr 


Arg 


Pro 


Asp 




210 










215 










220 










Leu 


Gin 


Gly 


Leu 


Val 


lie 


Val 


Gly 


Asp 


He 


Asp 


Val 


Asp 


Tyr 


Val 


Glu 


225 










230 








235 






240 


Acn 


Lys 


He 


Lys 


Glu 


Leu 


Phe 


Lys 


Asp 


Val 


Pro 


Ala 


Pro 


Val 


Asn 


Pro 










245 










2 50 










255 




Ala 


Glu 


Arg 


He 


Tyr 


Thr 


Pro 


Val 


Glu 


Asp 


Asn 


Asp 


Glu 


Pro 


He 


Val 








2 60 










265 








27 0 






Ala 


He 


Ala 


Thr 


Asp 


Ala 


Glu 


Ala 


Thr 


Thr 


Thr 


Gin 


Leu 


Ser 


He 


Ser 






275 










280 










285 








Phe 


Lys 


Ser 


Asp 


Pro 


Thr 


Pro 


Gin 


Glu 


Val 


Arg 


Gly 


Ser 


He 


Phe 


Gly 




2 90 










295 










300 










Leu 


Val 


Glu 


Asp 


Tyr 


Met 




Gin 


Val 


He 


Thr 


Thr 


Ala 


Val 


Asn 


Glu 


305 










310 










315 










320 


Arg 


Leu 


Ser 


Gl u 


He 


Thr 


His 


Lys 


Pro 


Asn 


Ala 


Pro 


Phe 


Leu 


Ser 


Ala 










325 










330 










335 




Gly 


Ala 


Phe 


Phe 


Ser 


Asn 


Phe 


Met 


Tyr 


He 


Thr 


Gin 


Thr 


Lys 


Asp 


Ala 








340 










345 










350 




Phe 


Asn 


Phe 


Val 


Ala 


Thr 


Val 


Arg 


Glu 


Gly 


Glu 


Ala 


Glu 


Lys 


Ala 


Met 






355 










360 










365 






Asn 


Ala 


Leu 


Val 


Ala 


Glu 


He 


Glu 


Ser 


Leu 


Arg 


Gin 


Phe 


Gly 


He 


Th r 




37 0 










37 5 








360 









SUBSTITUTE SHEET (Rule 26) (RO/AU) 




wo 99/29870 PCT/AU98/01023 

342/ 490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 





Gly 


Glu 


Tyr 


Asp 


Arg 


Ala 


Arg 


Thr 


Asn 


Val 


Leu 


Lys 


Arg 


Tyr 


Glu 


38S 










390 










395 










4 00 


A5n 


Gin 


Tyr 


Aen 


Glu 


Arg 


Asp 


Lys 


Arg 


Lys 


Asn 


Asn 


Ala 


Tyr 


Ala 


Asn 










4 05 










410 










415 




Glu 


Tyr 


Ser 


Thr 


Tyr 


Phe 


Thr 


Asp 


Gly 


Gly 


Tyr 


He 


Pro 


Gly 


He 


Glu 








4 20 










425 










430 






Val 


Glu 


Tyr 


Gin 


Thr 


Val 


As n 


Ala 


Phe 


Ala 


Pro 


Gin 


Val 


Pro 


Leu 


Glu 
















4 40 










445 








Ala 


Fhe 


ASTl 


Gin 


Ala 


He 


Ala 


Gin 


Met 


He 


Asp 


Pro 


Val 


Lys 


Asn 


Ala 




'150 










455 










460 










Val 


Val 


Thr 


Leu 


Thr 


Gly 


Pro 


Ser 


Lys 


Ala 


Glu 


Ala 


Lys 


He 


Pro 


Ser 


465 










4 70 










475 










4 80 


Glu 


Ala 


Asp 


Phe 


Leu 


Ala 


Ala 


Phe 


Lys 


Ala 


Ala 


Arg 


Gin 


Gin 


Lys 


Val 










4 85 










4 90 










4 95 




Glu 


Ala 


Lys 


Lys 


Asp 


Glu 


Val 


Ser 


Asp 


Gin 


Lys 


Leu 


Met 


Glu 


Lys 


Ala 








500 










505 










510 






Pro 


Lys 


Ala 


Gly 


Lys 


He 


Val 


Ser 


Glu 


Ly^ 


Lys 


Asp 


Gin 


Lys 


Phe 


Gly 






515 










520 










525 








Thr 


Thr 


Glu 


Leu 


Thr 


Leu 


Ser 


Asn 


Gly 


He 


Lys 


Val 


Tyr 


Leu 


Lys 


Lys 




530 










535 










540 










Thr 


Asp 


Phe 


Lys 


Ser 


Asn 


Glu 


He 


Leu 


Met 


Ser 


Ala 


Leu 


Ser 


Pro 


Gly 












550 










555 










560 


Gly 


He 


Leu 


Ser 


Gly 


Lys 


Hi- 


Ala 


Pro 


Asn 


Gin 


Ser 


Val 


Met 


Asn 


Ser 










565 










570 










575 




Phe 


net 


Asn 


Val 


Gly 


'31 y 


Leu 


Gly 


Asn 


Phe 


Asp 


Ala 


He 


Gin 


Leu 


Asp 








50 0 










585 










590 






Lys 


Val 


Leu 


Thr 


Gly 


Arg 


Ser 


Ala 


Ser 


Val 


Ser 


Fro 


Ser 


Leu 


Ser 


Leu 






5 95 










600 










605 








Leu 


Ser 


Glu 


Gly 


Leu 


Ser 


Gly 


Lys 


Thr 


Thr 


Val 


Glu 


Asp 


Met 


Glu 


Thr 




610 










615 










620 










Ph- 


Phe 


Gin 


Leu 


He 


Tyr 


Leu 


Gin 


Met 


Thr 


Ala 


Asn 


Arg 


Lys 


Asp 


Pro 


62b 










630 










635 










64 0 


Glu 


Ala 


Phe 


Lys 


Ala 


Thr 


Gin 


Glu 


Lys 


Leu 


Tyr 


Asn 


Asn 


Leu 


Lys 


Asn 










645 










650 










655 




Gin 


Glu 


Ala 


Asn 


Pro 


Met 


Ala 


Ala 


Leu 


Met 


Asp 


Ser 


He 


Arg 


His 


Thr 








660 










665 










670 






Met 


Tyr 


Gly 


Asp 


Asn 


Pro 


Met 


Met 


Lys 


Pro 


Met 


Lys 


Ala 


Ala 


Asp 


Val 






675 










680 










685 








Glu 


Lys 


Val 


Asn 


Tyr 


Asp 


Gin 


Val 


Met 


Ala 


Phe 


Tyr 


Asn 


Glu 


Arg 


Phe 




690 










695 










700 










Ala 


hsp 


Ala 


Gly 


Asp 


Phe 


Met 


Phe 


Phe 


Phe 


He 


Gly 


Asn 


Leu 


Asp 


Glu 


705 










710 










715 










720 


Ala 


Lys 


1 let 


Lys 


Pro 


Leu 


He 


Glu 


Thr 


Tyr 


Le\j 


Ala 


Ser 


Leu 


Pro 


Asn 










725 










730 










735 




Leu 


Ly- 


Arg 


Gly 


Asp 


Lys 


Met 


Asn 


Lys 


Ala 


Gin 


Val 


Pro 


Ala 


Ala 


Arg 








740 










745 










750 






Ser 


Gly 


Lys 


lie 


Asp 


Cys 


Lys 


Phe 


Glu 


Lys 


Glu 


Met 


Asp 


Thr 


Pro 


Ser 






755 










760 










765 








Thr 


Thr 


lie 


Phe 


Asp 


Val 


Val 


Ser 


Gly 


Asn 


Val 


Glu 


Tyr 


Thr 


Leu 


Lys 




770 










775 










7 80 










A^n 


Ser 


Leu 


Leu 


Leu 


Glu 


Val 


Phe 


Ser 


Ala 


Val 


Met 


Asp 


Gin 


Val 


Tyr 












790 










795 










800 


Thr 


Ala 


Ttir 


Val 


Arg 


Glu 


Lys 


Glu 


Gly 


Gly 


Ala 


Tyr 


Ser 


Val 


Ala 


Ala 










805 










810 










815 




Phe 


Gly 


Gly 


Leu 


Glu 


Gin 


Tyr 


Pro 


Gin 


Pro 


Lys 


Ala 


Leu 


Met 


Gin 


He 








820 










825 










830 






Tyr 


Phe 


Fro 


Thr 


Asp 


Pro 


Ala 


Arg 


Ala 


Glu 


Glu 


Met 


Asn 


Ala 


He 


Val 






835 










840 










845 








The 


Ala 


Glu 


Leu 


Glu 


Lys 


Leu 


Ala 


Lys 


Glu 


Gly 


Pro 


Asn 


Val 


Glu 


Tyr 




850 










855 










860 












Ly- 


Lys 


Thr 


He 


Glu 


Asn 


Leu 


Asn 


Lys 


Gin 


His 


Lys 


Glu 


Se v 


Leu 


865 










870 










875 










880 


Arg 


Glu 


Am 


Arg 


Phe 


Trp 


Leu 


Glu 


Ala 


Met 


Lys 


Ala 


Ser 


Phe 


Phe 


Glu 










085 










890 










895 




Gly 


A^ n 


Asp 


Phe 


He 


Thr 


Asp 


Tyr 


Glu 


Ser 


Val 


Leu 


Asn 


Gly 


Leu 


Thr 








900 










905 










910 






Pro 


Ala 


Glu 


Leu 


Gin 


Lys 


Phe 


Ala 


Ala 


Asp 


Leu 


Leu 


Lys 


Gin 


Gin 


Asn 






915 










920 










925 








Arg 


Vnl 


Val 


Val 


Met 


Met 


Ala 


Pro 


Val 


Ala 


Lys 


Ala 


Gin 










930 










935 










940 











(2) IIirORMATlOIJ FOR SEQ ID IiO:407 



(i) SEQUENCE CHARACTERISTICS: 
75 (A) LEllGTH: 684 amino acids 



BNSDOCID: <WO 9929S70A1_L> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



PCT/AU98/01023 



343/490 



10 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) IIOLECLILE TYPE: protein 

(iii) HYPOTHETICAL: YES 

( V i ) OR I G I M AL SOU RCE : 

(A) ORGAMIStI: Porphyrcinonas gingival is 

(ix) FEATURE: 

(A) riAlIE/FCEY: mi sc_f ea t u re 
( B) LOCATIOIJ 1 ... 684 



15 (xi) SEQUEMCE DESCRIPTIOM: SEQ ID 110:^07 



20 



I let 


Ser 


Lys 


Lys 


Gly 


Thr 


He 


Gly 


Val 


Thr 


Ser 


Asp 


Asn 


He 


Phe 


Pro 


1 








5 










1 0 










1 5 




Val 


He 


Lys 


Lys 


Phe 


Leu 


Tyr 


Ser 


Asp 


His 


Glu 


He 


Phe 


Leu 


Arg 


Glu 








20 










25 










30 






He 


Val 


Ser 


Asn 


Ala 


Val 


Asp 


Ala 


Thr 


Gin 


Lys 


Leu 


Lys 


Thr 


Leu 


Thr 






35 










40 










45 








Ser 


Val 


'^ly 


Glu 


Phe 


Lys 


Gly 


Glu 


Thr 


Gly 


Asp 


Leu 


Arg 


Val 


Thr 


Val 




50 










55 










60 










Ser 


Val 


Asp 


Glu 


Val 


Ala 


Arg 


Thr 


He 


Thr 


Val 


Ser 


Asp 


Arg 


Gly 


Val 


65 










70 










7 5 










6 0 


Gly 


Met 


Thr 


Glu 


Glu 


Glu 


Val 


Glu 


Lys 


Tyr 


He 


Asn 


Gin 


He 


Ala 


Phe 










85 










90 










95 




Sei- 


Ser 


Ala 


Glu 


Gl u 


Phe 


Leu 


Glu 


Lys 


Tyr 


Lys 


Asp 


Asp 


Lys 


Ala 


Ala 








100 










105 










110 






ne 


He 


Gly 


His 


Phe 


Gly 


Leu 


Gly 


Phe 


Tyr 


Ser 


Ai a 


Phe 


Met 


Val 


Ser 






115 










120 










125 








Glu 


Arg 


Val 


Asp 


Val 


He 


Thr 


Arg 


Ser 


Phe 


Ara 


Glu 


Asp 


Ala 


Thr 


Ala 




130 










135 










140 










Val 


Lys 


Trp 


Ser 


Cys 


Asp 


Gl y 


Ser 


Pro 


Glu 


T'-r 


Thr 


Leu 


Glu 


Pro 


Ala 


145 










150 










1 55 










160 


Asp 


Lys 


Ala 


Asp 


Arg 


Gly 


Thr 


ASD 


He 


Val 


i let 


His 


He 


AsD 


Glu 


Glu 










165 










170 










175 




A^?n 


Ser 


Gl u 


Fhe 


Leu 


Lys 


Lys 


Glu 


Lys 


He 


Glu 


Gl V 


Leu 


Leu 


Gl^' 


Lvs 








180 










185 










190 






Tyr 


Cy^ 


Lys 


Phe 


Leu 


Thr 


Val 


Pro 


He 


He 


Phe 


Gly 


Lys 


Lys 


Gin 


Glu 






195 










200 










205 








Trp 


Lys 


Asp 


Gly 


Lys 


Met 


Gin 


Asp 


Thr 


Asp 


Glu 


Asp 


Asn 


Gin 


He 


As n 




210 










215 










22 ''p 










A^p 


Thr 


His 


Pro 


Ala 


Trp 


Thr 


Lys 


Lys 


Pro 


Ala 


Asp 


Leu 


Lys 


Asp 


Gl u 


225 










230 










235 










240 


Asp 


Tyr 


Lys 


Glu 


Phe 


Tyr 


Arg 


Ser 


Leu 


Tyr 


Pro 


Met 


Ser 


Glu 


Glu 


Pro 










245 










2 50 










2 55 




Leu 


Phe 


Trp 


He 


His 


Leu 


Asn 


Val 


Asp 


Tyr 


Pro 


Fhe 


Asn 


Leu 


Thr 


Gly 








260 










265 










270 






He 


Leu 


Tyr 


Phe 


Pro 


Lys 


He 


Lys 


Asn 


Asn 


Leu 


Asp 


Leu 


Gin 


Arg 


Asn 






275 










280 










285 








Lys 


He 


Gin 


Leu 


Tyr 


Cys 


Asn 


Gin 


Val 


Tyr 


Val 


Thr 


A^^p 


Glu 


Val 


Gin 




290 










295 










300 










Gly 


He 


Val 


Pro 


Asp 


Phe 


Leu 


Thr 


Leu 


Leu 


His 


Gly 


Val 


He 


Asp 


Ser 


305 










310 










315 










320 


Pro 


Asp 


He 


Pro 


Leu 


Asn 


Val 


Ser 


Arg 


Ser 


Tyr 


Leu 


Gin 


Ser 


Asp 


Ala 










325 










330 










335 




A£-n 


Val 


Lys 


Lys 


He 


Ser 


Ser 


His 


He 


Thr 


Lys 


Lys 


Val 


Ala 


Asp 


Arg 








340 










345 










350 




Leu 


Glu 


Glu 


He 


Phe 


hys 


Asn 


Asp 


Arq 


Pro 


Thr 


Phe 


Glu 


Glu 


Lys 


Trp 






355 










360 










365 








A:rp 


Ser 


Leu 


Lys 


Leu 


Phe 


Val 


Glu 


Tyr 


Gly 


Met 


Leu 


Thr 


Asp 


Glu 


Lys 




370 










375 








33 0 








Phe 


Tyr 


Glu 


Arg 


Ala 


Ala 


Lys 


Phe 


Phe 


Leu 


Fhe 


Thr 


Asp 


Met 


Asp 




385 










390 










3 95 










400 


His 


Lys 


Tyr 


Thr 


Phe 


Asp 


Glu 


Tyr 


Arg 


Thr 


Leu 


Val 


Glu 


Gly 


Val 


Gin 










405 










410 










415 




Tlir 


Asp 


Lys 


Asp 


Gly 


Gin 


Val 


Val 


Tyr 


Leu 


Tyr 


Ala 


Thr 


Asp 


Lys 


His 








420 










4 25 










4 30 






Gly 


Gin 


Tyr 


Ser 


His 


Val 


Lys 


Arg 


Ala 


Ser 


Asp 


Lys 


Gly 


Tyr 


Ser 


Val 






435 










440 










445 








I let 


Leu 


Leu 


Asp 


GJy 


Gin 


Leu 


Asp 


Pro 


Mis 


He 


Val 


Ser 


Leu 


Leu 


Glu 




4bO 










455 










4 60 










Gin 


Lys 


Leu 


Glu 


Lys 


Thr 


His 


Phe 


Val 


Arg 


Va 1 


Asp 


Ser 


Asp 


Thr 


He 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1„I_> 



wo 99/29870 



PCT/AU98/01023 



344/ 490 





^65 






470 










475 










4 80 




Asr\ 


Asn 


Leu He Arg 


Lys 


Glu 


Glu 


Arg 


Ala 


Glu 


Val 


Lys 


Leu 


Ser 


Asp 








4 85 










4 90 










4 95 






Thr 


Glu 


Arg Ala Thr 


Leu 


Val 


Lys 


Leu 


Phe 


Glu 


Ala 


Arg 


Leu 


Pro 


Arg 


c 
O 






500 








505 










51 0 








Asp 


Glu 


Lys Lys His 


Phe 


Asn 


Val 


Ala 


Phe 


Glu 


Ser 


Leu 


Gl y 


Ala 


Glu 








515 






520 










525 










Gly 


Glu 


Ala He Leu 


He 


Thr 


Gin 


Ala 


Glu 


Phe 


Met 


Arg 


Arg 


Met 


Arg 


i n 
xu 




530 






535 










54 0 










Asp 


I let 


Al s Gin Leu 


Gin 


Pro 


Gly 


1 let 


Ser 


Phe 


Tyr 


Gly 


Glu 


Leu 


Pro 




54 5 






550 










555 










560 




Asp 


Ser 


Tj-r Asn Leu 


Val 


Leu 


Asn 


Thr 


Asp 


His 


Pro 


Leu 


He 


Asp 


Arg 








565 










57 0 










575 




15 


Val 


Leu 


Ser Gly Glu 


Lys 


Glu 


Ser 


Val 


Glu 


Pro 


Ser 


Leu 


Thr 


Glu 


Leu 






580 








585 










5 90 








Arg 


Ala 


Lys He Ala 


Glu 


Leu 


Lys 


Ala 


Glu 


Glu 


Ala 


Lys 


Leu 


Leu 


Asp 








5?5 






600 










605 










Glu 


Glu 


Lys Gly Lys 


Lys 


Pro 


Glu 


Glu 


He 


Pro 


Val 


Ala 


Thr 


Lys 


Glu 






610 






615 










620 










Ala 


Lys 


Gl'j Asn Asn 


Ala 


Val 


Glu 


Gin 


Ala 


Lys 


Thr 


Glu 


Gly 


Ser 


He 




625 






630 










635 










64 0 




A^n 


Asp 


Gin Leu Thr 


Ly3 


Tyr 


Ala 


Gin 


Asp 


Asn 


Glu 


Leu 


He 


Gly 


Gin 








645 










650 










655 




25 


Leu 


lie 


Asp Leu Ala 


Leu 


Leu 


Gly 


Ser 


Gly 


L"u 


L-u 


Thr 


Gl"- 


Glu 


Al a 






660 






665 








67 0 








Leu 


Ala 


Glu Phe He 


Arg 


Arg 


Ser 


Gin 


Arg 




Leu 






















68 0 


















30 


{2) 


IMFORIIATIOI) FOR 


SEQ 


ID NO: 408 




















{ i ) 


SEQUENCE CHARACTERISTICS: 






















(A) LEIIGTH: 4 64 amino 


acids 




















(B) TVPE: 


amino acid 
























[ D) TOPOLOGY : 


linear 


















35 


































(ii) 


MOLECULE TYPE: 


protein 






















(iii) 


HYPOTHETICAL: YES 




















4U 




(vi) 


ORIGINAL SOURCE: 


























(A) ORGANISM: 


Porphyromonas gingival is 














FEATURE: 
























4o 






{A) NAT IE /KEY: 


misc feature 


















(E) LOCATION 1 . . .464 
























SEQUENCE DESCRIPTION: SEQ ID 110:408 










du 


I let 


Glu 


Lys Leu He 


Asp 


He 


Leu 


Val 


Val 


A.sp 


Asp 


Asp 


-/a 1 


Ala 


Val 


1 














10 








15 






Cys 


Ala 


Ala Leu Arg 


Leu 


Val 


Leu 


Lys 


Arg 


Ala 


Gly 


Tyr 


Asn 


Pro 


Val 
















25 










30 








He 


Ala 


Asn Ser Pro 


Asp 


Glu 


Ala 


Leu 


Ser 


He 


Met 


Arg 


Asn 


Pro 


Asp 


55 






35 






40 










45 








Gly 


Gly 


Cys Lys Pro 


Ala 


Val 


He 


Leu 


Met 


Asp 


Met 


Asn 


Phe 


Ser 


Leu 






50 






55 










60 












Ser 


Thr 


Ser Gly Arg 


Glu 


Gly 


Leu 


Glu 


Leu 


Leu 


Glu 


Lys 


Met 


Gin 


He 




65 






7 0 










75 










80 


DU 


Phe 


Thr 


Ser Cys Pro 


Val 


He 


Leu 


Met 


Thr 


Ala 


Trp 


Ala 


Ser 


He 


Pro 






85 










90 








c. c 






Leu 


Ala 


Val Glu Gly 


Met 


Arg 


Leu 


Gly 


Ala 


Phe 


Asp 


Phe 


He 


Gly 


Lys 








100 








105 










110 








Pro 


Tfp 


Asp Asn Asp 


Arg 


Leu 


Leu 


Arg 


Thr 


He 


Asp 


Thr 


Al a 


L*='U 


Hi" 


65 






115 






120 










125 








Leu 


Ala 


Ala Pro Ser 


Ala 


Val 


Ala 


Asn 


Pro 


Ser 


Glu 


Gin 


Ser 


Asp 


Arg 






130 






135 










14 0 










Asp 


Thr 


Ala Arg Gin 


Pro 


Lys 


Ala 


Thr 


Val 


Gin 


Glu 


Asn 


Asp 


Pro 


C* *s 




145 






150 










155 








160 


70 


Ala 


His 


He He Gly 


Arg 


Ser 


Asp 


Ala 


He 


Cys 


Lys 


He 


Lys 


Glu 


Arg 






165 










170 










175 






He 


Arg 


Arg He Ala 


Pro 


Thr 


His 


Ala 


Ser 


V^a 1 


Leu 


He 


Thr 


Gly 


Glu 








180 








185 










190 








Ser 


Gly 


Thr Gly Lys 


Glu 


Leu 


He 


Ala 


Glu 


Ala 


Leu 


His 


Arg 


Gly 


Ser 


75 






195 






2 00 










205 








Lys 


Arg 


Ala Ser Ala 


Pro 


Phe 


Val 


Lys 


Val 


Asn 


Leu 


Gly 


Gl y 


He 


Pro 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <W0 9929870A1 J_> 



wo 99/29870 



PCT/AU98/01023 



345/490 



40 



45 



50 



55 



60 



65 



70 



75 





210 










215 










2 2 0 










Glu 


Ser 


Leu 


Phe 


Glu 


Ser 


Glu 


Leu 


Phe 


Gly 


His 


Lys 


Lys 


Gly 


Ala 


Phe 


225 










230 










235 








24 0 


Thr 


Asn 


Ala 


Phe 


Ser 


Asp 


Arg 


Lys 




Arg 


Phe 


Glu 


Leu 


Ala 


Asp 


Gly 










245 










2 50 










255 


Gly 


Thr 


He 


Phe 


Leu 


Asp 


Glu 


He 


Gly 


Glu 


Leu 


Pro 


Val 


Gly 


Asn 


Gin 








2 60 










265 










270 






Val 


Ly^ 


Leu 


Leu 


Arg 


Val 


Leu 


Gin 


Glu 


Gin 


Thr 


Phe 


Glu 


Pro 


Leu 


Gly 






275 










280 










285 






Glu 


Ser 


Val 


Ser 


His 


Arg 


Val 


Asp 


He 


Arg 


Val 


Val 


Ser 


Ala 


Thr 


Asn 




290 










2 95 










300 










Ala 


Ser 


Leu 


Glu 


Arg 


Met 


Val 


Ala 


Glu 


Gly 


Arg 


Phe 


Arg 


Glu 


Asp 


Leu 


305 










310 










315 










320 


Tyi- 


Tyr 


Arg 


He 


Asn 


Leu 


He 


His 


Leu 


His 


Leu 


Pro 


Pro 


Leu 


Arg 


Glu 










325 










330 










335 




Arg 


Gin 


Glu 




He 


Gin 


Leu 


Leu 


Val 


Glu 


Ala 


Phe 


Ser 


Glu 


Ala 


Phe 








34 0 










345 










350 






Ala 


Gin 


Ser 


Asn 


Gly 


Leu 


Pro 


His 


Ala 


Val 


Trp 


Ser 


Ala 


Glu 


Ala 


Met 






355 










360 










365 








Arg 


Arg 


He 


Cys 


Ala 


Met 


Pro 


Leu 


Pro 


Gly 


Asn 


Val 


Arg 


Glu 


Leu 


Lys 




370 










375 










380 








A^n 


Val 


Val 


Glu 


Arg 


Thr 


Leu 


Leu 


Leu 


Ser 


Gly 


Ser 


Arg 


Glu 


He 


Ser 


385 










390 










395 








4 00 


Ala 


Arg 


Asp 


Val 


Ala 


Acp 


Phe 


Gly 


Ser 


Gin 


Val 


Thr 


Ala 


Ala 


Asp 


His 










4 05 










410 










415 




Ser 


Asp 


Glu 


Arg 


Ala 


Leu 


Thr 


Asp 


Met 


Glu 


Glu 


Ala 


Ala 


He 


Arg 


Glu 








4 20 










4 25 










4 30 






Thr 


Leu 


Thr 


Lys 


Tyr 


Asn 


Gly 


Asn 


Val 


Ser 


Arg 


Ala 


Ala 


Arg 


Ala 


Leu 






435 










440 










4 45 






Gly 


Leu 


Ser 


Arg 


Ala 


Ala 


Leu 


Tyr 


Arg 


Arg 


Met 


Glu 


Lys 


Tyr 


Gly 


Leu 




450 










455 










460 









10 



15 



20 



25 



30 



(2) IMFORMATIOn FOR SEQ ID IJO:4 09 

35 (i) SEQUEUCE CHARACTERISTICS: 

(A) LENGTH: 250 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

( i i i ) HY POTHETI CAL : YES 

(vi) ORIGItlAL SOURCE: 

(A) ORGAIJISM: Forphyromonas gingival is 

(ix) FEATURE: 

(A) IIAME/KEY: misc_feature 

(B) LOCATIOM 1 . . .2 50 

(xi) SEQUEtfCE DESCRI PTIOH : SEQ ID 110:409 



Met 


Leu 


Lys 


He 


Lys 


Asn 


Leu 


His 


Ala 


Thr 


Val 


Gin 


Gly 


Lys 


Glu 


He 


1 








5 










10 






15 




Leu 


Lys 


Gly 


He 


Asn 


Leu 


Glu 


He 


Asn 


Ala 


Gly 


Glu 


He 


His 


Ala 


He 








20 










25 








30 






Met 


Gly 


Pro 
35 


Asn 


Gly 


Ser 


Gly 


Lys 
40 


Ser 


Thr 


Leu 


Ser 


Ser 
45 


Val 


Leu 


Val 


Gly 


His 
50 


Fro 


Ser 


Phe 


Glu 


Val 
55 


Thr 


Glu 


Gly 


Glu 


Val 
60 


Thr 


Phe 


Asn 


Gly 


He 


Asp 


Leu 


Leu 


Glu 


Leu 


Glu 


Pro 


Glu 


Glu 


Arg 


Ala 


His 


Leu 


Gly 


Leu 


65 










70 










7 










BO 


Phe 


Leu 


Ser 


Phe 


Gin 
85 


Tyr 


Pro 


Val 


Glu 


He 
90 


Pro 


Gly 


Val 


Ser 


Met 
95 


Val 


Asn 


Phe 


Met 


Arg 


Ala 


Ala 


Val 


Asn 


Glu 


His 


Arg 


Lys 


Ala 


He 


Gly 


Ala 








100 










105 










110 




Glu 


Pro 


Val 


Ser 


Ala 


Ser 


Asp 


Phe 


Leu 


Lys 


Met 


Met 


Arg 


Glu 


Lys 


Arg 






115 










120 










125 




Ala 


He 
130 


Val 


Glu 


Leu 


Asp 


Asn 
135 


Lys 


Leu 


Ala 


Ser 


Arg 
140 


Ser 


Val 


Asn 


Glu 


Gly 


pr,e 


Ser 


Gly 


Gly 


Glu 


Lys 


Lys 


Arg 


Asn 


Glu 


He 


Phe 


Gin 


Met 


Ala 


145 










150 










155 










160 


Met 


Leu 


Glu 


Fro 


Lys 


Leu 


Ala 


He 


Leu 


Asp 


Glu 


Thr 


Asp 


Ser 


Gly 


Leu 




He 






165 










170 








175 




Asp 


A.TP 


Ala 


Leu 


Arg 


He 


Val 


Ala 


Gly 


Gly 


Val 


Asn 


Arg 


Leu 


Arg 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1 J_> 



wo 99/29870 



PCT/AU98/01023 



346/490 



10 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



180 185 190 

S^r Pro Glu Asn Ala Ala lie Val He Thr His Tyr Glv\ Arg Leu Leu 

195 200 205 

Glu Tyr He Lys Pro Asp Phe Val His Val Leu Tyr Lys Gly Arg He 

210 215 220 

Val Lys Ser Gly Gly Ala Glu Leu Ala Leu Thr Leu Glu Glu Lys Gly 
225 230 235 240 

Tyr A^p Trp He Lys Glu Glu He Gly Glu 
245 250 

(2) ItlFORnATIOH FOR SEQ ID MO: 410 



(i) SEQUEHCE CHARACTERISTICS: 

(A) LEliGTH: 4 61 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

20 (iii) HYPOTHETICAL: YES 

(vi) ORIGIIJAL SOURCE: 

(A) ORGAriISM: Porphyromonas gingivali.<? 

25 (ix) FEATURE: 

(A) MAI IE /KEY: misc_feature 

(B) LOCATION H . .461 



{Mi) SEQUEIJCE DESCRIPTION: SEQ ID MO: 410 



Met 


Ala 


Lys 


Glu 


Lys 


Thr 


He 


Tyr 


Vol 


Cys 


Arg 


Ser 


Cys 


Gly 


Thr 


Lys 


1 








5 










10 










15 




J. J - r 


Al a 


T r-c 


Trp 


Gin 




As n 


Cys 


Asn 


Al a 


Cys 




Glu 


Trp 


As n 


Cys 








20 










25 










30 






He 


Asp 


Glu 


Glu 


Lys 


Val 


Pro 


Ala 


Pre 


Ala 


Ser 


Gly 


Lys 


His 


Ala 


Ala 






35 










40 










45 








Lys 


Ser 


Phe 


Met 


Pro 


Arg 


Glu 


Gin 


Asp 


Asn 


Arg 


Pro 


Arg 


Leu 


Leu 


Gin 




50 










55 










60 










Asp 


Val 


Glu 


Ser 


Gly 


Asp 


Glu 


Glu 


Arg 


He 


Arg 


Leu 


Gly Asp 


Glu 


Glu 


65 










70 










75 










80 


Phe 


Asp 


Arg 


Val 


Leu 


Gly 


Gly 


Gly 


He 


Val 


Lys 


Gly 


Ala 


Phe 


Val 


Leu 










85 










90 










95 




Leu 


Gly 


Gly 


Glu 


Pro 


Gly 


He 


Gly 


Lys 


Ser 


Thr 


Leu 


He 


Leu 


Gin 


Thr 








100 










105 










110 






Val 


Leu 


Arg 


Leu 


Pro 


Gin 


Leu 


Arg 


Thr 


Leu 


Tyr 


Val 


Ser 


Gly 


Glu 


Glu 






115 










120 










125 








Ser 


Ala 


Arg 


Gin 


Leu 


Lys 


Met 


Arg 


Ala 


Glu 


Arg 


Leu 


Gly 


Gin 


Ala 


Met 




130 










135 










140 










Asn 


Gly 


Cys 


Tyr 


Val 


Tyr 


Cys 


Glu 


Thr 


Asn 


He 


Glu 


Arg 


He 


Leu 


Ser 


145 










150 










155 










160 


Arg 


Ala 


Glu 


Glu 


Leu 


Thr 


Pro 


Asp 


Leu 


Leu 


Val 


He 


Asp 


Ser 


He 


Gin 










165 










170 










175 




Thr 


Val 


Tyr 


Thr 


Glu 


Glu 


Met 


Glu 


Ser 


Ser 


Ala 


Gly 


Ser 


Val 


Gly 


Gin 








180 










185 










190 






He 


Arg 


Glu 


Cys 


Ala 


Ala 


Leu 


Leu 


Leu 


Ly^ 


Tyr 


Cys 


Lys 


Thr 


Thr 


Gly 






195 










200 










205 








He 


Pro 


Val 


He 


Val 


He 


Gly 


His 


He 


Thr 


Lys 


Glu 


Gly 


Ser 


He 


Ala 




210 










215 










220 










Gly 


Pro 


Lys 


Val 


Leu 


Glu 


His 


He 


Val 


Asp 


Thr 


Val 


Leu 


Leu 


Phe 


Asp 


225 










230 










235 










240 


Gly 


Asp 


Lys 


His 


His 


Leu 


Tyr 


Arg 


He 


Leu 


Arg 


Gly 


Gin 


Lys 


Asn 


Arg 










245 










250 










255 




Tyr 


Gly 


Ser 


Thr 


Ser 


Glu 


Leu 


Gly 


He 


Tyr 


Glu 


Met 


Arg 


Gin 


Asp 


Gly 








260 










265 










270 






Leu 


Arg 


Gly 


Val 


Glu 


Asn 


Pro 


Ser 


Gl u 


His 


Leu 


He 


Thr 


Arg 


Asn 


Arg 






275 










280 










285 








Glu 


Asp 


Leu 


Ser 


Gly 


He 


Ala 


He 


Ala 


Val 


Ala 


Met 


Glu 


Gly 


He 


Arg 




290 










295 










300 










Pro 


He 


Leu 


He 


Glu 


Ala 


Gin 


Ala 


Leu 


Val 


Ser 


Ser 


Ala 


He 


Tyr 


Ala 


305 










310 










315 










320 


As n 


Pro 


Gin 


Arg 


Ser 


Ala 


Thr 


Gly 


Phe 


Asp 


He 


Arg 


Arg 


Met 


Asn 


Met 










325 










330 










335 




Leu 


L-u 


Ala 


Val 


Leu 


Glu 


Lys 


Arg 


Ala 


Gly 


Phe 


Lys 


Leu 


He 


Gin 


Lys 








340 










345 










350 






Asp 


Val 


Fhe 


Leu 


Asn 


He 


Ala 


Gly 


Gly 


He 




He 


Ala 


Asp 


Pro 


Ala 



BNSDOCtD: <WO 9929870A1_L> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



PCT/AU98/01023 



347/490 



25 



35 



40 



45 



50 



55 



60 



65 







355 










360 










365 








Thr 


Asp 


Leu 


Al a 


Val 


He 


Ser 


Ala 


Val 


Leu 


Ala 


Ser 


Ser 


Leu 


Asp 


He 




370 










375 










380 








Val 


He 


Pro 


Pro 


Ala 


Val 


Cys 


Met 


Thr 


Gly 


Glu 


Val 


Gly 


Leu 


Ser 


Gly 


385 










390 










395 










400 


Glu 


He 


Arg 


Pro 


Val 


Ser 


Arg 


He 


Glu 


Gin 


Arg 


He 


Thr 


Glu 


Ala 


Arg 










405 










410 










415 




Arg 


He 


Giy 


Phe 


Lys 


Glu 


He 


Leu 


Val 


Pro 


Ala 


A«p 


Asn 


Phe 


Arg 


Gin 








420 










425 










4 30 






Glu 


Asp 


Ala 


Gly 


Arg 


Phe 


Gly 


He 


Arg 


Leu 


Val 


Pro 


Val 


Arg 


Lys 


Va 1 






435 










440 










445 








Glu 


Glu 


Ala 


Phe 


Arg 


His 


Leu 


Phe 


Ser 


Lys 


Gly 


Arg 


Glu 










450 










455 










460 










i2) 


It I FORI lATIOM 


FOR 


SEQ 


ID IJO: 411 

















10 



15 

(i) SEQUEtlCB CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 
20 (D) TOPOLOGY: linear 

(ii} IIOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGIIIAL SOURCE: 

(A) ORGAMISil: Porphyromonas gingivalis 



{ix) FEATURE: 
30 (A) IIAIIE/KEY: misc_feature 

(B) LOCATIOII 1 ... 271 



(xi) SEQUENCE DESCRI PTIOII : SEQ ID UO:411 



Met 


Asn 


Ser 


Arg 


His 


Leu 


Thr 


He 


Thr 


He 


He 


Ala 


Gly 


Leu 


Ser 


Leu 


1 








5 










10 










15 




Phe 


Val 


Leu 


Thr 


Leu 


Gly 


Gly 


Cys 


Ser 


Val 


Ala 


Gin 


Gin 


Asp 


Thr 


Gin 








20 










25 










30 






Trp 


Thr 


Leu 


Giy 


Gly 


Lys 


Leu 


Phe 


Thr 


Ser 


Ala 


Trp 


He 


Gin 


Arg 


Ser 






35 










40 










45 








Ala 


Glu 


Tyr 


Gin 


Ala 


Leu 


Cys 


He 


Gin 


Ala 


Tyr 


Asn 


He 


Ala 


Thr 


Glu 




50 










55 










60 










Arg 


Val 


Asp 


Ala 


Leu 


Pro 


Ala 


Glu 


Arg 


Lys 


Gin 


Gly 


Asp 


Arg 


Pro 


Tyr 


65" 










70 










75 










80 


Ala 


He 


Val 


Thr 


Asp 


He 


Asp 


Glu 


Thr 


He 


Leu 


Asp 


Asn 


Thr 


Pro 


Asn 










85 










90 










95 




Ser 


Val 


Tyr 


Gin 


Ala 


Leu 


Arg 


Gly 


Lys 


Asp 


Tyr 


At'p 


Glu 


Glu 


Thr 


Trp 








1 00 










105 










110 






Gly 


Lys 


Trp 


Cys 


Ala 


Gin 


Ala 


Asp 


Ala 


Asp 


Thr 


Leu 


Ala 


Gly 


Ala 


Leu 






115 










120 










125 








Ser 


Phe 


Phe 


Leu 


His 


Ala 


Ala 


Asn 


Lys 


Gly 


He 


Glu 


Val 


Phe 


Tyr 


Val 




130 










135 










140 








Thr 


Asn 


Arg 


Arg 


Asp 


Asn 


Leu 


Arg 


Glu 


Ala 


Thr 


Leu 


Gin 


Asn 


Leu 


Gin 


145 










150 










155 










160 


Arg 


Tyr 


Gly 


Phe 


Pro 


Phe 


Ala 


Asp 


Gl u 


Glu 


His 


Leu 


Leu 


Thr 


Thr 


His 










165 










170 










175 




Gly 


Pro 


Ser 


Asp 


Lys 


Glu 


Pro 


Arg 


Arg 


Leu 


Lys 


He 


Gin 


Glu 


Gin 


Tyr 








180 










185 










190 




Glu 


He 


Val 


Leu 


Leu 


He 


Gly 


Asp 


Asn 


Leu 


Giy 


Asp 


Phe 


His 


His 


Phe 






195 










200 






205 








Phe 


Asn 


Thr 


Lys 


Glu 


Glu 


Ser 


Gly 


Arg 


Lys 


GJn 


Ala 


Leu 


Gly 


Leu 


Thr 




210 










215 










220 










Ala 


Gly 


Glu 


Phe 


Gly 


Arg 


His 


Phe 


He 


net 


Leu 


Pro 


Asn 


Pro 


Asn 


Tyr 


225 










230 










235 










240 


Gly 


Ser 


Trp 


Glu 


Pro 


Ala 


Trp 


Tyr 


Gly 


Gly 


Lys 


Tyr 


Pro 


Pro 


Leu 


Pro 










245 










250 










255 




Glu 


Arg 


Asp 


Lys 


Ala 


Leu 




Gin 


Leu 


His 


Ser 


Gin 


Asn 


Ser 


Arg 










260 










265 










270 





70 {2) lUFORIlATION FOR SEQ ID H0:4H 



75 



ti) SEQUEtiCE CHARACTERISTICS: 

(A) LEMGTH: 417 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 PCT/AU98/01023 

348/ 490 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 

5 

(vi) ORIGItJAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival i£ 

(i:;) FEATURE: 
10 (A) IJAI1E/KEY: misc_feature 

(B) LOCATION 1. . - 417 

(xi) SEQUEIICE DESCRIPTION: SEQ ID NO: 412 

15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



net 


Ser 


Thr 


Asn 


lie 


Asp 


Val 


Gin 


Gin 


He 


Lys 


Gin 


Arg 


Phe 


Gly 


He 


1 








5 










10 










15 




lie 


Gly 


Ser 


Ser 


Fro 


Leu 


Met 


Glu 


His 


Ala 


He 


Arg 


Val 


Ala 


Ala 


Gin 








20 










25 










30 






Val 


Ala 


Pro 


Thr 


Asp 


Met 


Ser 


Val 


Leu 


Val 


Thr 


Gly 


Glu 


Ser 


Gly 


Ser 






35 










4 0 










45 








Gly 


Lys 


Glu 


Phe 


Phe 


Pro 


Gin 


He 


He 


His 


Tyr 


Tyr 


Ser 


Ala 


Arg 


Lys 




50 










55 










60 










His 


His 


Ser 


Tyr 


lie 


Ala 


Val 


Asn 


Cys 


Gly 


Ala 


He 


Pro 


Glu 


Gly 


Thr 


65 










7 0 










75 










8 0 


lie 


Asp 


Ser 


Glu 


Leu 


Phe 


Gly 


His 


Arg 


Lys 


Gly 


Ser 


Phe 


Thr 


Gly 


Ala 










85 










90 










95 




Val 


Ser 


A"p 


Arg 


Lys 


Gly 


Tyr 


Phe 


Glu 


Glu 


Ala 


Ser 


Gly 


Gly 


Thr 


He 








100 










105 










110 






Phe 


Leu 


Asp 


Glu 


Val 


Gly 


Glu 


Leu 


Pro 


Leu 


Pro 


Thr 


Gin 


Ala 


Arg 


Leu 






lis 










120 










125 








Leu 


Arg 


Val 


Leu 


Glu 


Thr 


Gly 


Glu 


Phe 


He 


Pro 


Val 


Gly 


Ala 


Ser 


Gin 




130 










135 










140 










Ser 


Gin 


Lys 


Thr 


Asp 


Val 


Arg 


He 


Val 


Ala 


Ala 


Thr 


Asn 


Val 


Asn 


Leu 


145 










150 










155 










160 


Lys 


Glu 


Ala 


Val 


Ala 


Asn 


Gly 


Lys 


Phe 


Arg 


Glu 


Asp 


Leu 


Phe 


Phe 


Arg 










165 










170 










175 




Leu 


Asn 


Thr 


Val 


Pro 


He 


Glu 


Val 


Pro 


Ala 


Leu 


Arg 


Met 


Arg 


Pro 


Asp 








180 










185 










190 






Asp 


Val 


Pro 


Leu 


Leu 


Phe 


Arg 


Arg 


Phe 


Ala 


Ala 


Asp 


Ser 


Ala 


Glu 


Lys 






195 










200 










205 








Tyr 


Arg 


net 


Pro 


Pro 


Leu 


Arg 


Leu 


Ser 


Asp 


Glu 


Ala 


Arg 


Thr 


He 


Leu 




210 










215 










220 










I\-t 


Arg 


Tyr 


Arg 


Trp 


Pro 


Gly 


Asn 


Val 


Arg 


Glu 


Leu 


Arg 


Asn 


He 


Thr 


225 










230 










235 










240 


A^p 


Arq 


Leu 


Ser 


He 


Leu 


Glu 


Glu 


Glu 


Arq 


Thr 


Val 


Ser 


Ala 


Glu 


Thr 










245 










250 










255 




lie 


Thr 


Arg 


Tyr 


Leu 


Asp 


Ala 


Glu 


Gly 


Met 


Gin 


Asp 


Leu 


His 


Pro 


Val 








2 60 










265 










270 






Val 


lie 


Arg 


Arg 


Asn 


Glu 


Thr 


Thr 


Glu 


Ala 


Asp 


Lys 


Gin 


He 


Pro 


His 






275 










280 










285 








Tyr 


Glu 


Arg 


Glu 


He 


He 


Tyr 


Gin 


Val 


Leu 


Tyr 


Asp 


14et 


Lys 


Lys 


Glu 




290 










295 










300 










lie 


Ala 


Asp 


Leu 


Lys 


Gly 


Met 


Met 


Asn 


Arg 


Leu 


Ala 


His 


His 


Glu 


Gin 


305 










310 










315 










320 


Pro 


Ser 


Trp 


Pro 


Val 


Gly 


Ser 


Asp 


Val 


Trp 


Gly 


Asn 


Asp 


Asp 


Lys 


Arg 










325 










330 










335 




Thr 


Ala 


Asp 


Pro 


Lys 


Trp 


Gly 


Val 


Ser 


Thr 


His 


Lys 


Ala 


Pro 


He 


Ala 








34 0 










345 










350 






Asn 


Ala 


Ala 


Glu 


Pro 


Val 


Glu 


Pro 


He 


Gin 


Glu 


Ala 


Ser 


Glu 


Tyr 


Thr 






355 










360 










365 








Glu 


Asp 


Pro 


Val 


Ser 


Leu 


Glu 


Glu 


Val 


Glu 


Lys 


Lys 


Met 


He 


Ser 


Leu 




370 










375 








380 










Ala 


Leu 


Glu 


Arg 


His 


Gly 


Gly 


Arg 


Arg 


Ly^ 


Gin 


Thr 


Ala 


Glu 


Glu 


Leu 


385 










390 










395 










4 00 


Lys 


lie 


Ser 


Glu 


Arg 


Thr 


Leu 


Tyr 


Arg 


Lys 


He 


Lys 


Glu 


Tyr 


Gly 


Leu 










405 










410 










41S 





Glu 



70 (2) INFORMATION FOR SEQ ID NO: 413 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 602 amino acids 

(B) TYPE: amino acid 
75 (D) TOPOLOGY: linear 



BNSDOCID: <WO 9929870A1_I_> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



PCT/AU98/01023 



349/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(ii) liOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

( V i ) OR I G I HAL SOU RCE : 

(A) ORGAl^ I SH: Porphyromona^ gingiva lis 

(ix) FEATURE: 

(A) MAi1E/KEY: mi£c_f eat ure 

(B) LOCATIOII 1 . . .602 

ixi) SEQUENCE DESCRI PTIOI I : SEQ ID MO: 413 

net He Leu Leu Phe Gly Gly Thr Thr Glu Giy Arg Ala Ala Ala Aro 
1 5 10 15 " 

Val Leu Asp Glu Ala Gly Ser Pro Phe Phe Tyr Ser Thr Lys Gly fKsn 

20 25 30 

Leu Gin Glu He Gin Ser Ser His Gly His Ara Leu Thr Gly Ala I let 

35 40 45 

Thr Val Ala Asp Met Val Ser Phe Cys Ara Lys Glu Glu He Arg Leu 

50 55 60 

He Val Asp Ala Ala His Pro Phe Ala Glu Glu Leu His Ala Ser Val 
65 7 0 7 5 8 0 

Ala Glu Ala Ser Glu Gin Thr Gly He Pro Val Val Arg Tyr Glu Arg 

85 90 95 

Gin Tyr Pro Pro Arg Glu Glu Gly He Val Trp Cys Ala Asn Tyr Asp 

100 105 110 

Thr Ala Ala Glu Arg Met Leu Gly Asp Gly Val Gin Arg Leu Leu Met 

115 120 125 

Leu Thr Gly Val Asn Thr He Pro Lys Leu Ala Ala Phe Trp Lys Glu 

130 135 140 

Arg Thr Thr Phe Cys Arg He Leu Lys Arg Asp Glu Ser Val Ala Leu 
l^S 150 155 150 

Ala Glu Lys Asn Gly Phe Pro Ala Glu Arg He Val Phe Phe Glu Pro 

165 170 175 

His Ala Asp Glu Glu Leu Met Gin Ala Val Aro Pro Asp Ala He He 

180 195 " 190 

Thr Lys Glu Ser Gly Glu Ser Gly Tyr Phe Arg Glu Lys He Glu Ala 

155 200 205 

Ala Arg Arg Met Gly He Arg He Tyr Ala Val Val Arg Pro Pro Leu 

210 215 220 

Pro Pro Ser Phe He Pro Val Gly Gly Pro Val Gly Leu Arg Arg Ala 
225 230 235 240 

Val Glu Arg Leu Val Pro Gly Phe Phe Ser Leu Arg Ser Gly Phe Thr 

245 250 255 

Thr Gly Thr Thr Ala Thr Ala Ala Val Val Ala Ala Met Tyr Arg Leu 

260 265 270 

Met Gly Leu Gly Ser Leu Ala Glu Ala Pro Val Glu Leu Pro Ser Gly 

275 280 285 

Glu He Val Ser Leu Pro He Ala Glu He Arg Glu Glu Glu Asp Ala 

2?0 295 300 

Val Val Ser Ala Val Leu Lys Asp Ala Gly Asp Asp Pro Asp Val Thr 
305 310 315 320 

Asn Gly Met Ala Val Cys Ala Thr He Arg Leu Asn Pro Glu His Glu 

325 330 335 

Glu Val Arg Phe Leu Gin Gly Glu Gly Val Gly Val Val Thr Leu Pro 

340 345 350 

Gly Leu Gly Leu Glu Val Gly Gly Pro Ala He Asn Leu Val Pro Arg 

355 360 365 

Arg Met Met Thr Ala Glu Val Arg Arg Leu Tyr Ala Gin Gly Gly Val 

370 375 380 

Asp He Thr He Ser Val Pro Glu Gly Arg Glu Ala Ala Thr Gin Thr 
385 390 395 40O 

Phe Asn Pro Arg Leu Gly He Arg Asp Gly He Ser He He Gly Thr 

4 05 .]10 415 

Ser Gly Val Val Lys Pro Phe Ser Ala Glu Ala Phe Val Gly Ala He 

420 425 430 

Arg Lys Gin Val Gly He Ala Thr Ala Leu Gly Ala Asn His He Val 

435 440 445 

Leu Asn Ser Gly Ala Lys Ser Glu Arg Tyr Val Lys Gly Ala Tyr Pro 

450 455 460 

Ala Leu He Pro Gin Ala Phe Val Gin Tyr Gly Asn Phe Val Gly Glu 
465 470 475 480 

Ser Leu Ser Cys Val Ala Ser Phe Pro Ser Val Arg Ser Val Thr Val 



BNSDOCID: <WO 9929870A1_L> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



PCT/AU98/01023 



350/ 490 



25 



30 











*i85 










4 90 










4 95 




Gly 


He 


Met 


Leu 


Gly 


Lys 


Al a 


Val 


Lys 


Leu 


Alh 


Glu 


Gly 


Tyr 


Leu 


Asp 








500 










505 










510 






'I'hir 


Hts 


Ser 


Ly^ 


Lys 


Val 


Val 


Met 


Asn 


Arg 


Asp 


Phe 


Leu 


His 


Glu 


Leu 






515 










520 










525 








Ala 


Arg 


Gin 


Ala 


Gl y 


Cys 


Ser 


Glu 


Asp 


lie 


His 


Ala 


He 


lie 


Asp 


Ser 




530 










535 










54 0 










Leu 


Asn 


Leu 


Ala 


Arg 


Glu 


Leu 


Trp 


Thr 


Met 


Pro 


Ser 


Ala 


Glu 


Asp 


Ser 












550 










555 










560 


Asp 


Arg 


Leu 


Lsu 


Arg 


Lys 


He 


Ala 


Glu 


Arg 


Ser 


Trp 


Glu 


Thr 


Cys 


Arg 










565 










570 










57 5 




rro 


Ser 


Val 


Pro 


Ser 


Ala 


Glu 


Leu 


Glu 


Leu 


Leu 


Leu 


He 


Asp 


Glu 


Ser 








58 0 










585 










590 






Gly 


Ala 


He 


Arg 


Phe 


Arg 


He 


Gly 


Gly 


Glu 















10 



15 595 600 

(2) IlJFORnATIOII FOR SEQ ID MO: 410 

(i) SBQUEIICE CHARACTERISTICS: 
20 (A) LENGTH: 44 3 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

( V i ) OR I G 1 1 1 AL SOURCE : 

(A) ORGAIIISM : Porphyromonas gingival if 

(ix) FEATURE: 

(A) HAME/KEY: misc_feature 

(B) LOCATION 1 . . .443 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID MO: 414 

Ilet Leu Arg Thr Phe Arg He Gly Gly He His Pro Pro Glu Asn Lys 
15 10 15 

Leu Ser Ala Gly Lys Pro Val Glu Val Leu Pro He Pro Ser Gin Val 
40 2 0 2 5 30 

Val He Pro Leu Gly Gin His He Gly Ala Pro Ala Thr Ala Thr Val 

35 40 45 

Lys Lys Gly Asp Glu Val Lys Val Gly Thr He He Ala Gin Ala Gly 
50 55 60 

45 Gly Phe Val Ser Ala Asn He His Ser Ser Val Ser Gly Lys Val Leu 
65 7 0 7 5 80 

Lys He Asp Asn Val Tyr Asp Ser Ser Gly Tyr Pro Lys Pro Ala Val 

85 90 95 

Phe He Ser Val Glu Gly Asp Glu Trp Glu Glu Gly He Asp Arg Ser 
50 100 105 110 

Pro Ala He Val Lys Glu Cys Asn Leu Asp Ala Lys Glu He Val Ala 

115 120 125 

Lys He Ser Ala Ala Gly He Val Gly Leu Gly Gly Ala Thr Phe Pro 
130 135 140 

55 Thr His Val Lys Leu Ser Pro Pro Pro Gly Asn Lys Ala Glu He Leu 
145 150 155 160 

He He Asn Ala Val Glu Cys Glu Pro Tyn Leu Thr Ser Asp His Val 

165 170 175 

Leu Met Leu Glu His Gly Glu Glu He Met He Gly Val Ser He Leu 
60 100 185 190 

Met Lys Ala He Gin Val Asn Lys Ala Val He Gly Val Glu Asn Asn 

195 200 205 

Lys Lys Asp Ala He Ala His Leu Thr Lys Leu Ala Thr Ala Tyr Fro 
210 215 220 

65 Gly He Glu Val Met Pro Leu Lys Val Gin Tyr Pro Gin Gly Gly Glu 
225 230 235 240 

Lys Gin Leu He Asp Ala Val He Arg Lys Gin Val Lys Ser Gly Ala 

245 250 255 

Leu Pro He Ser Thr Gly Ala Val Val Gin Asn Val Gly Thr Val Phe 
70 260 265 270 

Ala Val Tyr Glu Ala Val Gin Lys Asn Lys Pro Leu Val Glu Arg He 

275 280 285 

Val Thr Val Thr Gly Lys Lys Leu Ser Arg Pro Ser Asn Leu Leu Val 
2 90 2 95 300 

75 Arg He Gly Thr Pro He Ala Ala Leu He Glu Ala Ala Gly Gly Leu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 



PCT/AU98/01023 



351/490 



10 



15 



20 



25 



45 



50 



55 



60 



65 



70 



75 



305 










310 










315 










320 


Pro 


Glu 


Asn 


Thr 


Gly 
325 


Lys 


He 


He 


Gly 


Gly 
330 


Gly 


Pro 


Met 


Her 


Gly 
335 


Arg 


Ala 


Leu 


Leu 


Ser 
340 


Pro 


Asp 


Val 


Pro 


Val 
345 


Thr 


Lys 


Gly 


Ser 


Ser 
350 


Gly 


Val 


Lsu 


He 


Leu 
355 


Asp 


Arg 


Glu 


Glu 


Ala 
360 


Val 


Arg 


Lys 


Pro 


Met 
365 


Arg 


Asp 


Cys 


He 


Arq 
370 


Cys 


Ala 


Lys 


Cys 


Val 
375 


Gly 


Val 


Cys 


Fro 


flet 
380 


Gly 


Leu 


Asn 


Pro 


Ala 


Phe 


Leu 


Met 


Arg 


Asp 


Thr 


Leu 


Tyr 


Lys 


Ser 


Trp 


Glu 


Thr 


Ala 


Glu 


385 










390 










395 










4 00 


Lys 


Gly 


Asn 


Val 


Val 
405 


Asp 


Cys 


He 


Glu 


Cys 
410 


Gly 


Ser 


Cys 


Ser 


Phe 
415 


Thr 


Cys 


Pro 


Ala 


Asn 
420 


Arg 


Pro 


Leu 


Leu 


Asp 
425 


Tyr 


He 


Arg 


Gin 


Ala 
430 


Lys 


Lys 


Thr 


Val 


net 

^35 


Gly 


He 


Gin 


Arg 


Ala 
^40 


Arg 


Lys 


Gin 













(2) IMPORI lATIOII FOR SEQ ID MO: 415 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LEIIGTH: 479 amino acids 

(B) TVFE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) MAI IE/KEY: misc_f eature 
35 (B) LOCATIOlf 1...47 9 

(xi) SEQUEMCE DESCRIPTIOM: SEQ ID MO: 415 



40 



Met 


Lys 


Arg 


He 


Gin 


Leu 


Thr 


Leu 


He 


Ala 


Leu 


Phe 


Ala 


Ala 


Val 


Ala 


1 








5 










10 










15 




Gly 


Leu 


Val 


Ala 


Gin 


Asn 


Ala 


Tyr 


Glu 


Gly 


Val 


He 


Ser 


Tyr 


Lys 


He 








20 










25 










30 




Ser 


Leu 


Asp 


Lys 


Thr 


Gly Asn 


Lys 


Val 


Val 


Leu 


Asn 


Gly 


Ala 


Ala 


Asp 






35 










40 










45 






Met 


Ser 
50 


Asn 


Leu 


Lys 


Leu 


Lys 
55 


Ser 


Thr 


Gin 


Met 


He 

60 


He 


Val 


Thr 


Pro 


He 


Leu 


Arg 


Ser 


Glu 


Asp 


Gly 


Thr 


Ser 


Arg 


Val 


Glu 


Phe 


Pro 


Ser 


Val 


65 










7 0 








7 ^- 










8 0 


Val 


He 


Thr 


Gly 


Arg 
85 


Asn 


Arg 


Thr 


Lys 


Ala 
90 


Leu 


Lys 


Arg 


Glu 


He 
95 


Ala 


Phe 


Ser 


Ser 


Ala 


Leu 


Pro 


Gin 


Ala 


Lys 


His 


Ala 


Ala 


Gin 


Tyr 


He 


Arg 








100 










105 










110 




Arg 


His 


Asn 
115 


Gly 


Lys 


Ser 


Glu 


Gin 
120 


Phe 


Ala 


Pl-ie 


Thr 


Gly 
125 


Glu 


His 


Ala 


Tyr 


Ala 


Ser 


Trp 


Met 


Met 


Asp 


Ala 


Lys 


Phe 


Val 


Val 


Arg 


Glu 


Glu 


Val 




130 










135 










140 








Arg 


Gly 


Cys 


Ala 


Lys 


Cys 


Pro 


Val 


Gly 


Leu 


Ser 


Ser 


Asn 


He 


Val 


Pro 


145 










150 










155 










160 


Phe 


Asp 


Pro 


Leu 


Phe 


Asn 


Pro 


Ala 


Glu 


Ala 


Pro 


Tyr 


Leu 


Leu 


Ala 


His 










165 










170 








175 




He 


Thr 


Pro 


Ala 
180 


Glu 


Glu 


Val 


Glu 


Lys 
185 


Gin 


Arg 


Glu 


Ger 


Ser 
190 


Phe 


Asp 


Ala 


Tyr 


He 
195 


Asn 


Phe 


Lys 


Val 


Asn 
200 


Lys 


Ala 


Asp 


Val 


Leu 
205 


Pro 


Glu 


Tyr 


Arg 


Asn 


Asn 


Lys 


Ala 


Glu 


Leu 


Glu 


Lys 


He 


Lys 


Gl u 


Phe 


Val 


Ser 


Thr 




210 










215 








220 










Val 


Lys 


Ala 


Asn 


Pro 


Asn 


Tyr 


Ser 


Val 


Asn 


Lys 


Met 


He 


He 


Glu 


Gly 


225 










230 










235 










240 


Phe 


Ala 


Ser 


Pro 


Glu 
245 


Ala 


Ser 


He 


Ala 


His 
250 


Asn 


Lys 


Ala 


Leu 


Ser 
255 


Glu 


Arg 


Arg 


Ala 


Lys 


Arg 


Leu 


Ala 


Glu 


Glu 


Leu 


Val 


Arg 


Lys 


Tyr 


Gly 


Lys 








260 










2 65 










270 


Thr 


Leu 


Pro 


Asn 


He 


Thr 


Thr 


Glu 


Phe 


Gly 


Gly 


Glu 


Asp 


Trp 


Lys 


Gly 






275 










280 










285 




Leu 


Lys 


Leu 


Ala 


He 


Glu 


Lys 


Ser 


Asp 


He 


Ala 


Asp 


Arg 


Asp 


Arg 


Val 
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352/ 490 







290 










295 










300 












Leu 


Glu 


He 


He 


Asm 


Ser 


Asp 


Lys 


Tyr 


Ala 


Asp 


Asp 


Asp 


Ala 


Arg 


Glu 




305 










310 










315 










320 




Gin 


Ala 


Leu 


Lys 


Gl n 


Leu 


Ser 


Ser 


Tyr 


Arg 


Tyr 


He 


Leu 


Asp 


Gin 


He 


5 










325 










330 










335 






Tyr 


Pro 


Am 


Leu 
340 


Arg 


Arg 


Asn 


Thr 


He 
345 


Thr 


MeL 


Gly 


Tyr 


He 
350 


Val 


Arg 




A:?p 


Tyr 


Thr 


Leu 


Glu 


Glu 


Ala 


Arg 


Glu 


He 


He 


Lys 


Thr 


Ala 


Pro 


Lys 


10 






355 










360 










365 








Glu 


Leu 
37 0 


Ser 


Gl u 


Ala 


Glu 


net 

375 


Tyr 


Arg 


Val 


Ala 


Met 
380 


Ser 


Tyr 


Pro 


Glu 




Gly 


His 


Gin 


Glu 


Arg 


Leu 


Phe 


Ala 


Leu 


Asn 


Thr 


Thr 


Leu 


Lys 


Tyr 


Phe 




385 










390 










395 










400 




Pro 


Glu 


Ser 


Val 


Thr 


Gly 


Arg 


He 


Asn 


Leu 


Ala 


Val 


Ala 


Ala 


Phe 


Asn 


15 










4 05 










410 










415 






Gly 


Gly 


Asp 


Val 

4 20 


Gin 


Gin 


Ala 


He 


Ala 

425 


Leu 


Leu 


Ser 


Pro 


He 
430 


Gin 


Thr 




Glu 


Lys 


Gly 


Val 


Ser 


Asn 


He 


Leu 


Gly 


Ala 


Ala 


Tyr 


Ala 


Arg 


Thr 


Gly 


20 






435 










440 










4 45 








Asp 


Phe 
450 


Ala 


At<3 


Ala 


Glu 


Thr 
455 


Phe 


Phe 


Arg 


Lys 


Ala 
4 60 


Val 


Ala 


GLu 


Gly 




Asp 


Ala 


A.-^n 


Ala 


Gin 


Arg 


Asn 


Leu 


Asp 


Met 


Leu 


Leu 


Gly 


Ly^ 


Lys 






465 










4 70 










475 













25 (2) lUFORIlATIOlJ FOR SEQ ID HO: 416 

(i) SEQUEllCE CHARACTERISTICS: 

{A) LEMGTH: 383 amino acids 
(B) TYPE: amino acid 
30 ( D ) TO POLOGY : linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



35 



45 



50 



55 



60 



65 



70 



75 



(vi) ORIGIMAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 
40 (A) MAME/KEY: misc_feature 

(B) LOCATION 1 . . . 383 



(xi) SEQOEHCE DESCRIPTION: SEQ ID MO: 416 



Met 


Ala 


Glu 


Lys 


Arg 


Asp 


Tyr 


Tyr 


Glu 


Val 


Leu 


Gly 


Val 


Ser 


hys 


Asn 


1 


















10 










15 




Ala 


Thr 


Asp 


Asp 


Glu 


Leu 


Lys 


Lys 


Ala 


Tyr 


Arg 


Lys 


Lys 


Ala 


He 


Gin 








20 










25 










30 






Tyr 


His 


Pro 


Asp 


Lys 


Asn 


Pro 


Gly 


Asp 


Lys 


Glu 


Ala 


Glu 


Gl u 


His 


Phe 






35 










4 0 










45 








Lys 


Glu 


Val 


Ala 


Glu 


Ala 


Tyr 


Asp 


Val 


Leu 


Ser 


Asp 


Pro 


Gin 


Lys 


Arg 




50 










55 










60 










Ser 


Gin 


Tyr 


Asp 


Gin 


Phe 


Gly 


His 


Ala 


Gly 


Leu 


Gly 


Gly 


Ala 


Ala 


Gly 


65 










70 










75 










80 


Gly 


Gly 


Phe 


Ser 


Gly 


Gly 


Gly 


!-let 


Ser 


Met 


Glu 


Asp 


He 


Phe 


Ser 


Arg 










85 










90 










95 




Phe 


Gly 


Asp 


Leu 


Phe 


Gly 


Gly 


Phe 


Gly 


Gly 


Phe 


Gly 


Gly 


Phe 


Ser 


Asp 








100 










105 










110 






Met 


Gly 


Gly 


Gly 


Ser 


Arg 


Arg 


Arg 


Val 


Arg 


Arg 


Gly 


Ser 


Asp 


Leu 


Arg 






115 










120 










125 








Val 


Arg 


Val 


Lys 


Leu 


Ser 


Leu 


Ala 


Asp 


He 


Ser 


Lys 


Gly 


Val 


Glu 


Lys 




130 










135 










140 










Lys 


Val 


Lys 


Val 


Lys 


Lys 


Gin 


Val 


Val 


Cys 


Ser 


Lys 


Cys 


Arg 


Gly 


Asp 


145 










150 










155 










160 


Gly 


Thr 


Glu 


Glu 


Ala 


Asn 


Gly 


Lys 


Thr 


Thr 


Cys 


Gin 


Thr 


Cys 


His 


Gly 










165 










170 










175 




Thr 


Gly 


Val 


Val 


Thr 


Arg 


Val 


Ser 


Asn 


Thr 


Phe 


Leu 


Gly 


Ala 


14et 


Gin 








180 










185 










190 






Thr 


Gin 


Ser 


Thr 


Cys 


Pro 


Thr 


Cys 


His 


Gly 


Glu 


Gly 


Glu 


He 


He 


Thr 






195 










200 










205 








Lys 


Pro 


Cys 


Ser 


Lys 


Cys 


Lys 


Gly 


Glu 


Gly 


Val 


Glu 


He 


Gly 


Glu 


Glu 




210 










215 










220 










Val 


He 


Ser 


Phe 


His 


He 


Pro 


Ala 


Gly 


Val 


Ala 


Glu 


Gly 


Met 


Gin 


Met 


225 










230 








235 










240 


Ser 


Val 


Asn 


G} y 


Lys 


Gly 


Asn 


Ala 


Ala 


Pro 


Arg 


Gly 


Gly 


Val 


Asn 


Gly 
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2^5 5 










250 










255 




Asp 


Leu 


lie 


Val 


Val 


He 


Ala 


Glu 


Glu 


Pro 


Asp 


Pro 


Asn 


Leu 


He 


Arg 








260 










265 










270 




Asn 


Gly 


Asn 


Asp 


Leu 


He 


Tyr 


Asn 


Leu 


Leu 


He 


Ser 


Val 


Pro 


Leu 


Ala 






275 










280 










285 








lie 


Lys 


Gly 


Gly 


Ser 


Val 


Glu 


Val 


Pro 


Thr 


He 


Asp 


Gly 


Arg 


Ala 


Lys 




290 










295 










300 










lie 


Arg 


lie 


Glu 


Ala 


Gly 


Thr 


Gin 


Pro 


Gly 


Lys 


Met 


Leu 


Arg 


Leu 


Arg 


305 










310 










315 










320 


Asn 


Lys 


Gly 


Leu 


Pro 


Ser 


Val 


Asn 


Gly 


Tyr 


Gly 


Met 


Gly 


Asp 


Gin 


Leu 










325 










330 










335 




Val 


Asn 


Val 


Asn 


Val 


Tyr 


He 


Pro 


Glu 


Ser 


He 


Asp 


Ala 


Lys 


Asp 


Glu 








3^0 




















350 




Gin 


Ala 


He 


Ala 


Ala 


Met 


Glu 


Asn 


Ser 


Asp 


Ser 


Phe 


Lys 


Pro 


Thr 


Asp 






355 










360 










365 








Ala 


Ala 


Arg 


Lys 


Asp 


He 


Asp 


Lys 


Lys 


Tyr 


Arg 


Glu 


Met 


Leu 


Asp 






370 










375 










380 









(2) IDFORMATIOII FOR SEQ ID UQ:411 

(i) SEQUENCE CHARACTERISTICS: 

tA) LEMGTH: 2 93 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

30 (vi) ORIGIIiAL SOURCE: 

(A) ORGAHISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
35 (B) LOCATION 1...293 

(>:i ) SEQUENCE DESCRIPTION: SEQ ID JJO:417 

Met 

40 



45 



Met 


Lys 


Lys 


Leu 


He 


Leu 


A] a 


Thr 


Leu 


Gly 


Leu 


Met 


Ala 


He 


Ala 


Met 


1 








5 










10 










15 




Leu 


Ser 


Cy^ 


Ser 


Ser 


Asn 


Asn 


Lys 


Asp 


Leu 


Glu 


Asn 


Lys 


Gly 


Glu 


Ala 








20 










25 










30 






Thr 


Leu 


Leu 


Val 


Thr 


Phe 


Gly 


Ser 


Ser 


Tyr 


Lys 


Ala 


Pro 


Arq 


Glu 


Thr 






35 










40 










45 








Tyr 


Ala 


Lys 


He 


Glu 


Lys 


Thr 


Phe 


Ala 


Ala 


Ala 


Tyr 


Pro 


Asp 


Gin 


Arg 




50 










55 










60 








He 


Ser 


Trp 


Thr 


Tyr 


Thr 


Ser 


Ser 


He 


He 


Arg 


Lys 


Lys 


Leu 


Ala 


Gin 


65 










7 0 










75 








8 0 


Gin 


Gly 


lie 


Tyr 


He 


Asp 


Ala 


Pro 


Asp 


Glu 


Ala 


he-u 


Glu 


Lys 


Leu 


Ala 










85 










90 










95 




Arg 


Leu 


Gly 


Tyr 


Lys 


Lys 


He 


Asn 


Val 


Gin 


Ser 


Leu 


His 


Val 


He 


Pro 








100 










1 05 










110 






GJy 


Arg 


Gl u 


Tyr 


Asp 


Glu 


Met 


He 


Asp 


Phe 


Val 


Asn 


Lys 


Phe 


Lys 


Ala 






115 










120 










125 






Ala 


His 


Ser 


Asp 


He 


Thr 


Val 


Lys 


Val 


Gly 


Ala 


Pro 


Leu 


Phe 


Asp 


Thr 




130 










135 










140 










Asp 


Glu 


Asp 


Met 


Arg 


Glu 


Val 


Ala 


Glu 


He 


Leu 


His 


Lys 


Arg 


Phe 


Gin 


1.15 










150 










155 










160 


Gin 


Thr 


He 


Glu 


Lys 


Gly 


Glu 


Ala 


He 


Val 


Phe 


Met 


Gly 


His 


Gly 


Thr 










165 










170 










175 




Gl u 


His 


Ala 


Ala 


Asn 


Asp 


Arg 


T^'r 


Ala 


Arg 


He 


Asn 


Lys 


He 


Met 


Lys 








180 










185 










190 




Asn 


Tyr 


Ser 


Lys 


Phe 


Met 


He 


Val 


Gly 


Thr 


Val 


Gl u 


Ser 


Asp 


Pro 


Ser 






195 










200 










205 








He 


Asn 


Asp 


Val 


He 


Ala 


Glu 


Leu 


Ly3 


Glu 


Thr 


Gly 


Ala 


Thr 


Ala 


Vai 




210 










215 










22 0 










Thr 


Met 


Met 


Pre 


Leu 


Met 


Ser 


Val 


Ala 


Gly 


Asp 


His 


Ala 


Thr 


Asn 


Asp 


225 










230 










235 










240 


Met 


Ala 


Gly 


Asp 


Glu 


Asp 


Asp 


Ser 


Trp 


Lys 


Thr 


Leu 


Leu 


Thr 


Asn 


Ala 










2.15 










250 










255 




Gly 


Tyf 


Thr 


Val 


Ser 


He 


Asp 


Lys 


Leu 


Asp 


Asn 


Gly 


Asn 


Phe 


Ser 


Ala 








260 










265 










270 






Leu 


Gly 


Asp 


He 


g:u 


Glu 


He 


Arg 


Asn 


He 


Trp 


Leu 


Lys 


His 


Met 


Lys 






275 










280 










285 








Ala 


Thr 


Ser 


Ala 


Arg 
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290 

(2) ItlFORtlATIOlI FOR SEQ ID IIO:418 

(i) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 356 amino acids 
{B) TYPE: amino acid 
{D} TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAiasil: Porphyromonas gingiva lis 

fix) FEATURE: 

(A) ilAilE/KEY: misc_feature 

( B) LOCATION 1 ... 356 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418 



25 



30 



35 



40 



45 



50 



55 



60 



65 



1 let 


Tlir 


Se r 


Val 


Ser 


His 


L'^u 


Arg 


Thr 


Tic 

lie 


Ser 


Val 


Al a 


Gly 


lie 


Leu 


1 








5 










1 C' 










1 5 




Ala 


Ala 


Leu 


Gly 

2 0 


Gly 


Ala 


Val 


Leu 


He 
25 


Leu 


Phe 


Gly 


Val 


Asn 
3 0 


Leu 


Phe 


Leu 


Gl y 


Ser 
35 


Val 


Ala 


He 


ro 


net 

40 


Ser 


Glu 


j. 1 e 


Phe 


Arg 
4 5 


His 


Leu 


Phe 


Ser 


Asp 


Arg 


Pro 


Glu 


Gly 


Gly 


Glu 


Ala 


Leu 


Val 


His 


Tyr 


Asn 


He 


Leu 




50 










O D 










60 








Trp 


Lys 


Ser 


Arg 


Leu 


Pro 


Glu 


Ala 


Leu 


Thr 


Ala 


Ala 


Phe 


Ala 


Gly 


Ala 


6 5 










f u 










75 








80 


Gly 


Leu 


Ser 


Val 


Ser 


Gly 


Leu 


Gin 


Met 


Gin 


Thr 


Val 


Phe 


Arg 


Asn 


Pro 










8 5 










90 








95 




Leu 


Ala 


Gly 


Pro 


Ser 


Val 


Leu 


Gly 


He 


Ser 


Ser 


Gly 


Ala 


Ser 


Leu 


Gly 








100 










105 








110 




Val 


Ala 


Leu 
115 


Val 


Val 


Leu 


Leu 


Ser 
120 


Gly 


Ser 


Leu 


Gly 


Gly 
125 


Val 


Ala 


Leu 


Ser 


Ser 
130 


Leu 


Gly 


Tyr 


Met 


Gly 

135 


Glu 


Val 


Ala 


Met 


Asn 
14 0 


He 


Ala 


Ala 


Ala 


Val 


Gly 


Ser 


Leu 


Ala 


Val 


Met 


Gly 


Leu 


He 


Val 


Phe 


Val 


Ser 


Thr 


Lys 


145 










150 










155 










160 


Val 


Arg 


Ser 


His 


Val 
165 


Thr 


Leu 


Leu 


He 


He 
170 


Gly 


Val 


Met 


He 


Gly 
175 


Tyr 


Va 1 


Ala 


Thr 


Ala 
180 


Val 


He 


Gly 


Val 


Phe 
185 


Lys 


Phe 


Phe 


Ser 


He 
190 


Glu 


Glu 


Asp 


He 


Arg 


Ala 


Tyr 


Val 


He 


Trp 


Gly 


Leu 


Gly 


Ser 


Phe 


Ser 


Arg 


Ala 






1?5 










200 










205 






Thr 


Asp 
210 


Ser 


Gin 


Leu 


Ser 


Phe 
215 


Phe 


Ala 


He 


Leu 


liet 

22 0 


Leu 


He 


Phe 


He 


Pro 


Ala 


Gly 


1 let 


Leu 


Leu 


Val 


Lys 


Gin 


Leu 


Asn 


Leu 


Leu 


Leu 


Leu 


Gly 


225 










230 










235 










24 0 


Glu 


Ser 


Tyr 


Ala 


Arg 


Asn 


Leu 


Gly 


Leu 


Asn 


Thr 


Arg 


Arg 


Ala 


Arg 


Leu 










245 










250 








255 




Leu 


Val 


He 


Ser 
260 


Ser 


Ala 


Gly 


Leu 


Leu 
265 


He 


Ala 


Thr 


Val 


Thr 
270 


Ala 


Tyr 


Cys 


Gly 


Pro 


He 


Gly 


Phe 


Leu 


Gly Met 


Ala 


Val 


Pro 


His 


Leu 


Ala 


Arg 






275 










280 










285 






Val 


Tie 
290 


Phe 


His 


Thr 


Ser 


Asp 
295 


His 


Arg 


He 


Leu 


Met 
300 


Pro 


Ala 


Thr 


Cys 


Leu 


He 


Gly 


Ser 


Ala 


Leu 


Ala 


Leu 


Phe 


Cys 


Asn 


He 


He 


Ala 


Arg 


Met 


305 










310 








315 








320 


Pro 


Gly 


Phe 


Glu 


Gly 
325 


Ala 


Leu 


Pro 


Val 


Asn 

330 


Ser 


Val 


Thr 


Ala 


Leu 
335 


Val 


Gly 


Ala 


Pro 


He 
34 0 


He 


Val 


Thr 


Val 


Leu 
345 


Phe 


Arg 


Arg 


Arg 


Arg 
350 


Phe 


Lys 


Glu 


Glu 


Thr 
355 


Asp 



























70 (2) INFORMATION FOR SEQ ID NO: 419 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 757 amino acids 

(B) TYPE: amino acid 
75 (D) TOPOLOGY: linear 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_L> 
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(ii) MOLECULE TVFE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIMAL SOURCE: 

(A) ORGANISM: Porphyromona s gingiva lis 

ii::) FEATURE: 

(A) MAKE/KEY; mi sc_f eature 

(B) LOCATIOM 1 ... 757 

(::i) SEQUEIICE DESCRIPTIOM: SEQ IDIfO:419 



Met 


Arg 


Thr 


Lys 


Thr 


He 


Phe 


Phe 


Ala 


He 


He 


Ser 


Phe 


He 


Ala 


Leu 


1 








5 










10 










i 5 




Leu 


Ser 


Ser 


Ser 


Leu 


Ser 


Ala 


Gin 


Ser 


Lvs 


Al a 


Val 


Leu 


Thr 


Glv 


Ser 
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30 




Val 


Ser 


Asp 


Ala 


Glu 


Thr 


Glv 


Glu 


Pro 


Leu 


Ala 


Gl V 


Ala 


Arg 


I le 


Glu 






35 










4 0 










4 5 








Val 


Lys 




Thr 


Asn 


He 


Val 


Al a 


Gl 


Ala 


Asp 


Ala 


Gly 


Gl^- 


His 


Plie 




50 










55 










60 










Gl Li 


He 




Asn 


Leu 


Pro 


Ala 


Gl-'- 

VJ J. J, 


Gl n 


His 


Thr 


He 


He 




S'^r 


Leu 


65 










7 0 










~ 5 










80 


Gl'.- 


Gly 


Tyr 


Gl-j' 


Gin 




Glu 


Glu 


Val 


Val 


Al a 


He 


Glu 


Al a 


J J. J' 


Gin 










35 










90 










95 




Thr 


L\*s 


Thr 


He 


Ser 


Fhe 


Al a 


Leu 


Arg 


Leu 


rtrg 


Til r 


Asn 


Asn 




Glu 








100 










1 05 










11 0 






Gl u 


Val 


Val 


Val 


Thr 


Gly 


Thr 


Gl V 


Thr 


A -a 




Arg 


Leu 


Val 


A«n 


Ala 






115 










120 










125 








Pro 


Val 


Ala 


Thr 


Glu 


Val 


Leu 


Thr 


Ala 


Lys 


Asp 


He 


Ala 


Ser 


Phe 


Ser 




130 










135 










140 










Ala 


Pro 


Thr 


Ser 


Glu 


Ala 


Leu 


Leu 


Gin 


Glv 


Leu 


Ser 


Pro 


Ser 


Phe 




145 










150 










1 55 










160 


Phe 


Gly 


Pro 


Asn 


Leu 


Met 


Glv 


Ser 


Phe 


Met 


Gin 


Leu 


Asn 


Gl'-' 


Leu 


Ser 










165 










170 










175 




Ser 






He 


Leu 


He 


L»u 


He 


Asp 


Gly 




rg 


Val 




m V 
^±y 


Asp 








180 










185 










190 






Val 


Gl^* 


Glj- 


Gin 


Ala 


ASD 


Leu 


Ser 


Arg 


He 


Ser 


Pro 






He 


Gl u 






195 










200 










205 








Arg 


He 


Glu 


Leu 


Val 


Lys 


Gly 


Ala 


Ser 


Ser 


Ser 


Leu 


Tyr 


Gly 


Ser 


Asp 




210 










215 










220 










Ala 


He 


Ala 


Gly 


Val 


He 


Asn 


Val 


He 


Thr 


Lys 


Lvs 


Asn 


Thr 


Asn 


Arg 


22 5 










230 










235 








240 


Leu 


Ser 


Ala 


Tyr 


Thr 


Ser 


His 


Arg 


He 


Ser 


L'-'s 


Tvr 


Asn 


ASD 


Arg 


Gin 










245 










250 










255 




Thr 


Asn 


Thr 


Ser 


Leu 


Asp 


He 


Asn 


He 


Gly 


Lys 


Fhe 


Ser 


Ser 


Asn 


Thr 








260 










265 










270 






Asr. 


Tyr 


Fhe 


Phe 


Tyr 


His 


Thr 


Asp 


Gly 


Trp 


Gin 


Asn 


Ser 


Pro 


Phe 


Glu 






275 










280 










285 








He 


Lys 


Lys 


Lys 


Lys 


Gly 


Ser 


Gly 


Glu 


Pro 


Val 


Leu 


Glu 


Glu 


Thr 


Tyr 




290 










2 95 










300 








hys 


Lys 


Thr 


Phe 


Arg 


Ala 


Gin 


Glu 


Asn 


Gin 


Gly 


Val 


Ser 


Gin 


Ser 


Leu 


305 










310 










315 










320 


Ser 


Tyr 


Tyr 


Ala 


Thx- 


Asn 


Asn 


Leu 


Ser 


Phe 


Ser 


Gly 


Asn 


Val 


Gin 


Tyr 










325 










330 










335 




hsn 


Lys 


Arg 


Gin 


He 


Phe 


Thr 


Pro 


Thr 


Phe 


Ser 


Glu 


Lys 


Ly^ 


Ala 


Tyr 








34 0 










345 










350 






hsp 


Met 


Asp 


Tyr 


Arg 


Ala 


Leu 


Thr 


Ala 


Ser 


Leu 


Gly 


Thr 


Asn 


Tyr 


Leu 






355 










360 










365 








Phe 


Pro 


Asn 


Gly 


Leu 


His 


Thr 


Leu 


Ser 


Phe 


Asp 


Ala 


Val 


Tyr 


Asp 


Arg 




37 0 










375 










300 










Phe 


Arg 


Phe 


^ly 


Tyr 


Leu 


Tyr 


His 


Asp 


Lys 


Asp 


Ser 


Ser 


Glu 


Ser 


Leu 


385 










390 










395 










4 00 


He 


Asn 


Asn 


Gin 


Gly 


Gin 


Thr 


Glu 


Gin 


Pro 


Thr 


Phe 


Phe 


Pro 


Gly 


Gin 










4 05 










410 










415 




Leu 


Arg 


Asn 


Lys 


Asn 


Asp 


Gin 


He 


Arg 


Tyr 


Thr 


Ala 


Glu 


Ala 


Arg 


Gly 








420 










425 










430 






Vol 


Phe 


Thr 


Leu 


Pro 


Tyr 


Ala 


Gin 


Lys 


Leu 


Thr 


Gly 


Gly 


Leu 


Glu 


Tyr 






435 










440 










445 






The 


Arg 


Glu 


Gl u 


Leu 


He 


Ser 


Pro 


Tyr 


Asn 


Leu 


He 


Thr 


Asp 


Lys 


Ala 




450 










455 










460 










Asp 


Ala 


Ser 


Thr 


Leu 


Ser 


Ala 


Tyr 


Val 


Gin 


Asp 


Glu 


Trp 


Lys 


Pro 


Leu 


465 










47 0 










475 










4 80 


Asp 


Trp 


Phe 


Asn 


Met 


Thr 


Ala 


Gly 


Phe 


Arg 


Leu 


Val 


His 


His 


Gin 


Glu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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wo 99/29870 PCT/AU98/01023 

356/490 



45 



50 











485 










4 90 










4 95 




Phe 


Gly 


Thr 


Arg 


Met 


Thr 


Pro 


Lys 


Val 


Ser 


He 


Leu 


Ala 


Lys 


Tyr 


Gly 








500 










505 










510 






Pro 


Leu 


Asn 


Phe 


Arg 


Ala 


Thr 


Tyr 


Ala 


Asn 


Gly 


Tyr 


Lys 


Thr 


Pro 


Thr 






515 










520 










525 








Leu 


Lys 


Glu 


Leu 


Phe 


Ala 


Arg 


Asn 


Glu 


Leu 


Thr 


Thr 


Met 


Gly 


Ser 


His 




530 










535 










540 










Asn 


Leu 


Tyr 


Leu 


Gly 


Asn 


Ala 


Asp 


Leu 


Lys 


Pro 


Gin 


Met 


Ser 


Asp 


Tyr 


SAb 










550 










555 










560 


Tyr 


Ala 


Leu 


Gly 


Leu 


Glu 


Tyr 


Asn 


Gin 


Gly 


Pro 


He 


Ser 


Phe 


Ser 


Ala 










565 










570 










575 




Th L- 


Val 


Tyr 


Asp 


Asn 


Glu 


Leu 


Arg 


Asn 


Leu 


He 


Ser 


Phe 


Met 


Asp 


He 








580 










585 










590 






Pro 


Thr 


Ser 


Pro 


Glu 


His 


Glu 


Ala 


Gin 


Gly 


He 


Lys 


Lys 


Thr 


Lys 


Gin 






595 










600 










605 








Tyr 


Ala 


Asn 


He 


Gly 


Lys 


Ala 


Arg 


Ser 


Arg 


Gly 


Leu 


Asp 


Val 


Leu 


Cys 




610 










615 










620 










A^p 


Ala 


Ser 


He 


Gly 


Trp 


Gly 


He 


Lys 


Leu 


Gly 


Ala 


Gly 


Tyr 


Ser 


Leu 


625 










630 










635 










640 


Val 


Glu 


Ala 


Lys 


Asn 


Leu 


Gin 


Thr 


Asp 


Glu 


Trp 


Leu 


Gl u 


Gly 


Ala 


Ala 










645 










650 










655 




Arg 


His 


Arg 


Ala 


Asn 


Val 


His 


Ala 


Asp 


Trp 


Val 


His 


Tyr 


Trp 


Gly 


Gin 








660 










665 










670 






Tyr 


Arg 


Leu 


Gly 


Val 


Ser 


Leu 


Phe 


Gly 


Arg 


He 


Gin 


Ser 


Glu 


Arg 


Tyr 






67 5 










680 










685 








Tyr 


Lys 


Asp 


Gly 


Asn 


Ala 


Pro 


Asp 


Tyr 


Thr 


Leu 


Trp 


Arg 


Leu 


Ala 


Thr 




6 90 










695 










7 00 










S-r 


His 


Arg 


Phe 


Ala 


His 


Phe 


Arg 


His 


He 


He 


Leu 


Asp 


Gly 


Thr 


Leu 


705 










7 1 0 










715 










720 


Giy 


He 


Asp 


Asn 


Leu 


Phe 


Asp 


Tyr 


Vai 


Asp 


Asp 


Arg 


Pro 


Met 


Gly 


Val 










725 










730 










735 




Asn 


Tyr 


Ala 


Thr 


Val 


Thr 


Pro 


Gly Arg 


Thr 


Phe 


Phe 


Ala 


Gin 


He 


Ala 








7*J0 










745 










750 






He 


Arg 


Phe 


Asn 


Asn 




























755 




























{2) 


IIIFORI'IATIOIJ 


FOR 


SEQ 


ID nO:420 

















10 



15 



20 



25 



30 



35 



(i) SEQUEtlCE CHARACTERISTICS: 
40 (A) LENGTH: 331 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIMAL SOURCE: 

(A) ORGAMISM: Porphyromonas gingival is 



(ix) FEATURE: 

(A) MAI'iE/KEY: mi sc_ feature 

(B) LOCATIOM 1 . . . 331 

55 (xi) SEQUEMCE DESCRI PTIOl 1 : SEQ ID 110:420 

Met Thr Asp Asn Lys Gin Arg Asn He Val Phe Pro Ala Phe Leu Leu 
15 10 15 

Leu Leu Gly Val He Ala Val Val Thr He Val Gly Phe Phe Met Leu 
60 2 0 25 30 

Arg Pro Ala Glu Glu He He Gin Gly Gin He Glu Val Thr Glu Tyr 

35 40 45 

Arg Val Ser Ser Lys Val Pro Gly Arg He Lys Glu Leu Arg Val Ser 
50 55 60 

65 Glu Gly Gin Gin Val Gin Ala Gly Asp Thr Leu Ala Val He Glu Ala 

65 70 75 80 

Pro Asp Val Ala Ala Lys Met Glu Gin Ala Lys Ala Ala Glu Ala Ala 

85 90 95 

Ala Gin Ala Gin Asn Ala Lys Ala Leu Lys Gly Ala Arg Ser Glu Gin 
70 100 105 110 

He Gin Ala Ala Tyr Glu Met Trp Gin Lys Ala Gin Ala Gly Val Ala 

115 120 125 

He Ala Thr Lys Thr His Gin Arg Val Gin Asn Leu Tyr Asp Gin Gly 
130 135 1^0 

75 Val Val Pro Ala Gin Lys Leu Asp Glu Ala Thr Ala Gin Arg Asp Ala 
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BNSDOCID: <WO 9929870A1_L> 
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35 



50 



55 



60 



65 



70 



75 



1^5 










150 










155 










160 


Ala 


He 


Ala 


Thr 


Gin 


Lys 


Ala 


Ala 


Glu 


Ala 


Gin 


Tyr 


Asn 


Met 


Ala 


Arg 










165 










170 










175 




Asn 


Gly 


Ala 


Glu 


Arg 


Glu 


Asp 


Lys 


Leu 


Ala 


Ala 


Ser 


Ala 


Leu 


Val 


Asp 








180 








185 










ISO 






Arg 


Ala 


Arg 


Gly 


Ala 


Val 


Ala 


Glu 


Val 


Glu 


Ser 


Tyr 


He 


Asn 


Glu 


Thr 






1 ?5 










200 










205 








Tyr 


Leu 


He 


Al a 


Pro 


Arg 


Ala 


Gly 


Glu 


Val 


Ser 


Glu 


He 


Phe 


Pro 


Lys 




210 










215 










220 










Ala 


Gly 


Glu 


Leu 


Val 


Gly 


Thr 


Gly 


Ala 


Pro 


He 


Met 


Asn 


He 


Ala 


Glu 


2 25 










230 










235 












Met 


Gly 


J\ep 


Met 


Trp 


Ala 


Ser 


Phe 


Ala 


Val 


Arg 


Glu 


Asp 


Phe 


Leu 


Ser 










2Ab 










250 










255 




Ser 


Met 


Thr 


(let 


Gly 


Ala 


Val 


Leu 


Glu 


Thr 


Val 


Val 


Pro 


Ala 


Leu 


Asn 








2 60 










265 










27 0 






Glu 


Glu 


Ly^ 


Vr^l 


Arg 


Phe 


Lys 


He 


Thr 


Phe 


He 


Lys 


Asn 


Met 


Gly 


Thr 






275 










280 










285 








Tyr 


Ala 


Ala 


Trp 


Lys 


Ala 


Thr 


Lys 


Thr 


Thr 


Gly 


Gin 


Tyr 


Asp 


Leu 


Lys 




2 90 










295 










300 










Thr 


Phe 


Gl u 


Val 


Ly^ 


Ala 


Thr 


Leu 


Ala 


Asp 


Ly^ 


Asp 


Lys 


Ala 


Gin 


Lys 


305 










310 










315 










320 


Leu 


Arg 


Pro 


Gly 


Met 


Ser 


Val 


He 


He 


Arg 


Ly^ 













10 



15 



20 



325 330 

25 (2) IIIE'ORMATIOM FOR SEQ ID HO: 4 21 

(i) SPJQUFJiCE: CHARACTERISTICS: 

(A) LENGTH: 2 67 amino acids 
(E) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



( V i ) OR I G 1 1 lAL SOURCE : 

(A) ORGANISM: Porphyromona s gingival is 



(ix) FEATURE: 
40 (A) MA! 'IE /KEY: mi sc_ feature 

(B) LOCATIOM 1 . . .267 



45 





(xi) 


1 SEQUEIICE DESCRIPTIOM: ; 


SEQ : 


ID HO: ^21 










Met 


Arg 


He 


Val 


Ser 


Asn 


Phe 


Leu 


Phe 


Val 


Ser 


Phe 


Ser 


Val 


Leu 


Leu 


1 








5 










10 










15 




Phe 


Ala 


Ser 


n-.-c- 
- J ^ 

2 0 


Arg 


Ser 


Gin 


Arg 


Glu 

25 


Lys 


Val 


Val 


Tyr 


Leu 
30 


Gin 


Asp 


li- 


Gin 


Thr 


Pile 


Asn 


Arg 


Glu 


He 


He 


Ala 


Lys 


Pro 


Tyr 


Asp 


Va 1 


Lys 






35 










AQ 










45 








ne 


Gl u 


Lys 


Asp 


Asp 


Val 


Leu 


Asn 


He 


Leu 


Val 


Ser 


Ser 


Arg 


Asp 


Pro 




50 










55 










60 










Glu 


Leu 


Ser 


Thr 


Pro 


Tyr 


Asn 


Gin 


Val 


Leu 


Thr 


Thr 


Arg 


Ala 


Leu 


Ala 


65 










70 










75 










SO 


Arg 


Asn 


Gly 


Tyr 


Gly 


Thr 


Asn 


Ser 


Asn 


Glu 


Gly 


Phe 


Leu 


Val 


Asp 


Ser 










85 










90 










95 




Li'S 


Gly 


Tyr 


He 


Asn 


Tyr 


Pro 


He 


Leu 


Gly 


Gin 


He 


Tyr 


Val 


Glu 


Gly 








100 










105 










HO 






Leu 


Thr 


Arg 


Thr 


Glu 


Leu 


Gl u 


Lys 


Glu 


He 


Gin 


Lys 


Arg 


He 


He 


Ser 






115 










120 










125 








Ser 


Gly 


Phe 


He 


Lys 


Asp 


Pro 


Thr 


Val 


Thr 


Val 


Gin 


Leu 


Gin 


Asn 


Phe 




130 










135 










140 










Lys 


Va i 


Ser 


Val 


Leu 


Gly 


Glu 


Val 


Asn 


His 


Pro 


Gly 


Ser 


Met 


Ser 


Val 


1.15 










150 










155 










160 


Lys 


Gly 


Glu 


Arg 


He 


Thr 


Leu 


Leu 


Glu 


Ala 


He 


Gly 


Met 


Ala 


Gly 


Asp 










165 










170 










175 




Leu 


Thr 


He 


Tyr 


Gly 


Arg 


Arg 


Asp 


Arg 


Val 


Phe 


Val 


He 


Arg 


Glu 


Thr 








180 










185 










190 






Asp 


Gly 


His 


Arg 


Glu 


Val 


Phe 


Gin 


Thr 


Asp 


Leu 


Arg 


Lys 


Ala 


Asp 


Leu 






195 










200 










205 








Leu 


Ala 


Ser 


Pro 


Val 


Tyr 


Tyr 


Leu 


His 


Gin 


Asn 


Asp 


Val 


He 


Tyr 


Val 




210 










215 










220 










Glu 


Pro 


Asn 


A-p 


Lys 


Lys 


Thr 


Gin 


Met 


Ser 


Glu 


He 


Asn 


Gin 


Asn 


As n 


225 










230 










235 












Asn 


Val 


Asn 


Val 


Trp 


Leu 


Ser 


Val 


Thr 


Ser 


Tl^r 


Leu 


Val 


Ser 


He 


Ser 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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245 250 
Thr Leu Thr lie Thr He He Asp hys Thr hys 
260 265 



255 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



(2) lUFORMATIOIJ POR SEQ ID tJO:4 22 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 

(ii) i'lOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIIIAL SOURCE: 

{A) ORGANISM: Porphyromonas gingiva lis 

(ix) FEATURE: 

(A) MAI IE/ KEY: misc_feature 
{B) LOCATION 1 . . , 569 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO:'J22 



1 let 


Ly^ 


Ly.- 


Thr 


Asn 


Leu 


Phe 


Leu 


Ser 


Leu 


Leu 


Val 


He 


Phe 


He 


Thr 


1 


















10 










15 




Gly 


Ser 


Fhe 


net 


Thr 


Ala 


■"J 


Ala 


Gin 


Lys 


Ser 


Lys 


Thr 


Asn 


Lys 


Leu 


















2 5 










30 




Thr 


Glu 


Glu 


Ai?p 


Arg 


Ser 


Arg 


Asn 


Glu 


Tyr 


Val 


Gin 


Ser 


Met 


Asp 


Val 






35 










4 0 










45 








Leu 


Ser 


Asn 


He 


He 


Gly 


Asn 


Val 


Arg 


Leu 


Tyr 


Phe 


Val 


Asp 


Thr 


He 




50 










55 










60 










Ser 


He 


hys 


His 


r-iet 


Thr 


Arg 


Arg 


Gly 


He 


Asp 


Ala 


Met 


Leu 


Gly 


Gly 


65 










70 










75 










80 


Leu 


A5p 


Pro 


Tyr 


Thr 


Glu 


Tyr 


He 


Pro 


Tyr 


Glu 


Glu 


Met 


Asp 


Glu 


Leu 










85 










90 










95 




Lys 


Leu 


Ilet 


Thr 


Thr 


Gly 


Glu 


Tyr 


Ala 


Gly 


Val 


Gly 


Ala 


He 


He 


Ser 








100 










105 










110 






Gin 


Arg 


Pro 


Asp 


Ser 


Ala 


Val 


He 


He 


Gin 


Arg 


Pro 


I4et 


Glu 


Gly 


Met 






115 










120 










125 








Pro 


Ala 


A5p 


Glu 


Ala 


Gly 


Leu 


He 


Ala 


Gly 


Asp 


Arg 


He 


Leu 


Tlir 


He 




130 










135 










140 










Asp 


Gly 


Lys 


Asp 


Phe 


Arg 


Lys 


Ser 


Thr 


Thr 


Pro 


Lys 


Val 


Ser 


Gin 


Ala 


145 










150 










155 










160 


Leu 


Lys 


Gly 


He 


Ala 


Gly 


Thr 


Val 


Ala 


Lys 


Val 


Thr 


Val 


Met 


Arg 


Tyr 










165 










170 










175 




Gly 


Glu 


Thr 


Ly^ 


Pro 


Arg 


Thr 


Phe 


Ser 


Val 


Lys 


Arg 


Gin 


Lys 


Val 


He 








180 










185 










190 






Met 


Asn 


Ser 


Val 


Thr 


Tyr 


Ser 


Gly 


Met 


Leu 


Asp 


Gly 


Ser 


He 


Gly 


Tyr 






195 










200 










205 








He 


Arg 


Leu 


Asn 


Asn 


Phe 


Thr 


Asp 


Ly^ 


Ser 


Ala 


Glu 


Glu 


Val 


Arg 


Thr 




210 










215 










220 








Ala 


Leu 


Leu 


Asp 


Leu 


Arg 


Asp 


Lys 


Gin 


Gly 


Ala 


Lys 


Gly 


Leu 


He 


Leu 


225 










230 










235 










240 


Asp 


Leu 


Arg 


Gly 


Asn 


Gly Gly 


Gly 


Leu 


Met 


Gin 


Ala 


Ala 


He 


Glu 


He 










245 










250 










255 




Val 


Asn 


Leu 


Phe 


Va 1 


Pro 


Lys 


Gly 


Lys 


Glu 


Va 1 


Val 


Thr 


Thr 


Lys 


Gly 








260 










265 










270 




Arg 


He 


Ala 


Glu 


Ser 


Ala 


Ser 


Val 


Phe 


Arg 


Thr 


Leu 


Thr 


Glu 


Pro 


He 






275 










2eo 










285 








Asp 


Thr 


hys 


Leu 


Pro 


He 


Val 


Val 


Leu 


He 


Asp 


Gly 


Gin 


Ser 


Ala 


Ser 




290 










295 










300 










Ser 


Ser 


Glu 


He 


Val 


Ala 


Gly 


Ala 


Leu 


Gin 


Asp 


Met 


Asp 


Arg 


Ala 


Val 


305 










310 










315 










320 


Leu 


Ilet 


Gly 


Gin 


Lys 


Ser 


Tyr 


Gly 


Lys 


Gly 


Leu 


Val 


Gin 


Thr 


Thr 


Arg 










325 










330 










335 




Gin 


Leu 


Pro 




Asn 


Gly 


Val 


He 


Lys 


Leu 


Thr 


Thr 


Ala 


Lys 


Tyr 


Tyr 








340 










345 










350 




He 


Pro 


Ser 


Gly 


Arg 


Cys 


He 


Gin 


Arg 


Leu 


Asp 


Tyr 


Ser 


Arg 


Thr 


Asn 






355 










350 










365 








Arg 


Thr 


Gly 


Met 


Ala 


Thr 


Ala 


He 


Pro 


Asp 


Ser 


Leu 


His 


Lys 


He 


Phe 




370 










375 










380 










Tyr 


Thr 


Ala 


Ala 


Gly 


Arg 


Arg 


Val 


Glu 


Asp 


Ala 


Gly 


Gly 


He 


Leu 


Pro 


385 










390 










395 










400 


Asp 


He 


Glu 


Val 


Lys 


Gin 


Asp 


Thr 


Ala 
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25 (i) SEQUEMCE CHARACTERISTICS: 
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Ser 


His 


Gin 


Ala 


Asn 


Val 




450 










455 










4 60 










Asn 


Gly 


Phe 


Ala 


Gin 


He 


Thr 


Pro 


He 


Lys 


Gly 


Leu 


Thr 


Leu 


Lys 


Ala 


465 










470 










47S 










4 80 


Gin 


Ala 


Gly 


Val 


Ar-p 


He 


Thr 


Asn 


Thr 


Arg 


Thr 


Ser 


Ser 


Lys 


Arg 


Met 










4 85 










4 V'J 










4 95 




Puo 


Asn 


Asn 


Pro 


Tyr 


Asp 


Ser 


Thr 


Pro 


Leu 


Giy 


Glu 


Arg 


Arg 


Glu 


Arg 








500 










505 










510 






Ala 


Tyr 


Arg 


Asp 


Val 


Ser 


Lys 


Ser 


Phe 


Thr 


Asn 


Thr 


Ala 


Glu 


Tyr 


Lys 






515 










520 










525 








Phe 


Ser 


He 


Asp 


Glu 


Lys 


His 


Asp 


Leu 


Thr 


Ala 


Leu 


Met 


Gly 


His 


Glu 




530 










535 










540 










'I'yr 


lie 


Glu 


Tyr 


Glu 


Gly 


Asp 


Val 


He 


Gly 


Ala 


Ser 


Ser 


Lys 


'31y 


Phe 


5 45 










550 










555 










560 


Glu 


^er 


Asp 


Lys 


Leu 


Met 


Leu 


Leu 


Ser 


Gin 


Gly 


Lys 


Thr 


Gly 


Asn 


Ser 










565 










57 0 










575 




Leu 


Ser 


Leu 


Pro 


Glu 


His 


Arg 


Val 


Ala 


Glu 


Tyr 


Ala 


Tyr 


Leu 


Ser 


Phe 








580 










585 








590 






Phe 


Ser 


Arg 


Phe 


Asn 


Tyr 


Gly 


Phe 


Asp 


Lys 


Trp 


Met 


Tyr 


He 


Asp 


Phe 






595 










600 










605 








Sen 


Val 


Arg 


Asn 


Asp 


Gin 


Ser 


Ser 


Arg 


Phe 


Gly 


Ser 


Asn 


Asn 


Arg 


Ser 




610 










615 










620 










Ala 


Trp 


Phe 


Tyr 


Ser 


Val 


Gly 


Gly 


Met 


Phe 


Asp 


He 


Tyr 


Asn 


Lys 


Phe 


625 










630 










635 










640 


lie 


Gin 


Glu 


Ser 


Asn 


Trp 


Leu 


Ser 


Asp 


Leu 


Arg 


Leu 


Lys 


Met 


Ser 


Tyr 










645 










650 










655 




Gly 


Thr 


Thr 


Gly 


Asn 


Ser 


Glu 


He 


Gly 


Asn 


Tyr 


Asn 


His 


Gin 


Ala 


Leu 








660 










665 










670 






Val 


Tlir 


Val 


Asn 


Asn 


Tyr 


Thr 


Glu 


Asp 


Ala 


Met 


Gly 


Leu 


Ser 


He 


Ser 






67 5 










68 0 










685 








Thr 


Ala 


Gly 


Asn 


Pro 


Asp 


Leu 


Ser 


Trp 


Glu 


Lys 


Gin 


Ser 


Gin 


Phe 


Asn 




690 










695 








7 00 










Phe 


Gly 


Leu 


Ala 


Ala 


Gly 


Ala 


Phe 


Asn 


Asn 


Arg 


Leu 


Ser 


Ala 


Glu 


Val 


705 










710 










715 










720 


Asp 


Phe 


Tyr 


Val 


Arg 


Thr 


Thr 


Asn 


Asp 


Met 


Leu 


He 


At-p 


Val 


Pro 


Met 










725 










7 30 










735 




Pro 


Tyr 


lie 


Ser 


Gly 


Phe 


Phe 


Ser 


Gin 


Tyr 


Gin 


Asn 


Val 


Gly 


Ser 


llet 








74 0 










745 










750 






Lys- 


Af n 


Thr 


Gly 


Val 


Asp 


Leu 


Ser 


Leu 


Lys 


Gly 


Thr 


He 


Tyr 


Gin 


Asn 






755 










760 










765 








Lys 


Asp 


Trp 


A^n 


Val 


Tyr 


Ala 


Ser 


Ala 


Asn 


Phe 


Asn 


Tyr 


Asn 


Arg 


Gin 




77 0 










775 










7 80 










Glu 


lie 


Th r 


Lys 


Leu 


Phe 


Phe 


Gly 


Leu 


Asn 


Lys 


Tyr 


I let 


Leu 


Pro 


Asn 


7 85 










7 90 










7 95 










800 


Th c 


J- i' 


Thr 


He 




Glu 


He 


Gly 




Pre 




Ser 


Ph^ 




M*^ t 


Ala 










805 










810 










815 




Glu 


Tyr 


Ala 


Gly 


He 


Asp 


Lys 


Lys 


Thr 


Gly 


Lys 


Gin 


Leu 


Trp 


Tyr 


Val 








820 










825 










830 






Pro 


Gly 


Gin 


Val 


Asp 


Ala 


Asp 


Gly 


Asn 


Ly^ 


Val 


Thr 


Thr 


Ser 


Gin 


Tyr 






835 










84 0 










845 








Ser 


Ala 


Asp 


Leu 


Glu 


Thr 


Arg 


He 


Asp 


Lys 


Ser 


Val 


Thr 


Pro 


Pro 


He 




850 










855 










860 










Thr 


Gly 


Gly 


Phe 


Ser 


Leu 


Gly 


Ala 


Ser 


Trp 


Lys 


Gly 


Leu 


Ser 


Leu 


Asp 


865 










870 










875 










880 


Ala 


A^p 


Phe 


Ala 


Tyr 


He 


Val 


Gly 


Lys 


Trp 


Met 


He 


Asn 


Asn 


Asp 


Arg 










885 










8 90 










8 95 




Tyr 


Phe 


Thr 


Glu 


Asn 


Ala 


Gly Gly 


Leu 


Met 


Gin 


Leu 


Asn 


Lys 


Asp 


Lys 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: ^WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



363/490 



25 



30 









900 










905 










910 






net 


Leu 


Leu 


Asn 


Ala 


Trp 


Thr 


Glu 


Asp 


Asn 


Lys 


Glu 


Thr 


Asp 


Val 


Pro 






915 










920 










925 








Ly5 


Leu 


Gly 


Gin 


Ser 


Pro 


Gin 


Phe 


Asp 


Thr 


His 


Leu 


Leu 


Glu 


Asn 


Ala 




930 










935 










940 










Ser 


Phe 


Leu 


Arg 


Leu 


Lys 


Asn 


Leu 


Lys 


Leu 


Thr 


Tyr 


Val 


Leu 


Pro 


Asn 


9Ab 










950 










955 










960 


Ser 


Leu 


Phe 


Ala 


Gly 


Gin 


Asn 


Val 


He 


Gly 


Gly 


Ala 


Arcj 


Val 


Tyr 


Leu 










965 










970 










975 




Met 


Ala 


Arg 


Asn 


Leu 


Leu 


Thr 


Val 


Thr 


Lys 


Tyr 


Lys 


Gly 


Phe 


Asp 


Pro 








900 










985 










990 






Glu 


Ala 


Gly 


Gly 


Asn 


Val 


Gly 


Lys 


Asn 


Gin 


Tyr 


Pro 


Asn 


Ser 


Lys 


Gin 






995 










1000 








1005 






Tyr 


Val 


Ala 


Gly 


He 


Gin 


Leu 


Ser 


Phe 

















10 



15 1010 1015 

(2) IIIPORtiATIOM FOR SEQ ID UO: 4 25 

(i) SEQUEIICE CHARACTERISTICS: 
20 (A) LEtlGTH: 1014 amino acids 

(BJ TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

( V i ) OKI G I HAL SOU RCE : 

(A) ORGAMI3M: Porphyromonas gingivalis 

(i;:) FEATURE: 

(A) HAME/KEY: misc_feature 

(B) LOCATIOII 1 . . .1014 



35 (xi) SEQUEIICE DESCRIPTIQI I : SEQ ID 110:425 

Met Thr Leu Phe Phe Leu Cys Leu Leu Thr Ser He Gly Trp Ala Met 
15 10 15 

Ala Gin Asn Arg Thr Val Lys Gly Thr Val He Ser Ser Glu Asp Asn 
40 20 25 30 

Glu Pro Leu He Gly Ala Asn Val Val Val Val Gly Asn Thr Thr He 

35 40 45 

Gly Ala Ala Thr Asp Leu Asp Gly Asn Phe Thr Leu Ser Val Pro Ala 
50 55 60 

45 Asn Ala Lys Met Leu Arg Val Ser Tyr Ser Gly Met Thr Thr Lys Glu 
65 70 75 80 

Val Ala He Ala Asn Val Met Lys He Val Leu Asp Pro Asp Ser Lys 

8 5 90 95 

Val Leu Glu Gin Val Val Val Leu Gly Tyr Gly Thr Gly Gin Lys Leu 
50 100 105 110 

Ser Thr Val Ser Gly Ser Val Ala Lys Val Ser Ser Glu Lys Leu Ala 

115 120 125 

Glu Lys Pro Val Ala Asn He Met Asp Ala Leu Gin Gly Gin Val Ala 
130 135 140 

55 Gly Met Gin Val Met Thr Thr Ser Gly Asp Pro Thr Ala Val Ala Ser 
145 150 155 160 

Val Glu He His Gly Thr Gly Ser Leu Gly Ala Ser Ser Ala Pro Leu 

165 170 175 

Tyr He Val Asp Gly Met Gin Thr Ser Leu Asp Val Val Ala Thr Met 
60 180 185 190 

Asn Pro Asn Asp Phe Glu Ser f-let Ser Val Leu Lys Asp Ala Ser Ala 

195 200 205 

Thr Ser He Tyr Gly Ala Arg Ala Ala Asn Gly Val Val Phe He Gin 
210 215 220 

65 Thr Lys Lys Gly Lys Met Ser Glu Arg GLy Arg He Thr Phe Asn Ala 
225 230 235 240 

Ser Tyr Gly He Ser Gin He Leu Asn Thr l.ys Pro Leu Asp Asn Met 
24 5 250 2 55 

net Thr Glv Aso 

70 



75 



net 


Thr 


Gly 


Asp 


Glu 


Leu 


Leu 


Asp 


Phe 


Gin 


Val 


Lys 


Ala 


Gly 


Phe 








260 










265 










270 




Gly 


Asn 


Asn 


Gin 


Thr 


Val 


Gin 


Lys 


Val 


Lys 


Asp 


Met 


He 


Leu 


Ala 






275 










280 










285 






Ala 


Glu 


Asp 


Leu 


Tyr 


Gl y 


Asn 


Tyr 


Asp 


Ser 


Leu 


Lys 


Asp 


Glu 


Tyr 




2 90 










295 










300 








Lys 


Thr 


Leu 


Phe 


Pro 


Val 


Asp 


Phe 


Asn 


His 


Asp 


Ala 


Asp 


Trp 


Leu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



305 










310 










315 










320 


Ala 


Leu 


Phe 


hys 


Thr 


Ala 


Pro 


Thr 


Ser 


Gin 


Gly 


Asp 


He 


Ser 


Phe 


Ser 










325 










330 










335 




Gly 


Gly 


Ser 


Gin 


Gly 


Thr 


Ser 


Tyr 


Tyr 


Ala 


Ser 


He 


Gly 


Tyr 


Phe 


Asp 








34 0 










34 5 










350 






Gin 


Gl u 


Gly 


Met 


Ala 


Arg 


Glu 


Pro 


Ala 


Asn 


Phe 


Lys 


Arg 


Tyr 


Ser 


Gly 






355 










360 










365 








Arg 


Leu 


Asn 


Phe 


Glu 


Ser 


Arg 


He 


Asn 


Glu 


Trp 


Leu 


Lys 


Val 


Gly 


Ala 




37 0 










37 5 










380 










Asn 


Leu 


Ser 


Gly 


Ala 


He 


Ala 


Asn 


Arg 


Arg 


Ser 


Ala 


Asp 


Tyr 


Phe 


Gly 


385 










390 










395 










400 


Ly^ 


Tyr 


Tyr 


Met 


Gly 


Ser 


Gly 


Thr 


Phe 


Gly 


Val 


Leu 


Thr 


Met 


Pro 


Arg 










405 










410 










415 




Tyr 


Tyr 


Asn 


Pro 


Phe 


Asp 


Val 


Asn 


Gly 


Asp 


Leu 


Ala 


Asp 


Val 


Tyr 


Tyr 








4 20 










425 










430 






llet 


Tyr 


Gly 


Ala 


Thr 


Arg 


Pro 


Ser 


Met 


Thr 


Glu 


Pro 


Tyr 


Phe 


Ala 


Lys 






435 










440 










445 








Met 


Arg 


Pro 


Phe 


Ser 


Ser 


Glu 


Ser 


His 


Gin 


Ala 


Asn 


Val 


Asn 


Gly 


Phe 




4 50 










455 










460 










Ala 


Gin 


lie 


Thr 


Pro 


lie 


Lys 


Gly 


Leu 


Thr 


Leu 


Lys 


Ala 


Gin 


Ala 


Gly 


4 65 










4 70 










475 










4 80 


Val 


Asp 


lie 


The 


Asn 


Thr 


Arg 


Thr 


Ser 


Ser 


Lys 


Arg 


!'tet 


Pro 


Asn 


Asn 










485 










4 90 










4 95 




Pro 


Tyr 


Asp 


Ser 


Thr 


Pro 


Leu 


Gly 


Glu 


Arg 


Arg 


Glu 


Arg 


Ala 


Tyr 


Arg 








500 










505 










510 






Asp 


Va 1 


Ser 


Lys 


Ser 


Phe 


Thr 


Asn 


Thr 


Ala 


Glu 


Tyr 


Lys 


Phe 


Ser 


He 






515 










520 










5 5 








A"p 


Glu 


Lys 


His 


Asp 


Leu 


Thr 


Ala 


Leu 


Met 


Gly 


His 


Glu 


Tyr 


He 


Gl u 




530 










535 










54 0 










Tyr 


Glu 


Gly 


Asp 


Val 


He 


Gly 


Ala 


Ser 


Ser 


Lys 


Gly 


Phe 


Glu 


Ser 


Asp 


545 










550 










555 










560 


Lys 


Leu 


[let 


Leu 


Leu 


Ser 


Gin 


Gly 


Lys 


Thr 


Gly 


Asn 


Ser 


Leu 


Ser 


Leu 










565 










570 










575 




Pro 


Glu 


His 


Arg 


Val 


Ala 


Glu 


Tyr 


Ala 


Tyr 


Leu 


Ser 


Phe 


Phe 


Ser 


Arg 








580 










585 










590 






Phe 


Asn 


Tyr 


Gly 


Phe 


Asp 


Lys 


Trp 


Pfet 


Tyr 


He 


Asp 


Phe 


Ser 


Val 


Arg 






595 










600 










605 








Asn 


Asp 


Gin 


Ser 


Ser 


Arg 


Phe 


Gly 


Ser 


Asn 


Asn 


Arg 


Ser 


Ala 


Trp 


Phe 




610 










615 










620 










Tyr 


Ser 


Val 


Gly 


Gly 


Met 


Phe 


Asp 


He 


Tyr 


Asn 


Lys 


Phe 


He 


Gin 


Glu 


625 










630 










635 










640 


Ser 


Asn 


Trp 


Leu 


Ser 


Asp 


Leu 


Arg 


Leu 


Lys 


Met 


Ser 


Tyr 


Gly 


Thr 


Thr 










645 










650 










655 




Gly 


Asn 


Ser 


Glu 


lie 


Gly 


Asn 


Tyr 


Asn 


His 


Gin 


Ala 


Leu 


Val 


Thr 


Val 








660 










665 










670 






Asri 


Asn 


Tyr 


Thr 


Glu 


Asp 


Ala 


Met 


Gly 


Leu 


Ser 


He 


Ser 


Thr 


Ala 


Gly 






67 5 










680 










685 








Asn 


Pro 


Asp 


Leu 


Ser 


Trp 


Glu 


Lys 


Gin 


Ser 


Gin 


Phe 


Asn 


Phe 


Gly 


Leu 




690 










695 










7 00 










Ala 


Ala 


Gly 


Ala 


Phe 


Asn 


Asn 


Arg 


Leu 


Ser 


Ala 


Glu 


Val 


Asp 


Phe 


Tyr 


705 










710 










715 










720 


Val 


Arg 


Thr 


Thr 


Asn 


Asp 


Met 


Leu 


He 


Asp 


Val 


Pro 


Met 


Pro 


Tyr 


He 










725 










730 










735 




Ser 


Gly 


Phe 


Phe 


Ser 


Gin 


Tyr 


Gin 


Asn 


Val 


Gly 


Ser 


Met 


Lys 


Asn 


Thr 








740 










745 










750 






Gly 


Val 


Asp 


Leu 


Ser 


Leu 


Lys 


Gly 


Thr 


He 


Tyr 


Gin 


Asn 


Lys 


Asp 


Trp 






755 










760 










765 








Asn 


Va 1 


Tyr 


Al a 


Ser 


Ala 


Asn 


Phe 


Asn 


Tyr 


Asn 


Arg 


Gin 


Glu 


He 


Thr 




7 7 0 










77 5 










780 










Lys 


Leu 


Phe 


Phe 


Gly 


Leu 


Asn 


Lys 


Tyr 


Me t 


Leu 


Pro 


Asn 


Thr 


Gly 


Thr 


7 85 










7 90 










795 










SOO 


lie 


Trp 


Glu 


lie 


Gly 


Tyr 


Pro 


Asn 


Ser 


Phe 


Tyr 


Met 


Ala 


Glu 


Tyr 


Ala 










805 










810 










815 




Gly 


lie 


Asp 


Lys 


Lys 


Thr 


Gly 


Lys 


Gin 


Leu 


Trp 


Tyr 


Val 


Pro 


Gly 


Gin 








820 










825 










830 






Val 


Asp 


Ala 


Asp 


Gly 


Asn 


Lys 


Val 


Thr 


Thr 


Ser 


Gin 


Tyr 


Ser 


Ala 


Asp 






835 










840 










845 








Leu 


Glu 


Thr 


Arg 


lie 


Asp 


Lys 


Ser 


Val 


Thr 


Pro 


Pro 


He 


Thr 


Gly 


Gly 




850 










855 










860 










Phe 


Ser 


Leu 


Gly 


Ala 


Ser 


Trp 


Lys 


Gly 


Leu 


Ser 


Leu 


Asp 


Ala 


Asp 


Phe 


665 










87 0 










875 










880 


Ala 


Tyr 


Xle 


Val 


Gly 


Lys 


Trp 


Met 


He 


Asn 


Asn 


Asp 


Arg 


Tyr 


Phe 


Thr 










885 










8 90 










895 




Glu 


Asn 


Ala 


Gly 


Gly 


Leu 


Met 


Gin 


Leu 


Asn 


Lys 


Asp 


Lys 


Met 


Leu 


Leu 








900 










905 










910 
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wo 99/29870 PCT/AU98/01023 

365/490 



10 



15 



35 



40 



45 



50 



55 



60 



65 



70 



75 





Ala 


Trp 


Thr 


Glu 


Asp 


Asn 


Lys 


Glu 


Thr 


Asp 


Val 


Pro 


Lys 


Leu 


Gly 






915 










920 










925 








Gin 


Ser 


Pro 


Gin 


Phe 


Asp 


Thr 


His 


Leu 


Leu 


Glu 


Asn 


Ala 


Ser 


Phe 


Leu 




930 










935 










940 










Arg 


Leu 


Lys 


A3n 


Leu 


Lys 


Leu 


Thr 


Tyr 


Val 


Leu 


Pro 


Asn 


Ser 


Leu 


Phe 


945 










95 0 










955 










960 


Ala 


Gly 


Gin 


Asn 


Val 


lie 


Gly 


Gly 


Ala 


Arg 


Val 


Tyr 


Leu 


Met 


Ala 


Arg 










965 










97 0 










97 5 




Asn 


Leu 


Leu 


Tlir 


Val 


Thir 


Lys 


Tyr 


Lys 


Gly 


Phe 


Asp 


Pro 


Gl u 


Ala 


Gly 








980 










985 










99 0 






Gly 


Asn 


1 


Gly 


Lys 


Asn 


Gin 


Tyr 


Pro 


Asn 


Ser 


Lys 


Gin 


Tyr 


Val 


Ala 






9 95 










1000 








1005 






Gly 


lie 


Gin 


Leu 


Ser 


Phe 























1010 

(2) IIIFORMATIOtJ FOR SEQ ID IIO:4 26 



(i) SEOUEIICE CHARACTERISTICS: 

(A} LENGTH: 821 amino acids 
20 iB) TVPE: amino acid 

(D) TOPOLOGY: linear 

(ii) liOLECULE TYPE: protein 

25 (iii) HYPOTHETICAL: YES 

( V i ) OR I G 1 1 1 AL S OU RC E : 

(A) ORGAHISI'I: Pcrphyromonas gingivalis 

30 {i;-:) FEATURE: 

(A) MAT IE/KEY: mi sc_f ea ture 
( E} LOCATIOII 1 ... 821 



(xi) SEQUENCE DESCRI PTIOl J : SEQ ID ]]0:42G 



Ilet 


Lys 


Lys 


Lys 


Asn 


Phe 


Leu 


Leu 


Leu 


Gly 


He 


Phe 


Val 


Ala 


Leu 


Leu 


1 


















10 










15 




Thr 


Plie 


lie 


Gly 


Ser 


Met 


Gin 


Ala 


Gin 


Gin 


Ala 


Lys 


Asp 


Tyr 


Phe 


Asn 








2 0 










25 










30 






Phe 


Arp 


Glu 


Arg 


Gly 


Glu 


Ala 


Tyr 


Phe 


Ser 


Phe 


Lys 


Val 


Pro 


Asp 


Arg 






35 










40 










45 








Ala 


Val 


Leu 


Gin 


Glu 


Leu 


Ala 


Leu 


He 


Met 


Ser 


He 


Asp 


Glu 


Phe 


Asp 




50 










55 










60 










Pro 


Val 


Thr 


Asn 


Glu 


Ala 


He 


Ala 


Tyr 


Ala 


Ser 


Glu 


Glu 


Gl u 


Phe 


Glu 


65 










70 










75 










eo 


Ala 


Phe 


Leu 


Arg 


Tyr 


Gly 


Leu 


Lys 


Pro 


Thr 


Phe 


Leu 


Thr 


Fro 


Pro 


Ser 










85 










90 










95 




Met 


Gin 


Arg 


AID 


Val 


Glu 


Met 


Phe 


Asp 


Tyr 


Arg 


Ser 




Glu 


Lys 


Tyr 








100 










105 










110 






Glu 


Trp 


Asn 


Ala 


Tyr 


Pro 


Thr 


Tyr 


Glu 


Ala 


Tyr 


lie 


Ser 


Ilet 


Met 


Glu 






115 










120 










125 








Glu 


Phe 


Gin 


Thr 


Lys 


Tyr 


Pro 


Ser 


Leu 




Thr 


Thr 


Ser 


Val 


He 


Gly 




13 0 










135 










140 










Ser 


Val 


Lys 


Asp 


Arg 


Lys 


Leu 


Met 


He 


Cys 


Lys 


Leu 


Thr 


Ser 


Ser 


145 










150 










155 










160 


Ala 


Asn 


Thr 


Gly 


Lys 


Lys 


Pro 


Arg 


Val 


Leu 


Tyr 


Thr 


Ser 


Thr 


Met 


His 










165 










170 










175 




Gly 


Asp 


Glu 


Thr 


Thr 


Gly 


Tyr 


Val 


Val 


Leu 


Leu 


Arg 


Leu 


He 


Asp 


His 








180 










185 










190 






Leu 


Leu 


Ser 


Asn 


Tyr 


Glu 


Ser 


Asp 


Pro 


Arg 


He 


Lys 


Asn 


He 


Leu 


Asp 






1 95 










2 00 










205 








Lys 


Thr 


Glu 


Val 


Trp 


He 


Cys 


Pro 


Leu 


Thr 


Asn 


Pro 


Asp 


Gly 


Ala 


Tyr 




210 










215 










220 










Arg 


Ala 


Gly 


Asn 


His 


Thr 


Val 


Gin 


Gly 


Ala 


Thr 


Arg 


Tyr 


Asn 


Ala 


Asn 


';* tr 










230 










235 






24 0 


Asn 




Asp 


Leu 


Asn 


Arg 


Asn 


Phe 


Lys 


Asp 


Asp 


Val 


Ala 


Gly 


Asp 


His 










245 










2 50 














Pro 


Asp 


Gly 


Lys 


Pro 


Trp 


Gin 


Pro 


Gl u 


Ala 


Thr 


Ala 


Phe 


Met 


Asp 


Leu 








2 60 










265 










27 0 






Glu 


Gly 


Asn 


Thr 


Ser 


Phe 


Val 


Leu 


Gly 


Ala 


Asn 


He 


His 


Gly 


Gly 


Thr 






275 










2 80 










285 








Glu 


Val 


Val 


Asn 


Tyr 


Pro 


Trp 


Asp 


Asn 


Lys 


Lys 


Glu 


Arg 


His 


Ala 


Asp 




2 90 










295 










300 










Asp 


Glu 


Trp 


Tyr 


Lys 


Leu 


He 


Ser 


Arg 


Asn 


Tyr 


Ala 


Ala 


Ala 


Cys 


Gin 


305 










310 










315 










320 
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Ser He Ser Ala Ser Tyr Met Thr Ser Glu Thr Asn Ser Gly He He 

325 330 335 

Asn Gly Ser Asrp Trp Tyr Val He Arg Gly Ser Arg Gin Asp Asn Ala 
340 345 350 

5 Asn Tyr Phe His Arg Leu Arg Glu He Thr Leu Glu He Ser Asn Thr 
355 360 365 

Lys Leu Y«l Pro Ala Ser Gin Leu Pro Ly5 Tyr Trp Asn Leu Asn Lys 

370 375 380 

Glu Ser Leu Leu Ala Leu He Glu Glu Ser Leu Tyr Gly He His Gly 
10 335 390 395 400 

Thr Val Thr Ser Ala Ala Asn Gly Gin Pro Leu Lys Cys Gin He Leu 

4 05 410 415 

He Glu Asn His Asp Lys Arg Asn Ser Asp Val Tyr Ser Asp Ala Thr 
420 425 430 

15 Thr Gly Tyr Tyr Val Arg Pro He Lys Ala Gly Thr Tyr Thr Val Lys 
435 440 445 

Tyr Lys 7\la Glu Gly Tyr Pro Glu Ala Thr Arg Thr He Thr He Lys 

450 455 460 

Asp Lys Glu Thr Val He Met Asp He Ala Leu Gly Asn Ser Val Pro 
20 4 65 470 47 5 480 

Leu Pro Val Pro Asp Phe Thr Ala Ser Pro Het Thr He Ser Val Gly 

4 85 4 90 4 95 

Glu Ser Val Gin Phe Gin Asp Gin Thr Thr Asn Asn Pro Thr Asa Trp 
500 505 510 

25 Glu Trp Thr Phe Glu Gly Gly Gin Pro Ala Het Ser Thr Glu Gin Asn 
515 520 525 

Pro Leu Val Ser Tyr Ser His Pro Gly Gin Tyr Asp Val Thr Leu Lys 

530 535 54 0 

Vai Trp Asn Ala Ser Gly Ser Asn Thr He Thr Lys Glu Lys Phe He 
30 545 550 555 560 

Tl:ir Val Asn Ala Val Met Pro Val Ala Glu Phe Val Gly Thr Pro Thr 

565 57 0 57 5 

Glu He Glu Glu Gly Gin Thr Val Ser Phe Gin Asn Gin Ser Thr Asn 
580 585 590 

35 Ala Thr Asn Tyr Val Trp He Phe Asp Gly Gly Thr Pro Aia Thr Ser 
595 600 605 

Glu Asp Glu Asn Pro Thr Val Leu Tyr Ser Lys Ala Gly Gin Tyr Asp 

610 615 620 

Val Thr Leu Lys Ala He Ser Ala Ser Gly Glu Thr Val Lys Thr Lys 
40 625 630 635 640 

Glu Lys Tyr He Thr Val Lys Lys Ala Pro Val Pro Ala Pro Val Ala 

645 650 655 

Asp Phe Glu Gly Thr Pro Arg Lys Val Lys Lys Gly Glu Thr Val Thr 
660 665 67 0 

45 Phe Lys Asp Leu Ser Thr Asn Asn Pro TJir Ser Trp Leu Trp Val Phe 
675 680 685 

Glu Gly Gly Ser Fro Ala Thr Ser Thr Glu Gin Asn Pro Val Val Thr 

6 90 6 95 7 00 

Tyr Asn Glu Thr Gly Lys Tyr Asp Val Gin Leu Thr Ala Thr Asn Glu 
50 7 05 710 715 720 

Gly Gly Ser Asn Val Lys Lys Aia Glu Asp Tyr He Glu Val He Leu 

725 730 735 

Asp Asp Ser Val Glu Asp He Val Ala Gin Thr Gly He Val He Arg 
740 74 5 7 50 

55 Pro Gin Asn Gly Thr Lys Gin He Leu He Glu Ala Asn Ala Ala He 
755 760 765 

Lys Ala He Val Leu Tyr Asp He Asn Gly Arg Val Val Leu Lys Thr 

770 775 780 

Thr Pro Asn Gin Leu Arg Ser Thr Val Asp Leu Ser He Leu Pro Glu 
60 765 790 795 800 

Gly He Tyr Thr He Asn He Lys Thr Glu Lys Ser Aia Arg Thr Glu 

805 BIG 815 

Lys lie His He Gly 
820 



65 



(2) lllE'ORIIATIOn FOR SEQ ID MO: 4 27 



(i) SEQUEUCE CHARACTERISTICS: 

I A) LEMGTH: 7 6 amino acids 
70 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(11) llOLECULE TYPE: protein 

75 (iii) HYPOTHETICAL: YES 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



367/490 



10 



30 



35 



50 



55 



60 



65 



70 



75 



{vi) ORlGIiJAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival if 

ii::) FEATURE: 

(A) IJAIIE/p:Ei : misc_teature 

(B) LOCATION 1 ... 76 

(::i) SEQUENCE DESCRIPTION: SEQ ID NO: 4 27 





Met 


Arg 


Leu 


He 


Lys 


Ala 


Phe 


Leu 


Val 


Gin 


Leu 


Leu 


Leu 


Leu 


Pro 


He 




1 








5 










10 










15 






Phe 


Phe 


Tyr 


Lys 
20 


Arg 


Phe 


lie 


Ser 


Pro 
25 


Leu 


Thr 


Pro 


Pro 


Ser 
30 


Cys 


Arg 


15 


Phe 


Thr 


Pro 
35 


Ser 


Cys 


Ser 


Ser 


Tyr 
40 


Ala 


He 


Glu 


Ala 


Leu 
45 


Arg 


Lys 


Tyr 




Gly 


Pro 
50 


Gly 


Lys 


Gly 


Leu 


Leu 
5 5 


Leu 


Ser 


He 


Lys 


Arg 
60 


He 


Leu 


Arg 


Cys 


20 


His 


Pro 


Trp 


Gly 


Gly 


Ser 


Gly 


Tyr 


Asp 


Pro 


Val 


Pro 










65 










70 










75 













(2) INEORriATION E'OR SEQ ID 1!0:428 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 859 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 



{ii} MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival i£ 



(i::) FEATURE: 

(A) NAME/ KEY; misc_feature 

(B) LOCATION 1 ... 859 

40 (::i) SEQUENCE DESCRIPTION: SEQ ID NO: 428 



45 



Met 


Al 5 


Tyr 


Asp 


Phe 


Thr 


Gin 


Thr 


Phe 


Arg 


Asn 


Ser 


Leu 


Glu 


Tyr 


Ser 


1 








c 










10^ 










15 




Tyr 


Gin 


Glu 


Ala 


Thr 


Arg 


Leu 


Gly 


Val 


Val 


Ala 


Val 


Thr 


Gin 


Asp 


Met 








20 










25 










30 






Leu 


Vz.1 


Leu 


Gly 


He 


He 


Arg 


Asp 


Gly 


Asp 


Asn 


Gly 


Ala 


He 


Asp 


He 






35 










40 










45 








Met 


Arg 

50^ 


His 


Tyr 


Gly 


He 


Asn 


Leu 


Tyr 


Glu 


Leu 


Lys 

60 


Arg 


Leu 


He 


Glu 


Leu 


Glu 


Ala 


He 


Ala 


Glu 


55 
Ser 


Leu 


Pro 


Ala 


Ser 


Pro 


Glu 


Gly 


Ser 


Pro 


65 










70 










75 










80 


He 


Phe 


Thr 


Pro 


Ser 


Ala 


Arg 


Glu 


Ala 


He 


Asp 


Asp 


Ala 


Thr 


Asp 


He 










85 










90 










95 




cys 


Ala 


Asp 


Met 


Glu 


Asp 


Glu 


Ala 


Val 


Ser 


Pro 


Val 


His 


Leu 


Leu 


Leu 








100 










105 










110 






Ser 


He 


Leu 


Asn 


Ser 


Thr 


Gin 


Glu 


Ser 


Leu 


Val 


Gin 


Lys 


He 


Phe 


Met 






115 










120 










125 








Lys 


Gin 


Gly 


He 


Lys 


Tyr 


Asp 


Thr 


He 


Leu 


Ser 


Asp 


Tyr 


Phe 


Gly 


Gin 




130 










135 










1 4 0 










Arg 


Asn 


Pro 


Ser 


Glu 


Gly 


Lys 


Ser 


Pro 


Ser 


Glu 


Met 


Gl u 


He 


Leu 


Asp 


145 










150 










155 










160 


Gly 


Tyr 


Gl n 


Asp 


Asn 


Asp 


Phe 


Asp 


Asp 


Glu 


Glu 


Asp 


Glu 


Ser 


Ser 


Fro 










165 










170 










175 




Pro 


Ser 


Gly 


Asn 


Ser 


Gly 


Thr 


Gly 


Gly 


Gly 


Ser 


Gly 


Asp 


Ala 


Pro 


Glu 








180 










185 










190 






Gin 


A^^n 


Thr 


Gly 


Gly 


Gly 


Asp 


Thr 


Thr 


Thr 


Tlir 


Thr 


Arg 


Ser 


Gly 


Gly 






1?5 










200 










205 








Asp 


Thr 


Pro 


Ala 


Leu 


Asp 


Thr 


Phe 


Gly 


Thr 


Asp 


He 


Thr 


Ala 


Met 


Ala 




210 










215 




















Ala 


Ala 


Gly 


Lys 


Leu 


Asp 


Pro 


Val 


Val 


Gly 


7\rg 


Glu 


Gin 


Glu 


He 


Glu 












230 










235 










24 0 


Arg 


Val 


He 


Gin 


He 


Leu 


Ser 


Arg 


Arg 


Lys 


Lys 


Asn 


Asn 


Pro 


Val 


Leu 










245 










250 










255 




He 


Gly 


Gl u 


Pro 


Gly 


Val 


Gly 


Lys 


Ser 


Ala 


He 


Val 


Glu 


Gly 


Leu 


Ala 








260 










265 










270 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Glu 


Arg 


11 e 


Val 


Asn 


Arg 


Lys 


Val 


Ser 


Arg 


He 


Leu 


Phe 


Asp 


Lys 


Arg 






275 










280 










285 








He 


He 


Ser 


Leu 


Asp 


Leu 


Ala 


Gin 


Met 


Val 


Ala 


Gly 


Thr 


Lys 


Tyr 


Arg 




2 90 










295 










300 










Gly 


Gin 


Phe 


Glu 


Glu 


Arg 


Leu 


Lys 


Aia 


Val 


Leu 


Asp 


Glu 


Leu 


Lys 


Lys 


305 










310 










315 










32 0 


Asn 


Pro 


Gin 


He 


He 


Leu 


Phe 


He 


Asp 


Glu 


He 


His 


Thr 


He 


Val 


Gly 










325 










330 










335 




Ala 


Gly 


Ser 


Ala 


Ala 


Gly 


Ser 


Met 


Asp 


Thr 


Ala 


Asn 


Met 


Leu 


Lys 


Pro 








34 0 










34 5 










350 






Ala 


Leu 


Ala 


Arg 


Gly 


Gin 


Val 


Gin 


Cys 


He 


Gly 


Ala 


Thr 


Thr 


Leu 


Asp 






355 










360 










365 








Glu 


Tyr 


Arg 


Lys 


Asn 


He 


Glu 


Lys 


Asp 


Gly 


Ala 


Leu 


Glu 


Arg 


Arg 


Phe 




37 0 










375 










380 










Gl n 


Lys 


Va 1 


Pro 


He 


Ala 


Pro 


Ser 


Thr 


Ala 


Glu 


Glu 


Thr 


Leu 


Thr 


He 


385 










390 










395 










400 


Leu 


Gin 


Asn 


He 


Lys 


Glu 


Lys 


Tyr 


Glu 


Asp 


Tyr 


His 


Gly 


Val 


Arg 


Tyr 










405 










410 










415 




Thr 


Asp 


Glu 


Ala 


He 


Lys 


Ala 


Ala 


Val 


Glu 


Leu 


Thr 


Asp 


Arg 


Tyr 


Val 








4 20 










425 










4 30 






Ser 


Asp 


Arg 


Phe 


Phe 


Pro 


Asp 


Lys 


Ala 


He 


Asp 


Ala 


Met 


Asp 


Glu 


Ala 






435 










440 










445 








Gly 


Ala 


Ser 


Val 


His 


He 


Thr 


Asn 


Val 


Val 


Ala 


Pro 


Lys 


Glu 


He 


Glu 




450 










455 










4 60 










He 


Leu 


Glu 


Ala 


Glu 


Leu 


Ala 


Ser 


Val 


Arg 


Glu 


Asn 


Lys 


Leu 


Ser 


Ala 


465 










470 










475 










480 


Val 


Lys 


Al a 


Gin 


Asn 


Tyr 


Glu 


Leu 


Ala 


Ala 


Ser 


Phe 


Arg 


Asp 


Gin 


Glu 










485 










4 90 










4 95 




Arg 


Arg 


Thr 


Gin 


Gin 


Gin 


He 


Ala 


Glu 


Glu 


Lys 


Lys 


Lys 


Trp 


Glu 


Glu 








500 










505 










510 






Gin 


Met 


Ser 


Lys 


His 


Arg 


Glu 


Thr 


Val 


Asp 


Glu 


Asn 


Val 


Val 


Ala 


His 






515 










520 










525 








Val 


Val 


Ala 


Leu 


1 let 


Thr 


Gly 


Val 


Pro 


Ala 


Glu 


Arg 


Leu 


Ser 


Thr 


Gly 




530 










535 










540 










Glu 


Gly 


Glu 


Arg 


Leu 


Arg 


Thr 


Met 


Ala 


Asp 


Asp 


Leu 


Lys 


Thr 


Lys 


Val 


545 










550 










555 










560 


Val 


Gly 


Gin 


Asp 


Thr 


Ala 


He 


Glu 


Lys 


Met 


Va 1 


His 


Ala 


He 


Gin 


Arg 










565 










570 










575 


Asn 


Arg 


Leu 


Gly 


Leu 


Arg 


Asn 


Glu 


Lys 


Lys 


Pro 


He 


Gly 


Ser 


Phe 


Leu 








580 










585 










590 






Phe 


Leu 


Gly 


Pro 


Thr 


Gly 


Val 


Gly 


Lys 


Thr 


Tyr 


Leu 


Ala 


Lys 


Ly^ 


Leu 






595 










600 










605 








Ala 


Glu 


Tyr 


Leu 


Phe 


Glu 


Asp 


Glu 


Asn 


Ala 


Met 


He 


Arg 


Val 


Asp 


Met 




610 










615 










620 










Ser 


Glu 


Tyr 


Met 


Glu 


Lys 


Phe 


Ser 


Val 


Ser 


Arg 


Leu 


Val 


Gly 


Ala 


Pro 


625 










630 










635 










64 0 


Pro 


Gly 


Tyr 


Val 


Gly 


Tyr 


Glu 


Glu 


Gly 


Gly 


Gin 


Leu 


Thr 


Glu 


Arg 


Val 










645 










65 0 










655 




Arg 


Arg 


Lys 


Pro 


Tyr 


Ser 


Val 


Val 


Leu 


Leu 


Asp 


Glu 


He 


Glu 


Lys 


Ala 








660 










665 










670 






His 


Ala 


Asp 


Val 


Phe 


Asn 


Leu 


Leu 


Leu 


Gin 


Val 


Met 


Asp 


Glu 


Gly 


Gin 






675 










680 










685 








Leu 


Thr 


Asp 


Ser 


Leu 


Gly Arg 


Arg 


Val 


Asn 


Phe 


Lys 


Asn 


Thr 


Val 


He 




690 










695 










7 00 










He 


He 


Thr 


Ser 


Asn 


Val 


Gly 


Thr 


Arg 


Gin 


Leu 


Lys 


Asp 


Phe 


Gly 


Gin 


705 










710 










715 










720 


Gly 


He 


Gly 


Phe 


Arg 


Ser 


Glu 


Lys 


Asp 


Glu 


Glu 


Ala 


Asn 


Lys 


Glu 


His 










725 










730 








735 




Ser 


Arg 


Ser 


Val 


He 


Gin 


Lys 


Ala 


Leu 


Asn 


Lys 


Thr 


Phe 


Ser 


Pro 


Glu 








740 










745 










750 






Phe 


Leu 


Asn 


Arg 


Leu 


Asp 


Asp 


He 


He 


Leu 


Phe 


Asp 


Gin 


Leu 


Gly 


hys 






755 










760 










765 








Thr 


Glu 


He 


Arg 


Arg 


net 


Val 


Asp 


He 


Glu 


Leu 


Lys 


Ala 


Val 


Leu 


Ala 




77 0 










775 










780 










Arg 


He 


His 


Arg 


Ala 


Gly 


Tyr 


Asp 


Leu 


Val 


Leu 


Thr 


Asp 


Glu 


Ala 


Lys 


785 










7 90 










795 










800 


Asp 


Val 


He 


Ala 


Thr 


Lys 


Gly 


Tyr 


Asp 


Leu 


Gin 


Tyr 


Gly 


Ala 


Arg 


Pro 










805 










810 










815 




Leu 


Lys 


Arg 


Thr 


Leu 


Gin 


Asn 


Glu 


Val 


Glu 


Asp 


Arg 


Leu 


Thr 


Asp 


Leu 








02O 










825 










830 






He 


Leu 


Ser 


Gly 


Gin 


He 


Glu 


Lys 


Gly 


Gin 


Thr 


Leu 


Thr 


Leu 


Ser 


Ala 






835 










840 










845 








Arg 


Asp 


Gly 


Glu 


He 


He 


Val 


Gin 


Glu 


Gin 


Ala 














050 










855 
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(2) IMFORIIATIOU FOR SEQ ID 110:^29 

(i) SEQUEdCE CHARACTERISTICS: 

(A) LENGTH: 417 amino acid^ 
5 (B) TYPE: amino acid 

(D) TOPOLOGV: linear 

(ii) llOLECULE TYPE: protein 

10 (iii) HYPOTHETICAL: YES 

r.-i) ORIGINAL SOURCE: 

(A) ORGAMISIl: Porphyromonas gingival is 

15 ( ix) FEATURE: 

(A) IIAJIE/KEY: mi sc_f eatur- 
{ B) LOCATIOM 1 ... 417 



20 



(:-:i) SEQUEiiCE DESCRI PTIOII : SEQ ID 110:429 



tiet Asn Tyr Leu Tyr He Leu He Thr Leu Leu Leu Ser Gly Phe Phe 
1 5 10 15 

Ser Gly Ala Giu He Ala Phe Leu Ser Ser Asp Lys Leu Arg Leu Glu 
2 C' 25 30 

25 Leu Asp Arg Asn Arg Gly Asp Leu Thr Gly Arg Ala Leu Asn Leu Leu 
35 4 0 '45 

Tyr Arg His Pro Asp Gin Leu Val Thr Thr Leu Leu Val Gly Asn Asn 

50 55 60 

He Val Leu Val Val Tyr Gly Leu Leu Met Ala Gly Leu Leu Ala Ala 
30 65 70 75 80 

Pro Leu Ala Gin Trp He Asp Asn Asp Ala tlet He Val Val Leu Gin 

85 90 95 

Ser Val Leu Ser Thr He He He Leu Phe Thr Gly Glu Phe Leu Pro 
100 105 110 

35 Lys Ala He Phe Lys Thr Asn Ala Asn llet t-Iet Met Arg Val Phe Ala 
115 120 125 

Leu Pro He Val Ala He Tyr Tyr Leu Leu Tyr Pro Leu Ser Lys Leu 

130 135 140 

Phe Thr Gly Leu Ser Arg Ser Phe He Arg Leu Val Asp Lys Asn Tyr 
40 145 150 155 160 

Val Pro Thr Thr Val Gly Leu Gly Arg Val Asp Leu Asp His Tyr Leu 

165 170 175 

Ala Glu Asn Met Ser Gly Glu Asn Glu Gin Asn Asp Leu Thr Thr Glu 
180 185 190 

45 Val Lys He He Gin Asn Ala Leu Asp Phe Ser Gly He Gin Val Arg 
195 200 205 

Asp Cys Met He Pro Arg Asn Glu Met He Ala Cys Glu Leu Gin Thr 

210 215 22 0 

Asp lie Glu Val Leu Lys Thr Thr Phe He Asp Thr Gly Leu Ser Lys 
50 225 230 235 240 

He He He Tyr Arg Gin Asn He Asp Asp Val Val Gly Tyr He His 

245 250 255 

Ser Ser Glu Met Phe Arg Gly Gin Asp Trp Gin Lys Arg He Asn Thr 
260 265 270 

55 Thr Val Phe Val Pro Glu Ser Met Tyr Ala Asn Lys Leu Met Arg Leu 
275 280 285 

Leu Met Gin Arg Lys Lys Ser He Ala He Val He Asp Glu Leu Gly 

290 295 300 

Gly Thr Ala Gly Met Val Thr Leu Glu Asp Leu Val Glu Glu He Phe 
60 305 310 315 320 

Gly Asp He Glu Asp Glu His Asp Thr Arg Lys He He Ala Lys Gin 

325 330 335 

Leu Gly Pro His Thr Tyr Leu Val Ser Gly Arg Met Glu He Asp Asp 
340 345 350 

65 Val Asn Glu Arg Phe Gly Leu Ser Leu Pro Glu Ser Asp Asp Tyr Leu 
355 360 365 

Thr Val Ala Gly Phe He Leu Asn Ser His Gin Asn He Pro Gin Ala 

370 375 380 

Asn Glu Val Val Glu He Ala Pro Tyr Thr Phe Thr He Leu Arg Ser 
70 385 390 395 400 

Ser Ser Thr hys He Glu Leu Val Lys Met Ser He Asp Asp Gin Ser 
•105 410 415 

Asn 

75 
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(2) inFORIIATIOIJ FOR SEQ ID NO: 4 30 

(i) SEQUEIJCE CHARACTERISTICS; 

(A) LENGTH: 2 93 amino acid£^ 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) nOLECULE TYPE: protein 

10 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAHISII: Porphyromonas gingival is 

15 (i:-:) FEATURE: 

(A) rJAIlE/KEY: niisc_f eature 

(B) LOCATION 1 ... 293 



70 



(::i) SEQUENCE DESCRIPTION: SEQ ID NO: 430 



20 



25 



30 



35 



40 



45 



50 



55 



60 (2) INFORMATION FOR SEQ ID NO: -131 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 amino acid^ 
{B) TYPE: amino acid 
65 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



r let 


Lyr 


Gin 


A?n 


Tyr 


Phe 


Lys 


Arg 


Val 


Cys 


Ser 


Leu 


Leu 


Trp 


Leu 


Val 


1 








5 










10 










15 




Leu 


Pro 


I let 


Leu 


He 


Met 


Pro 


Leu 


Glu 


Val 


Ala 


Ala 


Gin 


Glu 


He 


He 








2 0 










2 5 










30 






Pro 


A^n 


Gl u 


Gl u 


Val 


Leu 


Glu 


Ser 


Leu 


Thr 


Fhe 


Val 


Ala 


Pro 


Val 


Glu 
















4 0 










45 








Glu 


Thr 


Asp 


Ala 


He 


Glu 


Ala 


Glu 


Val 


Glu 


Ala 


Leu 


Gin 


Glu 


He 


Val 














55 










60 










Ala 


Thr 


Glu 


Glu 


He 


Ala 


Glu 


Gin 


Ala 


Val 


Arg 


Ser 


Tyr 


Thr 


Tyr 


Thr 


65 










70 










75 










80 


Val 


Tyr 


Arg 


A^-p 


Gly 


Val 


Lys 


He 


Ala 


Ser 


Gly 


Leu 


Thr 


Glu 


Pro 


Thr 










85 










90 










95 




Phe 


Leu 


Asp 


Glu 


Asp 


Val 


Pro 


Ala 


Gly 


Glu 


His 


Thr 


Tyr 


Cys 


Val 


Glu 








100 










105 










110 






Val 


Gin 


Tyr 


Gin 


Gly 


Gly 


Val 


Ser 


Asp 


Lys 


Val 


Cys 


Val 


Asp 


Val 


Glu 






115 










120 










125 








Val 


hys 


Asp 


Phe 


Lys 


Pro 


Val 


Thr 


Asn 


Leu 


Thr 


Gly 


Thr 


Ala 


Ser 


Asn 




130 










135 










140 










A^p 


Glu 


Val 


Ser 


Leu 


Asp 


Trp 


Asp 


Gly 


Val 


Glu 


Glu 


Lys 


Ala 


Glu 


Glu 












150 










155 










160 


Pro 


Ala 


Ser 


Asp 


Lys 


Ala 


Val 


Ser 


Tyr 


Asn 


Val 


Tyr 


Lys 


Asn 


Gly 


Thr 










165 










170 










175 




Leu 


He 


Gly 


Asn 


Thr 


Ala 


Glu 


Thr 


His 


Tyr 


Val 


Glu 


Thr 


Gly 


Val 


Ala 








180 










185 










190 






Asn 


Gly 


Thr 


Tyr 


He 


Tyr 


Glu 


Val 


Glu 


Val 


Lys 


Tyr 


Pro 


Asp 


Gly 


Val 






195 










200 










205 




Ser 


Fro 


Lys 


Val 


Ala 


Val 


Thr 


Val 


Thr 


Val 


Thr 


Asn 


Ser 


Ser 


Leu 


Ser 




210 










215 










220 










Arn 


Val 


Asp 


Gly 


Gin 


Ala 


Pro 


Tyr 


Thr 


Leu 


Arg 


Val 


Glu 


Gly 


Ly^ 


Lys 


225 










230 










235 








240 


He 


He 


Ala 


Glu 


Ala 


His 


Gly 


Met 


He 


Thr 


Leu 


Tyr 


Asp 


He 


Asn 


Gly 










245 










250 










255 




Arg 


Thr 


Val 


Ala 


Val 


Ala 


Pro 


Asn 


Arc 


Leu 


Glu 


Tyr 


Met 


Ala 


Gin 


Thr 








260 










265 










270 






Gly 


Phe 


Tyr 


Ala 


Val 


Arg 


Phe 


Asp 


Val 


Gly 


Asn 


Lys 


His 


His 


Val 


Ser 






275 










280 










285 








Lyo 


He 


Gin 


Val 


Arg 


























2 90 































(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 



(i::) FEATURE: 
75 {A) NAME/KEY: misc feature 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1J_> 



wo 99/29870 



PCT/AU98/01023 



371/ 490 



(B) LOCATION 1 . . . 312 

(xi) SEQUENCE DESCRIPTIOII: SEQ ID MO: 431 

5 M'=-t lie Pro Leu Ser Glu Ser Phe Glu Ser Gly He Pro Ala He Trp 
15 10 15 

Lys Thr He Asp Ala Asp Gly Asp Gly Tyr Asn Trp Met His Leu Thr 

20 , 2 5 30 

Asn Phe Thr Gly Gin Ser Gly Leu Cys Va 1 Ser Ser Ala Ser Tyr He 
10 35 40 A5 

Gly Gly Val Gly Ala Leu Thr Pro Asp Asn Tyr Leu He Thr Pro Glu 

50 55 60 

Leu Lys Leu Pro Thr Asp Ala Leu Val Glu He He Tyr Trp Val Cys 
65 70 75 80 

15 Thr Gill Asp Leu Thr Ala Pro Ser Glu His Tyr Ala Val Tyr Ser Ser 

85 90 95 

Ser Thr Gly Asn Asn Ala Ala Asp Phe Val Asn Leu Leu Tyr Glu Glu 

100 105 HO 

Thr Leu Thr Ala Lys Arg He Gin Ser Pro Glu Leu He Arg Gly Asn 
20 lis 120 125 

Arcj Thr Gin Gly Val Trp Tyr Gin Arg Lys Val Val Leu Pro Asn Asp 

130 135 140 

Thr Lys Tyr Val Ala Phe Arg His Phe Asn Ser Thr Asp Asn Phe Trp 
1.15 150 155 160 

25 Leu Asn Leu Asp Glu Val Ser He Leu Tyr Thr Pro Leu Pro Arg Arg 

165 170 175 

Ala Pro Cys Fro His Pro Gly Gly Tyr Thr Tyr Sec Val Phe Arg Asp 

180 185 190 

Gly Gin Lys He Ala Ser Gly Leu Ser Ala Leu Ala Tyr He Asp Thr 
30 195 200 205 

Asp Val Pro Tyr Gly Thr Gin Asp Tyr Cys Val Gin Val Asn Tyr Leu 

210 215 220 

Gin Gly Asp Ser Tyr Lys Val Cys Lys Asn He Val Val Ala Asn Ser 
225 230 235 240 

35 Ala Asn He Tyr Gly Ala Asp Lys Pro Phe Ala Leu Thr Val Val Gly 

245 250 255 

Lys Thr He Val Ala Ser Ala Phe Lys Gly Glu He Thr Leu Tyr Asp 

260 265 270 

He Arg Gly Arg Leu He Ala Ser Gly Cys Asp Thr Leu Arg Tyr Lys 
40 275 280 285 

Ala Glu Asn Gly Phe Tyr Leu He Lys He Gin Val Asn Gly Thr Val 

290 295 300 

Tyr Thr Glu Lys He Gin He Gin 
305 310 



45 



65 



70 



75 



(2} IIJEORMATIOII FOR SEQ ID 110:432 



{i} SEQUEMCE CHARACTERISTICS: 

(A) LEMGTH: 843 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

55 (iiil HYPOTHETICAL: YES 

(vi) ORIGIIJAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

60 (i::) FEATURE: 

(A) MATIE/KEY: misc_feature 
(B } LOCATIOJf 1 ... 843 



(::i) SEQUEIICE DESCRIPTIOII: SEQ ID lfO:432 



Met 


Lys 




Ser 


Phe 


Leu 


Leu 


Ala 


He 


Val 


Met 


Leu 


Phe 


Gly 


He 


Ala 


1 








5 










10 










15 




Met 


Gl Ii 


Gly 


His 

20 


Ser 


Ala 


Pro 


Val 


Thr 
25 


Lys 


Glu 


Arg 


Ala 


Leu 
30 


Ser 


Leu 


Ala 


Arg 


Leu 

35 


Ala 


Leu 


Arg 


Gin 


Val 
4 0 


Ser 


Leu 


Arg 


Met 


Gly 
45 


Gin 


Thr 


Ala 


Val 


Ser 
50 


Asp 


Lys 


He 


Ser 


He 
55 


Asp 


Tyr 


Val 


Tyr 


Arg 

60 


Gin 


Gly 


Asp 


Ala 


Glu 


Arg 


Gly 


He 


Thr 


Ser 


Gin 


Glu 


Glu 


Gly 


Ser 


Pro 


Ala 


Tyr 


Phe 


Tyr 


65 










70 










75 










80 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCtD; <WO 9929870A1J_> 



wo 99/29870 PCT/AU98/01 023 

372/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Va 1 


Al d 


As n 


Arg 


Gly 


Asn 


Asn 


Glu 


Gly 


Tyr 


Al a 


Leu 


Val 


Al a 


Ala 


Asp 










85 










90 










95 






Ar cj 


T 1 o 

J. X e 


Pro 


Thr 


lie 


Leu 


Al a 


Tyr 


Ser 


Pro 


He 


Gly 


Arg 


Phe 


Asp 








100 










105 










lie* 








Asp 


S°r 


1*1 eL 


Pro 


Asp 


As n 


Leu 


Arg 


Met 


Trp 


Leu 


1^ J. n 


Tic 

lie 


Tyr 


Asp 






115 










12 0 


















Gin 


Glu 


He 


J,- 


L*u 


11 *=• 


T cii 1 

jjeu 


Ser 




Lys 


A± d 


'j J. n 


Leu 


Asn 


tjl u 


Glu 




130 




















1 4 L' 










T 1 c 


Leu 


Ar Q 


Thr 


Glu 




Va 1 


Pro 


Al a 


Glu 


Va 1 


His 


Al a 


Leu 


Met 


Asp 












-L O L' 




















1 0 1/ 


Asn 


<ji 


Mis 


Ph*^ 


Ala 


n 


Asp 


Pro 


i'le c 


Arg 


Trp 


A^ n 


tal n 


'ji J 


Tyr 


Pro 










165 










1 T n 










1 7 c 
X ' ^ 




Trp 




sn 




(jIu 


Pro 




liSU 


Fro 


Asn 


1 • - 

Ol J 


As n 


His 


Al a 


i J r 


Th r 








r o 










18 5 










190 








's 


V 3 J. 


HI = 
MJ. a 


1 nr 


AJ. a 


Al a 


Al a 


Gin 


He 


I let 


Arg 


Tyr 


His 


Ser 


Trp 






1 95 










2 00 










205 






Pro 


Leu 


Gin 


Gly 


Glu 


Gly 


Ser 


Phe 


Asp 


Tyr 


His 


Al a 


Gly 


Ser 


Leu 


Val 




210 










215 










22 0 










Gly 


Asn 


Trp 


Se r 


Gly 


Thr 


Phe 


Gly 


Glu 


Met 


Tyr 


Asp 


Trp 


He 


Asn 


Met 


22 5 










230 










235 










24 0 


Pro 


Gly 


Asn 


Pro 


Asp 


Leu 


Asp 


Asn 


Leu 


Thr 


Gin 


Ser 


Gin 


Val 


Asp 


Ala 










24 5 










250 










255 




Tyr 


Ala 


Thr 


Leu 


I'let 


Arg 


Asp 


Val 


Ser 


Ala 


Ser 


Val 


Ser 


Met 


Ser 


Phe 








2 60 










265 










27 0 






Tyr 


Glu 


Asn 


Gly 


Ser 


Gly 


Thr 


Tyr 


Ser 


Val 


'^'yr 


Va 1 


Val 


Gly 


Ala 


Leu 






27 5 










28 0 










2 85 








A r<g 


Asn 


Asn 


Ph e 


Arg 


Tyr 


Lys 


Arg 


Ser 


Leu 


Gin 


Leu 


His 


Val 


Arg 


Ala 




2 90 










^ ^ ^ 










3 0 0 












Tyr 


Thr 


Ser 


(jin 


LjI U 


Trp 


His 


Asp 


Met 


He 


Arg 


Gly 


Glu 


Leu 


Al a 


3''i5 










J 1 'J 










315 










32 0 


3*=" r 


(■^1 \,- 
'oi V 




Pro 


V a 1 


rp, . 


Tyr 


Al a 


Gl y 


Asn 


As n 


kjx n 


Ser 


He 


Gly 


His 










"5 9 

J z o 










330 










335 




Ala 




Va 1 


Cys 


Asp 




Tyr 


Al a 


Ser 


Asp 


Gly 


Th r 


Phe 


His 


Phe 


Asn 








34 0 










34 5 










350 






1 rp 


Ljiy 


1 rp 


Lj J- y 


GI V 


V d J. 


Se r 


Asn 


Gly 


Phe 


Tyr 


Lys 


Leu 


Thr 


Leu 


Leu 
















360 










3 65 








Ser 


Pro 


Thr 


Ser 


Leu 


Gly 


He 


Gly 


Gly 


Glu 


Gly 


He 


Gly 


Phe 


Thr 


1 1 e 




37 0 










37 5 










380 










Tyr 




Gl u 


Tin 

r J- e 


Tin 

lie 


Thr 


Gly 


He 


Gl u 


Pro 


Ala 


L'j's 


Thr 


Pro 


Ala 


Glu 












3 90 










3 95 










4 00 


n 1 ^ 

MJ. d 




Th r 


Asp 


Al ct 


Leu 


Pro 


He 


Leu 


Ala 


Leu 


Lys 


Asp 


He 


Glu 


Al a 










4 05 










410 










415 




Glu 


Tyr 


Lys 


Ser 


Glu 


Ser 


Gly 


Leu 


Asn 


Val 


Gly 


Tyr 


Ser 


He 


Tyr 


Asn 








4 20 










4 25 










4 30 






Thr 


Gly 


Gl u 


Glu 


Gin 


Ser 


Asn 


Leu 


Asp 


Leu 


Gl J- 


Tyr 


Arg 


Leu 


Asn 


Lys 






4 35 










4 4 0 










4 4 5 








Al a 


Asp 


Gly 


Glu 


Val 


He 


Glu 


Val 


Lys 


Thr 


Ser 


Ser 


He 


Asn 


He 


Ser 














4 55 










4 60 










u rp 


1 yr 


Gly 


Tyr 


Gly 


Gl u 


His 


Pro 


Glu 


Ser 


Phe 


Ser 


Leu 


Al a 


Pro 


Asn 


HO ^ 










4 7 0 










4 7 5 










4 80 


^3 J. n 


LiSU 


Ser 


Gl n 


L)l j' 


lie 


Asn 


Thr 


He 


Thr 


Leu 


Leu 


Tyr 


Arg 


Arg 


Thr 










4 85 










4 90 










4 95 




Gly 


Thr 


Gl u 


Gin 


Trp 


Glu 


Pro 


Val 


Arg 


His 


Ala 


Gin 


Gly 


Gly 


Tyr 


Val 








500 










505 










51 0 






As'n 


Ser 


He 


Lys 


Val 


Asn 


Thr 


Thr 


Asp 


Pro 


Asn 


Asn 


Val 


Val 


Val 


Thr 






51 5 










520 










525 








\.'- ^ 


Asp 


Asn 


Asn 


Glu 


Gl^' 


Lys 


Leu 


Ser 


He 


Val 


Pro 


Asn 


Ser 


Phe 


Val 




530 










535 










54 0 










li 1 - 

r\± u 


Asp 


Le u 


Asn 


Ser 


Tyr 


Glu 


His 


Ser 


Thr 


He 


Thr 


Val 


Gin 


Fhe 


Asn 


54 5 




















555 










560 


S*^r 


rtsp 


oer 


P ro 


Asp 


O J. U 


lie 


Arg 


Thr 


Pro 


Va 1 


Al a 


Phe 


Ala 


Leu 


Ser 




















57 0 










57 5 




Thr 


Gly 


Ala 


Thr 


Ala 


Asp 


Asp 


Val 


He 


Ser 


Leu 


Gl*^^ 
J J. J 


T rp 


Val 


I let 


Al a 








580 










585 










590 






Glu 


Val 


Pro 


Gly 


Gly 


ser 


Ser 


Asn 


Tyr 


Fro 


Val 


Val 


Trp 


ser 


Lys 


Asp 






595 










600 










605 








Val 


Leu 


Thr 


Leu 


Ser 


Glu 


Gly 


Asp 


Tyr 


Thr 


Leu 


Trp 


Tyr 


Arg 


Phe 


Ser 




610 










615 










620 










He 


Asn 


As n 


Gin 


Lys 


Asp 


Glu 


Trp 


Lys 


Lys 


He 


Gly 


Ser 


Val 


Ser 


Val 


625 










630 










635 










640 


Ly.T 


Thr 


Pro 


Thr 


Glu 


Tyr 


Thr 


His 


Pro 


Leu 


Phe 


Glu 


Val 


Gly 


His 


Asn 










64 5 










650 










655 




Gin 


Thr 


Ser 


Thr 


Tyr 


Thr 


Leu 


Asp 


ilet 


Ala 


His 


Asn 


Arg 


Val 


Leu 


Pro 








660 










665 










670 






Asp 


Fhe 


Thr 


Leu 


Lys 


Asn 


Leu 


Gly 


Leu 


Pro 


Phe 


Asn 


Gly 


Glu 


Leu 


Val 



BNSDOCID: <WO 99e9870A-l J_> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 
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373/ 490 



615 680 685 



40 



45 



50 



55 



60 



65 



70 



75 



Val 


Val 


Phe 


Arq 


Gl n 


Thr 


Gin 


Ser 


Ser 


Ser 


Gly 


Ser 


Leu 


Trp 


Ala 


Ala 




690 










695 










7 00 










Gin 


Glu 


Thr 


Val 


His 


He 


Lys 


Gin 


Gly 


Glu 


Thr 


Phe 


Val 


Tyr 


Lys 


Pro 


705 










710 










715 










7 20 


Val 


Val 


Glu 


Gly 


Pro 


He 


Pro 


Asp 


Gly 


Ser 


Tyr 


Arg 


Ala 


Thr 


Leu 


His 










725 










730 










735 




Ala 


Phe 


Va 1 


Asn 


'^ly 


Gin 


Gin 


Gin 


Leu 


Tyr 


Leu 


Ly^ 


Gly 


Lys 


Arg 


As n 








740 










7^5 










7 50 






Tyr 


Thr 


Val 


Lys 


He 


Val 


Asn 


Gly 


Thr 


Ala 


Val 


Glu 


Ala 


He 


Glu 


Ser 






755 










7 60 










765 








Sei- 


Glu 


Glu 


He 


Arg 


Val 


Phe 


Pro 


Asn 


Pro 


Ala 


Arg 


Asp 


Tyr 


Val 


Glu 




770 










775 










780 










ne 


Ser 


Ala 


Pro 


Cys 


He 


Pro 


Gin 


Glu 


Thr 


Ser 


ri- 


He 


Leu 


Phe 


Asp 


7C5 










790 










7 95 










800 


Leu 


Ser 


Gly 


Ly- 


He 


Val 


Met 


Lys 


Asn 


Ser 


Leu 


ser 


Ala 


Gly 


His 


Gly 










805 










810 










815 




Arg 


Met 


Asp 


Val 


Ser 


Arg 


Leu 


Pro 


Asn 


Gly 


Ala 


Tyr 


He 


Leu 


Lys 


Val 








820 










825 










830 






Asp 


Gly 


Tyr 


Thr 


Thr 


Lys 


He 


Asn 


He 


Val 


His 













10 



15 



20 

835 840 
(2) IliTORIiATIOn FOR SEQ ID HO: 4 33 

25 (i) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 2 90 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 (ii) IIOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIMAL SOURCE: 
35 (A) ORGAIIISM: Po rphyromonas gingival is 

(ix) FEATURE: 

(A) IIAIIE/KEY: misc_f eature 

(B) LOCATION 1 ... 290 



(;:i) SEQUEtfCE DESCRIPTION: SEQ ID NO: 4 33 



Met 


Lys 


Lys 


Leu 


Phe 


Leu 


Ser 


Leu 


Thr 


Ser 


Leu 


Val 


Met 


Val 


Phe 


Ala 


1 








5 










10 










15 




Val 


Ala 


Ser 


Cys 


Asp 


He 


He 


Asp 


Lys 


Asp 


Gin 


Thr 


Leu 


Leu 


Pro 


Ala 








20 










25 










30 






Pro 


Thr 


Asn 


Val 


Thr 


Pro 


Asp 


Asn 


Pro 


Asp 


Asp 


Asn 


Pro 


Ser 


Glu 


He 






35 










4 0 










45 








Asp 


He 


Thr 


Gin 


Thr 


His 


Thr 


Glu 


Ly- 


Tyr 


Val 


Leu 


Ala 


Glu 


Glu 


Phe 




50 










c c 










60 










Thr 


Gly 


Gin 


Lys 


C^'-s 


Leu 


Asn 


Cys 


Pro 


Lys 


Gly 


His 


Arg 


Lys 


Leu 


Ala 


65 










70 










75 










80 


Ala 


Leu 


Lys 


Glu 


Gin 


Tyr 


Gly 


Lys 


Arg 


Leu 


Thr 


Val 


Val 


Gly 


He 


His 










85 










90 










95 




Ala 


Gly 


Pro 


Gly 


Ser 


Leu 


Val 


Pro 


Pro 


Leu 


Phe 


Arg 


Thr 


Glu 


Ala 


Gly 








100 










105 










HO 






Asp 


Ala 


Tyr 


Tyr 


Ser 


Lys 


Phe 


Ala 


Asn 


Asn 


Thr 


Pro 


Leu 


Pre 


Ala 


Leu 






115 










120 










125 








net 


Val 


Ser 


Arg 


Lys 


Lys 


Phe 


Gly 


Ser 


Ser 


Tyr 


Val 


Tyr 


Asp 


Lys 


Ser 




130 










135 










140 










Tyr 


Lys 


Thr 


Trp 


Asp 


Va 1 


Pro 


He 


Ala 


Glu 


Gin 


tlet 


Glu 


Gin 


Lys 


Ala 


145 










150 










155 










loO 


Lys 


He 


Asn 


He 


Phe 


Ala 


Val 


Ala 


Glu 


Tyr 


Thr 


Asp 


Thr 


Gin 


Lys 


He 










165 










170 










175 




Lys 


Val 


Thr 


Val 


Lys 


Gly 


Lys 


He 


Leu 


Glu 


Gly 


Asn 


Thr 


Leu 


Pro 


Lys 








100 










185 










1 90 






Ser 


Het 


Val 


Gin 


Val 


Tyr 


Leu 


Leu 


Glu 


Asp 


Lys 


Leu 


He 


Ala 


Pro 


Gin 






1?5 










2 00 










205 








Val 


Asp 


Gly 


Asn 


Thr 


Thr 


Val 


Glu 


Asn 


Tyr 


Glu 


His 


Asn 


His 


Val 


Leu 




210 










215 










220 










Arg 


Gly 


Ala 


Val 


Asn 


Gly 


He 


Trp 


Gly 


Glu 


Glu 


Phe 


Val 


Asn 


Leu 


L-ys 


225 










230 










235 










24 0 


Asp 


Tyr 


Leu 


Tyr 


Thr 


Tyr 


Ala 


Val 


Glu 


Pro 


Leu 


Ser 


Gly 


Met 


Ser 


Phe 










245 










250 










255 




Val 


Ala 


Glu 


Asn 


Tyr 


Ser 


He 


Vol 


Ala 


The 


Val 


Tyr 


Asp 


Val 


Gin 


Thr 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 PCT/AU98/01023 

374/490 

260 265 270 

Phe GIu Vul Tyr Asp Val Val His Val Lys Tie 7\sn Pro Gin Ser Asp 
21b 280 285 

Gly Lys 
5 290 

(2) IliPORlIATIOM FOR SEQ ID MO: 4 3^ 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 223 amino acid:? 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



15 



20 



35 



40 



45 



50 



55 



60 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromona s gingiva lie 



{ix) FEATURE: 

(A) NAME/ KEY: mis cofeature 

(B) LOCATION 1 ... 223 

25 SEQUENCE DESCRIPTION: SEQ ID NO: 43^ 



30 



I lot 


Lys 


Lys 


Ser 


Ser 


Val 


Val 


Ala 


Ser 


Val 


Leu 


Ala 


Val 


Ala 


Leu 


Val 


1 








5 










10 










15 




Ph- 


Ala 


Gly 


Cys 


Gly 


Leu 


Asn 


Asn 


I-ie t 


Ala 


Lys 


Gly 


Gly 


Leu 


He 


Gly 








20 










25 










30 




Ala 


GI y 


Val 


Gly 


Gly 


Ala 


li- 


Gly 


Ala 


Gly 


Val 


Gly 


Asn 


Val 


Ala 


Gly 






35 










40 










45 








Asn 


Thr 


Ala 


Val 


Gly 


Ala 


ne 


Val 


Gly 


Thr 


Ala 


Val 


Gly 


Gly 


Ala 


Ala 




SO 










55 










60 










Gly 


Ala 


Leu 


He 


Gly 


Lys 


Lys 


Met 


Asp 


Lys 


Gin 


hys 


Lys 


Glu 


Leu 


Glu 


65 










70 










75 










80 


Ala 


Ala 


Val 


Pro 


Asp 


Ala 


Thr 


He 


Gin 


Thr 


Val 


Asn 


Asp 


Gly 


Glu 


Ala 










85 










90 










95 




He 


Leu 


Val 


Thr 


Phe 


Asp 


Ser 


Gly 


He 


Leu 


Phe 


Ala 


Thr 


Asn 


Ser 


Ser 








100 










105 










110 






Thr 


Leu 


Ser 


Pro 


Asn 


Ser 


Arg 


Thr 


Ala 


Leu 


Thr 


Lys 


Phe 


Ala 


Ala 


Asn 






115 










120 










125 








I iet 


A^n 


Lys 


Asn 


Pro 


Asp 


Thr 


Asp 


He 


Arg 


He 


Val 


Gly 


His 


Thr 


Asp 




13 0 










135 










140 










A£rn 


Thr 


Gly 


Ser 


Asp 


Lys 


He 


Asn 


Asp 


Pro 


Leu 


Ser 


Glu 


Arg 


Arg 


Ala 


145 










150 










155 








160 


Ala 


Ser 


Val 


Tyr 


Ser 


Phe 


Leu 


Asn 


Ser 


Gin 


Gly 


Val 


Ser 


Met 


Ser 


Arg 










165 










170 










175 


Met 


A I a 


Ala 


Glu 


Gly 


Arg 


Gly 


Ser 


His 


Glu 


Pro 


Val 


Ala 


Asp 


Asn 


Ser 








180 










185 










190 






Thr 


Val 


Ala 


Gly 


Arg 


Ser 


Ala 


Asn 


Arg 


Arg 


Val 


Glu 


Val 


Tyr 


He 


Leu 






195 










200 










205 








Pro 


Asn 


Ala 


Lys 


Met 


He 


Glu 


Gin 


Ala 


Gin 


Gin 


Gly 


Thr 


Leu 


Lys 






210 










215 










22 0 









(2) INFORMATION FOR SEQ ID NO: 4 35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

65 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porpl^iyromonas gingival is 

70 (i::) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1 . . . 337 



75 



(Mi) SEQUENCE DESCRIPTION: SEQ ID NO:435 



BNSDOCtD; <WO 9929870A1_1. > 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



PCT/AU98/01023 



375/490 



10 



15 



20 



25 



30 



35 



40 



45 



65 



70 



75 



net 


Ser 




Lys 


Ser 


He 


Leu 


Leu 


Leu 


Cys 


cys 


Ser 


Leu 


cys 


Phe 


He 


1 




5 










10 










15 




Ser 


Ala 


Thr 


Lys 


Ala 


Val 


Thr 


Pro 


Val 


Arg 


Asn 


Val 


Arg 


Asn 


Ser 


Gin 








20 










25 










30 






Val 


Asn 


Ser 


Lys 


Ala 


Lys 


Thr 


Glu 


Arg 


Thr 


Lys 


Pro 


Ser 


Asp 


Ser 


Val 






35 










40 










45 








Arg 


Tyr 


He 


Ser 


Asn 


Met 


He 


Ala 


Asp 


Arg 


Leu 


Glu 


Phe 


Arg 


Asn 


Lys 




50 










55 










60 










He 


Ser 


Ser 


Glu 


Lys 


Glu 


Val 


Arg 


Lys 


Ala 


Glu 


Tyr 


Glu 


Asn 


Arg 


Leu 


65 










70 










75 










80 


Ala 


Met 


Glu 


Ala 


Leu 


Asn 


Tyr 


Pro 


Ala 


He 


Asp 


Leu 


Tyr 


Gly 


Glu 


Asp 










85 










90 










95 




Ser 


Trp 


Ser 


Glu 


T^-r 


Val 


Asn 


Pro 


Phe 


Val 


Gly 


Ala 


Gly 


Thr 


Asp 


Val 








100 










105 










110 






Glu 


He 


Pro 


Asn 


Ser 


Tyr 


Asp 


He 


Asp 


Cys 


Ser 


Ser 


Phe 


Val 


Met 


Pro 






115 










120 










125 








Val 


Glu 


Asp 


Lys 


Gin 


Val 


Thr 


Ser 


Gin 


Phe 


Gly 


Tyr 


Arg 


Arg 


Arg 


Phe 




130 










135 




















Gly 


Arg 


Met 


His 


Tyr 


Gly 


He 


Asp 


Leu 


Ser 


Val 


Asn 


Arg 


Gly 


Asp 


Thr 


1^5 










150 










155 










160 


He 


Aug 


Ala 


Ala 


Phe 


Asp 


Gly 


Lys 


Val 


Arg 


\'al 


Arg 


Ser 


Tyr 


Glu 


Ala 










165 










170 










175 




Arg 


Gly 


Tyr 


Gly 


Tyr 


Tyr 


He 


Val 


Leu 


Arg 


His 


Pro 


Asn 


Gly 


Leu 


Glu 








180 










185 










190 






Thr 


Val 


Tyr 


Gly 


His 


Met 


Ser 


Arg 


Gin 


Leu 


Val 


Asp 


Glu 


Asn 


Gin 


He 






195 










200 










2 05 








Val 


Arg 


Ala 


Gly 


Gin 


Pro 


He 


Gly 


Leu 


Gly 


Gly 


Ser 


Thr 


Gly 


Arg 


Ser 




210 










215 










220 










Thr 


Gly 


Pro 


His 


Leu 


His 


Phe 


Glu 


Thr 


Arg 


Fhe 


Met 


Gly 


He 


Pro 


He 


225 










230 










235 










2^0 


Asn 


Pro 


Ser 


Thr 


He 


He 


Asp 


Phe 


Asp 


Asn 


'31 y 


Val 


Pro 


Leu 


Arg 


Asp 










245 










250 










255 




He 


Tyr 


Thr 


Phe 


Lys 


Arg 


Gly 


Ser 


Asn 


Ser 


Arg 


Tyr 


Ala 


Lys 


Ala 


Ser 








260 










265 










270 






Lys 


Thr 


Ser 


Ser 


Arg 


Tyr 


Ala 


Lys 


Lys 


Gly 


Lys 


Lys 


Gly 


Arg 


Gin 


Ala 






275 










280 










285 








Sei- 


Ser 


Pro 


Met 


Thr 


Tyr 


Arg 


He 


Lys 


Lys 


Gly 


Asp 


Thr 


Leu 


Glu 


Thr 




290 










295 










300 










ne 


Ala 


Lys 


Arg 


His 


Gly 


Thr 


Ser 


Val 


Gin 


Lys 


Leu 


Cys 


Ala 


Thr 


Asn 


305 










310 










315 










320 


Gly 


He 


Gly 


Lys 


Ser 


Lys 


He 


Leu 


Thr 


Pro 


Gly 


Lys 


Ala 


Leu 


Arg 


He 










325 










330 










335 




Lys 

































(2) lUFORIIATIOH FOR SEQ ID IJO:436 



(i) SEQUEMCE CHARACTERISTICS: 

(A) LEHGTH: 151 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

55 (iii) ilYPOTHETICAL: YES 

(vi) GRIG in AL SOURCE: 

(A) ORGAUISM: Porphyromonas gingival i£ 

60 (ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATIOn 1 ... 151 



i>:i) SEQUENCE DESCRIPTIOM: SEQ ID 110:436 



Met 


Ala 


Lys 


li- 


Asn 


Phe 


Tyr 


Ala 


Glu 


Gly 


Vnl 


Ser 


Leu 


Pro 


Arg 


He 


1 








5 










10 










15 




Arg 


Arg 


Arg 


ne 

2 0 


Val 


Gly 


Lys 


Trp 


He 

25 


Ala 


Glu 


Val 


Cys 


Ser 
30 


Arg 


Tyr 


Gly 


Lys 


Ala 
35 


Val 


Gly 


Glu 


He 


Ser 
40 


Tyr 


Leu 


Fhe 


cys 


Asp 
45 


Asp 


Glu 


Tyr 


He 


Leu 
50 


Lys 


Ala 


Asn 


Gin 


Glu 
55 


Phe 


Leu 


Asp 


His 


Asp 
60 


Tyr 


Tyr 


Thr 


Asp 


li- 


He 


Thr 


Phe 


Asp 


Ser 


cys 


Glu 


Ala 


Asp 


Thr 


Val 


Asn 


Gly 


Asp 


Leu 


es 










70 










7 5 










80 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 PCT/AU98/01 023 

376/ 490 



Leu lie Ser Leu Asp Thr Val Arg Ser Asn Ala Arg Ala Leu Asp Leu 

85 90 95 

Arg Tyr Glu A^p Glu Leu His Arg Val lie He His Gly He Leu Hi^ 
100 105 110 

5 Leu Cy^ Gly Leu Lys Asp Lys Ser Lys Lys Asp Glu Ala Gin Met Arg 
115 120 125 

Ala Ala Glu Glu Lys Ala Leu Val Met Leu Arg Glu Thr He Gly Ser 

130 135 140 

Glu Leu Ser Leu Leu His Thr 
10 145 150 

(2) IirrORMATIOM FOR SEQ ID HO: ^37 

(i) SEQUEIICE CHARACTERISTICS: 
15 (A) LENGTH: 391 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



20 



25 



40 



45 



50 



55 



60 



65 



70 



75 



(ii) IIOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIIIAL SOURCE: 

(AJ ORGAMISIi: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) IJAI-IE/KEY: misc_f ea ture 

(B) LOCATION 1 ... 391 

30 {:-:i) SEQUENCE DESCRIPTION: SEQ ID NO:'j37 



35 



Met 


Lys 


V d -L 


Lys 


Tyr 


Leu 


Met 


Leu 


Thr 


Leu 


Val 


Gly 


Al a 


He 


Al a 


Leu 


1 








5 










10 










15 




Asn 


Ala 


Ser 


Ala 


Gin 


Glu 


Asn 


Thr 


Val 


Pro 


Ala 


Thr 


Gly 


Gin 


Leu 


Pro 








20 










25 








30 






Ala 


Lys 


Asn 


Val 


Ala 


Phe 


Ala 


Arg 


Asn 


Lys 


Ala 


Gly 


Ser 


Asn 


Trp 


Phe 






35 










40 










45 








Val 


Thr 


Leu 


Gin 


Gly 


Gly 


Val 


Ala 


Ala 


Gin 


Phe 


Leu 


Asn 


Asp 


Asn 


Asn 




50 










55 










60 










Asn 


Lys 


Asp 


Leu 


Met 


Asp 


Arg 


Leu 


Gly 


Ala 


He 


Gly 


Ser 


Leu 


Ser 


Val 


65 










70 










75 










80 


Gly 


Lys 


Tyr 


His 


Ser 


Pro 


Phe 


Phe 


Ala 


Thr 


Arg 


Leu 


Gin 


He 


Asn 


Gly 










85 










90 










95 




Gly 


Gin 


Ala 


His 


Thr 


Phe 


Leu 


Gly 


Lys 


Asn 


Gly 


Glu 


Gin 


Glu 


He 


Asn 








100 










105 










110 






Thr 


Asn 


Phe 


Gly 


Ala 


Ala 


His 


Phe 


Asp 


Phe 


Met 


Phe 


Asp 


Val 


Val 


Asn 






115 










120 










125 








Tyr 


Phe 


Ala 


Pro 


Tyr 




Glu 


Asn 


Arg 


Phe 


Phe 


His 


Leu 


He 


Pro 


Trp 




130 










135 










140 








Val 


Gly 


Val 


Gly 


Tyr 


Gin 


His 


Lys 


Phe 


He 


Gly 


Ser 


Glu 


Trp 


Ser 


Lys 


1^5 










150 










155 










160 


Asp 


Asn 


Val 


Glu 


Ser 


Leu 


Thr 


Ala 


Asn 


Val 


Gly 


Val 


Met 


Met 


Ala 


Phe 










165 










170 










175 




Arg 


Leu 


Gly 


Lys 


Arg 


Val 


Asp 


Phe 


Val 


He 


Glu 


Ala 


Gin 


Ala 


Ala 


His 








180 










185 










190 






Ser 


Asn 


Leu 


Asn 


Leu 


Ser 


Arg 


Ala 


Tyr 


Asn 


Ala 


Lys 


Lys 


Thr 


Pro 


Val 






195 










200 










2 05 








Phe 


Glu 


Asp 


Pro 


Ala 


Gly 


Arg 


Tyr 


Tyr 


Asn 


Gly 


Phe 


Gin 


Gly 


Met 


Ala 




210 










215 










220 










Thr 


Ala 


Gly 


Leu 


Asn 


Phe 


Arg 


Leu 


Gly 


Ala 


Val 


Gly 


Phe 


Asn 


Ala 


He 


22 5 










230 










235 










240 


UnK 


Pro 


Met 


Asp 


Tyr 


Ala 


Leu 


lie 


Asn 


Asp 


Leu 


Asn 


Gly 


Gin 


He 


Asn 










2^5 










250 










255 




Arg 


Leu 


Arg 


Ser 


Glu 


Val 


Glu 


Glu 


Leu 


Ser 


Lys 


Arg 


Pro 


Val 


Ser 


Cys 








260 










265 










270 






Pro 


Glu 




Pro 


Glu 


Val 


Thr 


Pro 


Val 


Thr 


Lys 


Thr 


Glu 


Asn 


He 


Leu 






275 










280 










285 








Thr 


Glu 


Lys 


Ala 


Val 


Leu 


Phe 


Arg 


Phe 


Asp 


Ser 


His 


Val 


Val 


Asp 


Lys 




290 










295 










300 






Asp 


Gin 


Leu 


li- 


Asn 


Leu 


Tyr 


Asp 


Val 


Ala 


Gin 


Phe 


Val 


Lys 


Glu 


Thr 


305 










310 










315 








320 


Asn 


Glu 


Pro 


ne 


Thr 


Val 


Val 


Gly 


Tyr 


Ala 


Asp 


Pro 


Thr 


Gly 


Asn 


Thr 










325 










330 










335 




Gin 


Tyr 


As n 


Glu 


Lys 


Leu 


Ser 


Glu 


Arg 


Arg 


Ala 


Lys 


Ala 


Val 


Val 


Asp 








34 0 










345 










350 







SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



377/490 



Val Leu Thr Gly Lys Tyr Gly Val Pro Ser Glu Leu He Ser Val Glu 

355 360 365 

Trp Lys Gly Asp Ser Thr Gin Pro Phe Ser Lys Lys Ala Trp Asn Arg 
370 375 380 

5 Val Val He Val Arg Ser Lys 
385 390 

{2) IIJFORMATIOM FOR SEQ ID HO: 438 

10 (i) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 385 amino acids 

( B ) TYPE : amino acid 
( D) TOPOLOGY : linear 

15 (ii) liOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIHAL SOURCE: 
20 (A) ORGAMISri: Porphyromonas gingival is 

(i::) FEATURE: 

(A) Ii;^JlE/KEY: mi sc_f ea tu r e 
{B) LOCATIOIJ 1 ... 385 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(xi) SEQUEHCE DESCRI PTIO! I : SEQ ID 110:438 



Met 


Thr 


Tyr 


Arg 


He 


Met 


Lys 


Ala 


Lys 


Ser 


Leu 


Leu 


Leu 


Ala 


Leu 


Ala 


1 








5 










10 










15 




Gly 


Leu 


Ala 


Cys 


Thr 


Phe 


Ser 


Ala 


Thr 


Ala 


Gin 


Glu 


Ala 


Thr 


Thr 


Gin 








20 










25 










30 






Asn 


Lys 


Ala 


Gly 


Met 


His 


Thr 


Ala 


Phe 


Gin 


Arg 


Asp 


Lys 


Ala 


Ser 


Asp 






35 










40 










45 








His 


Trp 


Phe 


He 


Asp 


He 


Ala 


Gly 


Gl y 


Ala 


Gly 


Met 


Ala 


Leu 


Ser 


Gly 




50 










55 










60 










Trp 


Asn 


Asn 


Asp 


Val 


Asp 


Phe 


Val 


Asp 


Arg 


Leu 


Ser 


He 


Val 


Pro 


Thr 












70 










75 










80 


Phe 


Gly 


He 


Gly 


Lys 


Trp 


His 


Glu 


Pro 


Tyr 


Phe 


Gly 


Thr 


Arg 


Leu 


Gin 










05 










90 










95 




Phe 


Thr 


Gly 


Phe 


Asp 


He 


Tyr 


Gly 


Phe 


Pro 


Gin 


Gly 


Ser 


Lys 


Glu 


Arg 








100 










105 










110 






Asn 


His 


Asn 


Tyr 


Phe 


Gly 


Asn 


Ala 


His 


Leu 


Asp 


Phe 


Met 


Phe 


Asp 


Leu 






115 










120 










125 








Thr 


Asn 


Tyr 


Phe 


Gly 


Val 


Tyr 


Arg 


Pro 


Asn 


Arg 


Val 


Phe 


His 


He 


He 




130 










135 










140 










Pro 


Trp 


Ala 


Gly 


He 


Gly 


Phe 


Gly 


Tyr 


Lys 


Phe 


His 


Ser 


Glu 


Asn 


Ala 


1^15 










150 










155 










160 


Asn 


Gly 


Glu 


Lys 


Val 


Gly 


Ser 


Lys 


Asp 


Asp 


Met 


Thr 


Gly 


Thr 


Val 


Asn 










165 










170 










175 




Val 


'■3ly 


Leu 


Met: 


Leu 


Lys 


Phe 


Arg 


Leu 


Ser 


Arg 


Val 


Val 


Asp 


Phe 


Asn 








180 










185 










190 






He 


Glu 


Gly 


Gin 


Ala 


Phe 


Ala 


Gly 


Lys 


Met 


Asn 


Phe 


He 


Gly 


Thr 


Lys 






195 










200 










205 








Arg 


Gly 


Lys 


Ala 


Asp 


Phe 


Pro 


Val 


Met 


Ala 


Thr 


Ala 


Gly 


Leu 


Thr 


Phe 




210 










215 










220 










Asn 


Leu 


Gly 


Lys 


Thr 


Glu 


Trp 


Thr 


Glu 


He 


Val 


Pro 


Met 


Asp 


Tyr 


Ala 


225 










230 










235 










240 


Leu 


Val 


Asn 


Asp 


Leu 


Asn 


Asn 


Gin 


He 


Asn 


Ser 


Leu 


Arg 


Gly 


Gin 


Va 1 










24 5 










250 










255 




Glu 


Glu 


Leu 


Ser 


Arg 


Arg 


Pro 


Val 


Ser 


Cys 


Pro 


Glu 


Cys 


Pro 


Glu 


Pro 








260 










265 










270 






Thr 


Gin 


Pro 


Thr 


Val 


Thr 


Arg 


Val 


Val 


Val 


Asp 


Asn 


Val 


Va 1 


Tyr 


Phe 






275 










280 










285 








Arg 


He 


Asn 


Ser 


Ala 


Lys 


He 


Asp 


Arg 


Asn 


Gin 


Glu 


He 


Asn 


Val 


Tyr 




290 










295 










300 










Asn 


Thr 


Ala 


Glu 


Tyr 


Ala 


Lys 


Thr 


Asn 


Asn 


Ala 


Pro 


He 


Lys 


Val 


Val 


305 










310 










315 










32 0 


Gly 


Tyr 


Ala 


Asp 


Glu 


Lys 


Thr 


Gly 


Thr 


Ala 


Ala 


Tyr 


Asn 


Met 


Lys 


Leu 










325 










330 










335 




Ser 


Glu 


Arg 


Arg 


Ala 


Lys 


Ala 


Val 


Ala 




Met 


Leu 


Glu 


Lys 


Tyr 


Gly 








340 










345 










350 






Val 


Ser 


Ala 


Asp 


Arg 


He 


Thr 


He 


Glu 


Trp 


Ly- 


Gly 


Ser 


Ser 


Gl u 


Gin 






365 










360 










365 








He 


Tyr 


Glu 


Glu 


Asn 


Ala 


Trp 


Asn 


Arg 


He 


Val 


Val 


Met 


Thr 


Ala 


Ala 




370 










375 










380 











SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 



PCT/AU98/01023 



378/490 



10 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



Glu 
385 

(2) ItlFORMATION FOR SEQ ID NO: 4 39 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LEllGTH: 190 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



15 (vi) ORIGIilAL SOURCE: 

(A) ORGAMISJl: Porphyromona^ gingival i^ 



Ux) FEATURE: 

(A) IIAIIE/KEY: misc_feature 
( B) LOCATIOII 1 ... 190 

(xl) SEQUENCE DESCRIPTIOU: SEQ ID NO: 439 



[let 


Glu 


Phe 


Fhe 


Met 


Leu 


Fhe 


He 


Ala 


Ala 


Val 


Fhe 


Val 


Asn 


Asn 


Val 


1 








5 










10 










15 




Val 


L-u 


Ser 


Gin 
20 


Phe 


Leu 


Gly 


He 


Cys 
25 


Pro 


Phe 


Leu 


Gly 


Val 
30 


Ser 


Lys 


Lys 


Val 


A^p 


Thr 


Ser 


He 


Gly 


Met 


Gly Ala 


Ala 


Val 


Thr 


Phe 


Val 


Leu 


Ala 




35 










40 










45 








Leu 


Ala 


Thr 


Leu 


Val 


Thr 


Phe 


Leu 


He 


Gin 


Lys 


Phe 


Val 


Leu 


Asp 




50 










55 










60 








Arg 


Fhe 


Gly 


Leu 


Gly 


Phe 


Met 


Gin 


Thr 


He 


Ala 


Phe 


He 


Leu 


Val 


He 


65 










70 










75 








80 


Ala 


Ala 


Leu 


Val 


Gin 
85 


Met 


Val 


Glu 


He 


He 
90 


Leu 


Lys 


Lys 


Val 


Ser 

95 


Pro 


Pro 


Leu 


Tyr 


Gin 


Ala 


Leu 


Gly 


Val 


Phe 


Leu 


Pro 


Leu 


He 


Thr 


Thr 


Asn 








100 










105 










110 




Cys 


Cys 


Val 


Leu 


Gly 


Val 


Ala 


He 


Leu 


Val 


He 


Gin 


Lys 


Asp 


Tyr 


Thr 






115 










120 










125 




Leu 


Leu 


Gin 


Ser 


Phe 


Val 


Tyr 


Ala 


li- 


Ser 


Thr 


Ala 


He 


Gly 


Phe 


Thr 




130 










135 










140 








Leu 


Ala 


Met 


Val 


Thr 


Phe 


Ala 


Gly 


ne 


Arg 


Glu 


Gin 


Leu 


Asp 


Met 


Thr 


1 4 5 










150 










155 








160 


Asn 


Leu 


Pro 


Lys 


Ala 


Met 


Lys 


Gly 


He 


Pro 


Ser 


Ala 


Leu 


Leu 


Ala 


Ala 


Gly 








165 










170 










175 




lie 


Leu 


Ala 


Ilet 


Ala 


Phe 


Met 


Gly 


Phe 


Ser 


Gly 


He 


Ala 












100 










135 








190 







{ 2 ) T I J FORI lAT I OU FOR SEQ ID MO : 4 4 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: IBS amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION i . . .186 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: ^40 

Met Leu Phe He Ala Ala Val Phe Val Asn Asn Val Val Leu Ser Gin 

1 S 10 15 

Phe Leu Gly He Cys Pro Phe Leu Gly Val Ser Lys Lys Val Asp Thr 

20 25 30 

Ser He Gly IJet Gly Ala Ala Val Thr Phe Val Leu Ala Leu Ala Thr 

35 40 45 

Leu Val Thr Phe Leu He Gin Lys Phe Val Leu Asp Arg Phe Gly Leu 



75 
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50 






55 










60 












Gl V 

VJ J. i 


Phe 


Met Gin Thr 


He 


Ala 


Phe 


He 


Lsu 


Val 


He 


Ala 


Ala 


Leu 


Val 




65 






70 










75 










80 




Gin 


Met 


Val Glu He 


He 


Leu 


Lys 


Lys 


Val 


Ser 


Pro 


Pro 


Leu 


Tyr 


Gin 


5 






85 










90 










95 






Ala 


Leu 


Gly Val Phe 


Leu 


Pro 


Leu 


He 


Thr 


Thr 


Asn 


Cys 


Cys 


Val 


Leu 








100 








105 










110 








Gly 


Val 


Ala He Leu 


Val 


He 


Gin 


Lys 


Asp 


Tyr 


Thr 


Leu 


Leu 


Gin 


Ser 








lis 






120 










125 








10 


Phe 


Val 


Tyr Ala He 


Ser 


Thr 


Al a 


He 


Gly 


Phe 


Thr 


Leu 


Ala 


Met 


Val 






130 






135 










140 












Thr 


Phe 


Ala Gly He 


Arg 


Glu 


Gin 


Leu 


Asp 


Met 


Thr 


Asn 


Leu 


Pro 


Lys 




1^5 






150 










155 










160 




Ala 


Met 


Lys Gly He 


Pro 


Ser 


Ala 


Leu 


Leu 


Ala 


Ala 


Gly 


He 


Leu 


Ala 


Id 






165 










170 










17 5 






Met 


Ala 


Phe net Gly 


Phe 


Ser 


Gly 


He 


Ala 




















180 








105 


















(2) 


IN FORMAT I OI) FOR 


SEQ 


ID NO: 441 
















on 
































(i) 


SEQUENCE CHARACTERISTICS: 






















(A) LEMGTH: 833 amino 


acids 




















(B) TYPE: 


amino acid 
























(D) TOPOLOGY: 


1 inear 


















25 


































(ii} 


MOLECULE Tj 


'PE: 


protein 




















(iii) 


HYPOTHETICAL: YES 




















30 




(vi) 


ORIGINAL SOURCE: 


























(A) ORGANISM: 


Porphyromonas gingivalis 












(ix) 


FEATURE : 
























35 






(A) tIAl>iE/KEY: 


misc feature 


















( B) LOCATION 1 . . . 833 






















(xi) 


SEQUENCE DESCRIPTION: SEQ ID HO: 441 












Met 


Lys 


Gin Leu Asn 


He 


He 


Ser 


Phe 


He 


He 


Ala 


Phe 


Leu 


Phe 


Leu 


40 


1 


5 










10 










15 






Gly 


Thr 


Ser Ala Ser 


Ala 


Gin 


Gin 


Ser 


Gly 


Gly 


Ser 


Val 


Thr 


Gly 


Thr 








20 








25 










30 








Val 


Val 


Asp Lys Ser 


Ser 


Lys 


Glu 


Pro 


He 


Ala 




Val 


Gin 


Val 


Phe 


45 






35 






40 










45 








Val 


Lys 


Gly Thr Thr 


Leu 


Gly 


Thr 


Ser 


Thr 


Asp 


Ala 


Asn 


Gly 


Asn 


Tyr 






50 






55 










60 












Ser 


He 


Lys Gly He 


Pro 


Ser 


Gly 


Asn 


Gin 


Thr 


He 


Val 


Ala 


Arg 


Leu 




65 






7 0 










75 










80 


50 


Met 


Gly 


Tyr Ser Thr 


J - 


Glu 


Glu 


Lys 


Val 


His 


He 


Glu 


Lys 


Gly 


Gly 






85 










90 










95 






Ser 


Arg 


His Val Asp 


Leu 


Tyr 


Leu 


Thr 


Glu 


Glu 


He 


Leu 


Ser 


Leu 


Asp 








100 








105 










110 








Gly 


Val 


Val Val Ser 


Ala 


Asn 


Arg 


Asn 


Glu 


Thr 


Phe 


Arg 


Arg 


Gin 


Ala 


55 






115 






120 










125 








Pro 


Ser 


Leu Val Thr 


Val 


Leu 


Ser 


Pro 


Glu 


Leu 


Phe 


Leu 


Lys 


Thr 


Asn 






130 






135 










140 












Ser 


Thr 


Asn Leu Ser 


Gin 


Gly 


Leu 


Lys 


Phe 


Gin 


Pro 


Gly 


Leu 


Arg 


Val 




145 






150 










155 










160 


60 


Glu 


Asp 


Asn Cys Gin 


Asn 


Cys 


Gly 


Phe 


Asn 


Gin 


Val 


Arg 


He 


Asn 


Gly 






165 










170 










175 






Leu 


Glu 


Gly Ala Tyr 


Ser 


Gin 


He 


Leu 


He 


Asp 


Ser 


His 


Pro 


He 


Phe 








180 








185 










190 








Ser 


Sei: 


Leu Ala Gly 


Val 


Tyr 


Gly 


Leu 


Glu 


Gin 


Met 


Pro 


Ala 


Asn 


Met 


65 






195 






200 










205 








He 


Glu 


Arg Val Glu 


Val 


He 


Arg 


Gly 


Gly 


Gly 


Ser 


Ala 


Leu 


Phe 


Gly 






210 






215 










220 












Ser 


Asn 


Ala Val Gly 


Gly 


Val 


He 


Asn 


Val 


He 


Thr 


Lys 


Glu 


Pro 


Leu 




225 






230 










235 










240 


70 


Arg 


Asn 


Ser Ala Glu 


He 


Ser 


His 


Ser 


Thr 


Met 


Thr 


Phe 


Asp 


His 


Ala 






245 










250 










255 






Lys 


Gly Trp Gly Ser 


Phe 


Gin 


Asn 


Thr 


Thr 


Gin 


Phe 


Asn 


Gly 


Ser 


Met 








260 








265 










270 








Leu 


Thr 


Glu Asp Arg 


Lys 


Ala 


Gly 


Val 


Met 


Val 


Phe 


Gly 


Gin 


His 


Asn 


75 






275 






280 










285 








Tyr 


Arg 


Pro Gly Gin 


Asp 


He 


Asp 


Gly 


Asp 


Asn 


Phe 


Thr 


Glu 


Leu 


Pro 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCtD: <WO 9929870A1_I_> 



wo 99/29870 PCT/AU98/01023 

380/ 490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 





290 










2 95 










300 












Leu 


Arg 


Asn 


Arg 


Ser 


Leu 


Gly 


Phe 


Arg 


Ser 


Tyr 


Tyr 


Lys 


Thr 


Gly 


305 










310 










315 










320 


Leu 


Tyr 


Ser 


Lys 


Ala 


Thr 


Leu 


Glu 


Tyr 


His 


Ser 


Met 


Gin 


Glu 


Tyr 


Arg 










325 








330 










335 




Arg 


Gly 


Gly 


A^p 


Arg 


Leu 


Asp 


Asn 


Pro 


Pro 


Phe 


Glu 


Ala 


Gin 


He 


Ala 








340 










345 










350 






Glu 


Tyr 


Leu 


Gin 


His 


Tyr 


He 


Asn 


Gly 


Gly 


Ser 


Phe 


Lys 


Phe 


Asp 


Gin 






355 










360 










365 








Giy 


Phe 


Ser 


Gly 


Gly 


Lys 


Asp 


Phe 


Phe 


Ser 


Leu 


Tyr 


Ala 


Ser 


Ala 


Gin 




37 0 










375 










380 










Asp 


Val 


Gl n 


Arg 


Arg 


Ser 


Tyr 


Tyr 


Gly 


Gly 


Gly 


Asp 


Tyr 


Thr 


Glu 


Asn 


385 










390 










395 










400 


Leu 


Leu 


As-n 


Gly 


Ala 


Val 


Gin 


Ser 


Gly 


Ser 


Thr 


Glu 


Ser 


Asp 


Glu 


Tyr 










4 05 










4 1 0 










415 




A^n 


A^p 


Ala 


Phe 


Thr 


Ala 


Leu 


Thr 


Ser 


Tyr 


Gly 


Thr 


Thr 


Ly5 


Gly 


Phe 








4 20 










425 










430 






Arp 


Leu 


Gin 


Gly 


Gly 


Gly 


1-Jet 


Tyr 


Arg 


His 


Th r 


Phe 


Gly 


Glu 


Asn 


Trp 






435 










440 










4 45 








Aep 


Phe 


Thr 


Gly 


Gly 


Leu 


Glu 


Tyr 


He 


Tyr 


Gly 


Gin 


Leu 


Asp 


Asp 


Arg 




450 










4 55 










4 60 










Ser 


Gly 


Tyr 


Arg 


Pro 


Ser 


Lys 


He 


Asp 


Gl n 


.Asn 


Thr 


Ser 


Thr 


Phe 


Ser 












4 70 










4"5 










4 80 


Gin 


Tyr 


A-p 


Gl n 


Leu 


Glu 


Tyr 


Lys 


Ttir 


Glu 


Lys 


Leu 


Ser 


Ala 


Leu 


He 










4 85 










4 90 










4 95 




Gly 


Ala 


Arg 


He 


Asp 


Tyr 


Val 


Leu 


Leu 


Asn 


Gin 


Asp 


Gly 


Lys 


Arg 


Tyr 








500 










505 










510 






He 


Asp 


Pro 


Leu 


Phe 


He 


Phe 


Ser 


Pro 


Arg 


Al a 


Asn 


Val 


Arg 


Tyr 


Asn 






515 










520 










525 








Pro 


Asn 


Lys 


Asn 


Leu 


Ser 


Phe 


Arg 


Leu 


Ser 


Tyr 


Ser 


Glu 


Gly 


Phe 


Arg 




530 










535 










540 










Ala 


Pro 


Gin 


Tyr 


Phe 


Asp 


Glu 


Asp 


Leu 


His 


Val 


Glu 


Leu 


Ala 


Gly 


Gly 












550 










555 










560 


Thr 


Pro 


He 


Ser 


Arg 


Val 


Leu 


Ser 


Pro 


Asn 


Leu 


Ly^ 


Glu 


Glu 


Arg 


Ser 










565 










57 0 










575 




Arg 


Ser 


He 


Sec 


Ala 


Ser 


Phe 


Asp 


Tyr 


Tyr 


His 


Arg 


Ala 


Asp 


Glu 


Trp 








580 










585 










590 






Gin 


Phe 


As n 


He 


Met 


Gly 


Glu 


Ala 


Phe 


Ser 


Thr 


Phe 


He 


Ser 


Asn 


Gin 






595 










600 










605 








Phe 


Lys 


Pro 


Ser 


Asp 


Lys 


Val 


Glu 


Thr 


Thr 


Ser 


Asp 


Gly 


Lys 


Glu 


Trp 




610 










615 










620 










He 


He 


Arg 


Thr 


He 


Tyr 


Asn 


Asp 


Lys 


Asp 


Gly 


Val 


Ser 


Lys 


Val 


Tyr 


62 5 










630 










635 










640 


Gly 


Val 


Asn 


Leu 


Glu 


Gly 


Arg 


He 


Ala 


Tyr 


Asn 


Ly5 


Ser 


Phe 


Asp 


Leu 










645 










650 










655 




Gin 


Leu 


Gly 


Gly 


Thr 


Trp 


Gin 


Arg 


Ser 


Arg 


Tyr 


Gly 


Ser 


He 


Tyr 


Thr 








660 










665 










670 






Ala 


Val 


Glu 


Ala 


Asp 


Lys 


Thr 


Thr 


Gly 


Gin 


Al a 


Glu 


He 


Ser 


Val 


Lys 






67 5 










680 










685 








Asp 


Tyr 


Val 


Arg 


Thr 


Pro 


Asn 


Leu 


Tyr 


Gly 


Tyr 


Phe 


Val 


Ala 


Thr 


Va 1 




690 










6 95 










7 00 










Arg 


Pro 


Thr 


Glu 


His 


Phe 


Ala 


lie 


Asn 


Leu 


Ser 


Gly 


Thr 


Phe 


Thr 


Gly 


706 










710 










715 










720 


Lys 


net 


Asp 


Val 


Val 


His 


Glu 


Ala 


Tyr 


Glu 


Gly 


Asp 


He 


Pro 


Ala 


Glu 










725 










730 










735 




His 


He 


Ala 


Fro 


Asp 


Gly 


Ser 


Phe 


Asp 


Phe 


Glu 


Met 


Asn 


Gly 


Gin 


Gin 








7^0 










745 










750 






E-'he 


Lyr 


Gly 


Leu 


Ala 


Glu 


Gly 


His 


Ala 


Lys 


Leu 


Va 1 


Lys 


Thr 


Pro 


Ala 






755 










760 










7 65 








The 


Ala 


Asp 


He 


Asp 


Leu 


Lys 


Leu 


Ser 


His 


Asp 


Phe 


His 


Leu 


Ala 


Ser 




770 










775 










730 










Thr 


Met 


Thr 


Leu 


Gl u 


Leu 


Asn 


Ala 


Gly 


He 


Gl n 


Asn 


He 


Phe 


Asn 


Ser 


785 










7 90 








7 95 










800 


Tyr 


Gin 


Lys 


Asp 


Thr 


Asp 


Lys 


Gly 


Pro 


Gly 


Arg 


Ala 


Ser 


Thr 


Tyr 


Val 










8 05 










810 










815 




Tyr 


Gly 


Pro 


Met 


Gin 


Pro 


Arg 


Arg 


He 


Phe 


Va 1 


Gly 


Thr 


Lys 


He 


Asn 








820 










825 










830 






Phe 

































(2) TIJF'ORMATIOM FOR SEQ ID NO: 4 42 



(i) SEQUEHCE CHARACTERISTICS: 

(A) LEIIGTH: 891 amino acids 
75 (B) TYPE; amino acid 



BNSDOCtD: <WO 9929870A1_I_> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



PCT/AU98/01023 



381/ 490 

(D) TOPOLOGY: linear 

(ii) MOLECULE TVPE: protein 

5 (iiil HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAIJISM: Porphyrornorias gingivalis 

10 (ix) FEATURE: 

(A) IIAIIE/KEY: misc_feature 
IB) LOCATIOIl 1 ... 891 



15 



SEQUENCE DESCRI PTIOtI : SEQ ID HO: 442 



net Tyr Ly^ Ly^ lie lie Ala Val Ala Ala Leu Phe Cys Ala Ser lie 
15 10 15 

Gly lie Leu Lys Gly Gin Ser Ser A.-p Leu Thr Pro Gin Asp Thr He 
20 2S 30 

20 Tyr Ser Pro Glu lie Ser Tyr Ala hys Pro He Hi^ Lys Thr He Aln 
35 40 45 

Ser He Glu He Glu Gly liet Arg Ser Phe Asp Asp Phe Val Leu Arg 

50 55 60 

Asn Leu Ser Gly Leu Ala Val Gly Asp Glu Val Leu He Pro Gly Asp 
25 65 "7 0 7 5 80 

Ala net Ser Ala 7^1 a Val Asn Arg He Met Arg Gin Gly Tyr Phe Ser 

8 5 9 0 ? 5 

Asn Val Arg He He Ala Asp Lys T'yr Val Gly Asn Lys Val Tyr Leu 
100 105 HO 

30 Lys He He Val Thr Glu Arg Pro Arg He Ser Lys Val Thr Phe Ser 
115 120 125 

Gly Val Lys Lys Ser Glu Arg Glu Asp Leu Glu Met Lys He Gly Leu 

130 135 140 

Arg Glu Gly He Gin Met Thr Arg Asn Asn Glu Asp Lys Val Arg Gin 
35 145 150 155 160 

He Val Gin Lys Tyr Phe Ser Glu Lys Gly Tyr Arg Asp Ala Ser He 

165 170 175 

Arg He Thr Gin Glu Pro Asp Leu Ser Lys Asp Gly Phe Val Asn Val 
180 185 190 

40 Leu He Ser He Glu Lys Lys Ser Lys Thr Lys Val Asn Glu He Tyr 
195 200 205 

Phe Ser Gly Asn Lys Ala Leu Ser Asn His Lys Leu Arg Met Ala Met 

210 215 220 

Lys Asn Thr Asn Ala Lys Phe Ser Leu Arg Lys His He Arg Ser Ser 
45 225 230 235 240 

Phe Leu Lys Leu Phe Ser Thr His Lys Phe Val Glu Glu Ser Tyr Arg 

245 250 255 

Glu Asp Leu Val Arg Leu He Glu Lys Tyr Gin Glu Tyr Gly Tyr Arg 
260 265 27 0 

50 Asp Ala Glu He Leu Thr Asp Ser Val Val Lys Ala Pro Asp Gly Lys 
275 280 285 

Arg Val Asp He Tyr Leu Asn He Glu Glu Gly Gin Lys Tyr Tyr He 

290 295 300 

Lys Asp Val Asn Phe Val Gly Asn Ser Gin Tyr Pro Ser Glu Tyr Leu 
55 305 310 315 320 

Glu Arg Val Leu Gly He Lys Ser Gly Asp Val Tyr Asn Gin Arg Arg 

325 330 335 

Leu Ala Lys Arg Leu Asn Glu Asp Glu Asp Ala Val Gly Asn Leu Tyr 
340 345 350 

60 Tyr Asn Asn Gly Tyr He Phe Ala Trp Val Asp Pro Val Glu Thr Asn 
355 360 365 

Val Val Gly Asp Ser Val Ser Leu Asp He Arg He Ala Glu Gly Lys 

370 375 380 

Gin Ala Asn He Asn Lys Val He He Lys Gly Asn Thr Val Val Tyr 
65 385 390 395 400 

Glu Asp Val Val Arg Arg Glu Leu Tyr Thr Lys Pro Gly Gin Leu Phe 

405 410 415 

Ser Arg Glu Asp He He Asn Ser He Arg Leu He Asn Gin Leu Gly 
4 20 4 25 4 30 

70 His Phe Asp Ala Glu Lys Ser He Pro Arg Pro He Pro Asn Pro Glu 
4 35 4 40 4 45 

Thr Gly Thr Val Asp He Glu Tyr Asp Leu Val Pro Arg Ser Ser Asp 

450 455 4G0 

Gin Leu Glu Leu Ser Val Gly Trp Ser Gin Ser Gly Leu Leu Phe Arg 
75 465 470 475 480 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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55 



60 



Gly 


Ala 


He 


Lys 


L'he 


Thr 


Asn 


Phe 


Ser 


Val 


Gly 


Asn 


Leu 


Leu 


His 


Pro 










485 










490 










4 95 




Ser 


Met 


Tyr 


Lys 


Lys 


Gly 


He 


He 


Pro 


Gin 


Gly 


Asp 


Gly 


Gin 


Thr 


Leu 








500 










505 










510 






Ser 


Leu 


Ser 


Ala 


Gin 


Thr 


Asn 


Gly 


Lys 


Tyr 


Tyr 


Gin 


Gin 


Tyr 


Ser 


Val 






515 










520 










525 








Thr 


Phe 


I'let 


Asp 


Pro 


Trp 


Phe 


Gly 


Gly 


Lys 


Arg 


Pro 


Asp 


Met 


Phe 


Ser 




530 










535 










540 










Phe 


Ser 


Ala 


Phe 


Tyr 


Ser 


Lys 


Thr 


Thr 


Ala 


He 


Asp 


Ser 


Lys 


Phe 


Tyr 












550 










555 










560 


Asn 


Ser 


Asn 


Ala 


Gly 


Asn 


Tyr 


Tyr 


Asn 


Ala 


Tyr 


Tyr 


Asn 


Ser 


Tyr 


Tyr 










565 










57 0 










575 




Asn 


Asn 


Tyr 


Asn 


Ser 


Tyr 


Tyr 


Asn 


Gly 


Met 


Ser 


Asn 


Tyr 


Thr 


Gly 


Asp 








50 0 










535 










590 






L-u 


Tyr 


Thr 


Gin 


Ala 


Ser 


Asp 


Pro 


Asp 


Arg 


Ser 


Leu 


Gin 


Met 


Leu 


Gly 






595 










600 










605 








Thr 


Ser 


He 


Gly 


Tyr 


Gly 


Lys 


Arg 


Leu 


Thr 


Trp 


Pro 


Asp 


Asn 


Trp 


Phe 




610 










615 










620 










Gin 


He 


Tyr 


Tlir 


Ser 


Leu 


Asn 


Tyr 


Thr 


Tyr 


Tyr 


Arg 


Leu 


Arg 


Asn 


Trp 


625 










630 










635 










640 


Ser 


Tyr 


Asn 


Thr 


Phe 


Gin 


Asn 


Phe 


His 


His 


Gly 


Ser 


Ala 


Asn 


Asp 


Leu 










64 5 










65 0 










655 




Asn 


Leu 


Glu 


Leu 


Arg 


Leu 


Ser 


Arg 


Thr 


Ser 


He 


Asp 


Asn 


Pro 


He 


Tyr 








660 










665 










67 0 






Thr 


Arg 


Ser 


Gly 


Ser 


Asp 


Phe 


Met 


Val 


Ser 


Val 


Ala 


Ala 


Thr 


Leu 


Pro 






675 










680 










685 








Tyr 


Ser 


Leu 


Trp 


Asp 


Asn 


His 


Asp 


Tyr 


Al a 


Ser 


Gin 


Asn 


Leu 


Ser 


Val 




6 90 










695 










7 00 










Ser 


Asp 


Arg 


Tyr 


Arg 


Phe 


He 


Glu 


Tyr 


His 


Lys 


Trp 


Lys 


Phe 


Arg 


Gly 


705 










710 










715 










720 


Arg 


Val 


Phe 


Thr 


Fro 


Leu 


Leu 


Asn 


Pro 


Ala 


Thr 


His 


Ly^^ 


Tyr 


Thr 


Pro 










725 










730 










735 




Val 


Leu 


Met 


Ser 


Arg 


Val 


Glu 


Gly 


Ala 


Val 


Leu 


Gly 


Ser 


Tyr 


Asn 


Ser 








14 0 










745 










750 






Asn 


Lys 


Ly5 


Ser 


Pro 


Phe 


Gly 


Thr 


Phe 


Tyr 


Met 


Gly 


Gly Asp 


Gly 


Met 






755 










760 










765 








Ser 


Ser 


Tyr 


Tyr 


Gly 


Gly 


Tyr 


Met 


Asn 


Glu 


Thr 


He 


Gly 


Leu 


Arg 


Gly 




770 










775 










7 80 










Tyr 


Lys 


Asn 


Gly 


Ser 


He 


Ala 


Gly 


Asn 


Asn 


Tyr 


Asp 


Tyr 


Ala 


Tyr 


Ala 


785 










790 










7 95 










800 


Tyr 


Met 


Arg 


Leu 


Thr 


Met 


Glu 


Leu 


Arg 


Phe 


Pro 


He 


Leu 


Phe 


Glu 


Asn 










805 










810 










815 




Ser 


Phe 


Asn 


Ala 


Trp 


Leu 


Leu 


Ala 


Phe 


Ala 


Glu 


Ala 


Gly 


Asn 


Ala 


Trp 








820 










825 










830 






Arg 


Ser 


He 


Asp 


Asn 


Tyr 


Asn 


Pro 


Phe 


A.sn 


Leu 


Lys 


Arg 


Ser 


Ala 


Gly 






835 










840 










845 








Val 


Gly 


Leu 


Arg 


Val 


Thr 


Leu 


Pro 


Met 


Val 


Gly 


Me t 


Leu 


Gly 


He 


Asp 




850 










855 










860 










Trp 


Gly 


Tyr 


Gly 


Phe 


Asp 


Arg 


Pro 


Asp 


Asn 


Ser 


Leu 


Gin 


Arg 


Gly 


Gly 


865 










870 










875 










68 0 


Ser 


Asn 


Val 


His 


Phe 


Val 


Leu 


Gly 


Gin 


Glu. 


Phe 













10 



15 



20 



25 



30 



35 



40 



45 



50 

Ser Asn Val His 

885 890 

(2) lUrORHATIOH FOR SEQ ID NO: 443 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH: 174 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) noLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

65 (vi) ORIGItfAL SOURCE: 

(A) ORGAMISM: Porphyromonas gingiva lis 

(ix) FEATURE: 

{A) MAI IE/ KEY: mi sc_ feature 
70 (B) LOCATION l..,17 4 

(Mi} SEQUENCE DESCRIPTION: SEQ ID NO: 44 3 

Met Asn Gly Asp Met Lys Arg Phe Leu He Leu He Gly Phe Ala Leu 
75 1 5 10 15 



BNSDOCID: <WO 9929870A1J_> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



PCT/AU98/01023 



383/490 



10 



15 



20 



30 



35 



Ala 


Val 


Ala 


Phe 
20 


Ser 


Gly 


Phe 


Ser 


Gin 
25 


Lys 


Phe 


Ala 


Leu 


Val 
30 


Asp 


Met 


Glu 


Tyr 


li- 
as 


Leu 


Arg 


Asn 


He 


Pro 
40 


Asp 


Tyr 


Glu 


Met 


Met 
45 


Asn 


Glu 


Gin 


Leu 


Glu 

50 


Gin 


Val 


Ser 


Lys 


Lys 
55 


Trp 


Gin 


Asn 


Glu 


He 
60 


Glu 


Ala 


Leu 


Glu 


J\sn 


Glu 


Ala 


Gin 


Ser 


Met 


Tyr 


Lys 


Lys 


Tyr 


Gin 


Ser 


Asp 


Leu 


Val 


Phe 


65 










70 










75 










80 


Leu 


Ser 


Ala 


Ala 


Gin 


Lys 


Lys 


Thr 


Gin 


Glu 


Glu 


Ala 


He 


Val 


Lys 


Lys 










85 






90 










95 




Glu 


Gin 


Gin 


Ala 
100 


Ser 


Glu 


Leu 


Lys 


Arg 
105 




Tyr 


Phe 


Gly 


Pro 
110 


Glu 


Gly 


Glu 


Leu 


Tyr 


Lys 


Lys 


Arg 


Ser 


Asp 


Leu 


(let 


Lys 


Pro 


He 


Gin 


Asp 


Glu 






115 








120 










125 








lie 


Trp 
130 


Asn 


Ala 


He 


Lys 


Glu 
135 


He 


Ala 


Lys 


Arg 


Asn 
140 


Asn 


Tyr 


Gin 


Met 


Val 


Leu 


Asp 


Arg 


Gly 


Thr 


Ser 


Gly 


He 


He 


Phe 


Ala 


Ser 


Pro 


Ser 


He 


145 






150 










155 










160 


Asp 


lie 


Ser 


Asp 


Leu 
165 


Val 


Leu 


Ser 


Lys 


net 

170 


Gly 


Phe 


Ser 


Lys 







[2) II I FORI lAT I Oil FOR SEQ ID MO: 44 4 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 170 amino acici = 

(B) TYPE: amino acid 
( D ) TO POLOG Y : linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIllAL SOURCE: 

(A) ORGAllISM: Porphyromonas gingiva lis 

(ix) FEATURE: 

(A) MAI IE/ KEY: misc_f eature 
( B) LOCATIOti 1 ... 170 



40 (xi) SEQUEHCE DESCRIPTION: SEQ ID MO: 444 





Met 


Lys 


Arg 


Phe 


Leu 


He 


Leu 


He 


Gly 


Phe 


Ala 


Leu 


Ala 


Val 


Ala 


Phe 




1 




5 










10 










15 






Ser 


Gly 


Phe 


Ser 


Gin 


Lys 


Phe 


Ala 


Leu 


Val 


Asp 


Met 


Glu 


Tyr 


He 


Leu 


45 






20 










25 










30 








Arg 


Asn 


li- 


Pro 


Asp 


Tyr 


Glu 


Met 


Met 


As[i 


Glu 


Gin 


Leu 


Glu 


Gin 


Val 






as 










4 0 










45 










Ser 


Lys 
50 


Lys 


Trp 


Gin 


Asn 


Glu 
55 


He 


Glu 


Ala 


Leu 


Glu 

60 


Asn 


Glu 


Ala 


Gin 


50 


Ser 


Met 


Tyr 


Lys 


Lys 


Tyr 


Gin 


Ser 


Asp 


Leu 


Val 


Phe 


Leu 


Ser 


Ala 


Ala 




65 








70 










75 










80 




Gl n 


Lys 


Lys 


Thr 


Gin 


Glu 


Glu 


Ala 


He 


Val 


Lys 


Lys 


Glu 


Gin 


Gin 


Ala 










85 










90 










95 






Ser 


Glu 


Leu 


Lys 


Arg 


Lys 


Tyr 


Phe 


Gly 


Pro 


Glu 


Gly 


Glu 


Leu 


Tyr 


Lys 


55 








100 










105 










110 








Lys 


Arg 


Ser 
115 


Asp 


Leu 


Met 


Lys 


Pro 
120 


He 


Gin 


Asp 


Glu 


He 
125 


Trp 


Asn 


Ala 




He 


Lys 
130 


Glu 


He 


Ala 


Lys 


Arg 
135 


Asn 


Asn 


Tyr 


Gin 


Met 

140 


Va] 


Leu 


Asp 


Arg 


60 


Gly 


Thr 


Ser 


Gly 


He 


He 


Phe 


Ala 


Ser 


Pro 


Ser 


He 


Asp 


He 


Ser 


Asp 




145 








150 










155 










160 




Leu 


Val 


Leu 


Ser 


Lys 
165 


Met 


Gly 


Phe 


Ser 


Lys 
170 















65 (2) IIIFORMATION FOR SEQ ID NO: 445 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 
70 ( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

75 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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50 



55 



60 



65 



70 



75 



(vi) ORIGillAL SOURCE: 

(A) ORGAIIISM: Porphyromona s gingival is 

(ix) FEATURE: 

(A) MA] IE/ KEY: misc_feature 

(B) LOCATIon 1 , . . 163 

(xi) SEQUENCE DESCRIPTIOIJ : SEQ ID NO: 445 



10 



15 



20 



25 



30 



(2) INFORMATION FOR SEQ ID NO: 44 6 

35 (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 82 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (ii) t40LECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGillAL SOURCE: 
45 (A) ORGAWISII: Porphyromonas gingiva lis 

(i::) FEATURE: 

(A) lIAI^iE/KEY: misc_feature 

(B) LOCATION 1 . . . 827 



net 


Lys 


Ly^ 


Phe 


Phe 


Leu 


Met 


Leu 


Leu 


Met 


Ala 


Leu 


Pro 


Leu 


Ser 


Leu 


1 








5 










10 










15 




Leu 


Ala 


Gin 


Lys 


Val 


Ala 


Val 


Val 


Asn 


Thr 


Glu 


Glu 


He 


He 


Ser 


Lys 








20 










25 










30 






f let 


Pro 


Gl u 


Gin 


Val 


Ala 


Ala 


Thr 


hys 


Gin 


Leu 


Asn 


Glu 


Leu 


Ala 


Glu 






35 










40 










45 








hys 


Tyr 


Arg 


Leu 


Asp 


Leu 


Lys 


Ser 


Met 


Asp 


Asp 


Glu 


Phe 


Ala 


Lys 


Lys 




50 










5 5 










60 








Thr 


Glu 


Glu 


Phe 


Val 


Lys 


Glu 


Lys 


Asp 


Ser 


Leu 


Leu 


Glu 


Asn 


He 


Arg 


65 










70 










75 










eo' 


Asn 


Arg 


Arg 


Gin 


Gin 


Glu 


Leu 


Gin 


hsp 


lie 


Gin 


Thr 


Arg 


Tyr 


Gin 


Gin 










85 










90 










95 




Sei- 


Tyr 


Gin 


Thr 


net 


Gin 


Glu 


A^p 


Leu 


Gin 


Lys 


Arg 


Gin 


Gin 


Gin 


Leu 








100 










105 










110 






Phe 


Ala 


Fro 


He 


Gl n 


Gin 


Ly.<r 


Va 1 


Ala 


Asp 


Ala 


He 


Lys 


Lys 


Val 


Gly 






115 










1 2 0 










125 






J-iSp 


Glu 


Glu 


A-n 


r- -c- 
- 7 *- 


Ala 


Tyr 


He 


Met 


Glu 


Ala 


Gl y 


1 let 


Met 


Leu 


Tyr 




130 










135 










14 0 










Thr 


Gly 


Ala 


Thr 


Ala 


He 


A^p 


Leu 


Thr 


Ala 


Lys 


Val 


Lys 


Ala 


Lys 


Leu 


145 










150 










155 








160 


Gly 


He 


Ly- 





























(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 6 



Met 


Lys 


Glu 


Ala 


He 


Pro 


Arg 


Lys 


Asn 


Lys 


Tyr 


He 


Lys 


Leu 


Asn 


Gly 


1 








5 










10 










15 


He 


Tyr 


Arg 


Leu 

20 


Ser 


Phe 


He 


Leu 


Leu 
25 


Cys 


Cys 


Leu 


Leu 


Cys 
30 


Ser 


Gin 


Ala 


Ala 


Met 


Ala 


Gin 


Gly 


Val 


Arg 


Val 


Ser 


Gly 


Tyr 


Val 


Leu 


Asp 


Arg 






35 










40 










45 






Gly 


Glu 


Lys 


Pro 


He 


Pro 


Phe 


Ala 


Gly 


Val 


Lys 


Val 


Arg 


Gly 


Thr 


Gly 




50 










55 










60 








Thr 


Gly 


Ala 


Thr 


Thr 


Asn 


Leu 


Lys 


Gly 


Tyr 


Tyr 


Glu 


Phe 


Arg 


Met 


Lys 


65 










70 








75 








8 0 


Ala 


Thr 


Thr 


Asp 


Ser 


He 


Thr 


He 


Glu 


Phe 


Ser 


Ser 


I !et 


Gly 


Tyr 


Gin 










85 










90 








95 




Gly 


Val 


Ser 


Arg 
100 


Ser 


Phe 


Pro 


Ser 


Leu 
105 


Thr 


Lys 


Asp 


Thr 


Arg 
110 


Leu 


Asn 


Val 


Arg 


Leu 
115 


Ala 


Glu 


Ala 


Glu 


1 let 

120 


Glu 


Leu 


Ser 


Ser 


Val 
125 


Thr 


Val 


Gin 


Ala 


Thr 


Lys 


Arg 


Arg 


Leu 


Asn 


Thr 


Met 


Gl u 


Arg 


Val 


Asn 


Thr 


Arg 


Asp 




130 










135 










1^0 








Leu 


Arg 


Val 


Asn 


Ala 


Gly 


Pro 


Thr 


Gly 


Gly 


Val 


Glu 


Ser 


Leu 


He 


Ser 


145 










150 










155 










160 


Thr 


Tyr 


Ala 


Gly 


Val 
165 


Thr 


Gin 


Asn 


Asn 


Glu 
170 


Leu 


Ser 


Ser 


Gin 


Tyr 
175 


Ser 


Val 


Arg 


Gly 


Gly 


Ser 


Tyr 


Asp 


Glu 


Asn 


Met 


Val 


Tyr 


Val 


Asn 


Gly 


Val 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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100 185 190 

Glu Val Tyr Arg Pro Leu Leu V£kl Arg Ser Ala Glti Gin Glu Gly Leu 

195 200 205 

Ser Phfe Vai Asn Pro Asp Leu Thr Gin Ser Val Gin Phe Ser Ala Gly 
5 210 215 220 

Gly Phe Thr Ala Asp Tyr Gly Asp Lys Het Ser Ser Val Leu Asp lie 
225 230 235 240 

Ara Tyr Lys Gin Pro Gin Glu Lys Glu Gly Ala Vai Leu Leu Gly Met 
2^5 ' 250 255 

10 Leu Gin Ser Ser Ala Tyr Tyr Gly Ser Ser Ala Gly Ala Phe Ser Gin 
2 60 2 65 27 0 

lie Thr Gly Val Arg Tyr Lys Ser Ala Lys Ser Leu Leu Gly Thr Thr 

275 280 285 

Asp Thr Lys Ala Glu Tyr Asp Pro lie Tyr Ala Asp Gly Gin Thr Phe 
15 290 295 300 

Met Thr Tyr Ara Phe Ser Pro Lys Leu Ser Val Ser Phe Leu Gly Asn 
305 ' 310 315 320 

He Ser Gin Thr Arg Tyr Lys Fhe Val Pro Gin Thr Arg Glu Thr Ser 
325 330 335 

20 Phe Gly Thr Leu Ser Asp Ala Lys Lys Leu Lys He Phe Phe Asp Gly 
3.;0 3^5 350 

Gin Glu Gin Asp Arg Phe Leu Thr Tyr Phe Gly Ala Phe Ser Met Asn 

355 360 365 

Phie Val Pro Asp Asp Lys Gin Arg His Thr Val Thr Leu Ser Ala Phie 
25 370 375 380 

Asn Ser Asn Glu Arg Glu Thr Tyr Asp He Gin Gly Glu Tyr Phe Leu 
385 390 3 95 4 00 

Asn Asp Val Gin Leu Gly Ala Asp Gly Thr Ala Ser Met Ala Ser Gly 
40b 410 415 

30 Ser Glu Asn Ser Asn Gly Leu Gly He Gly Arg Asn His Glu His Ala 
420 425 430 

Arg Asn Arg Leu Ser Tyr Arg Val Leu Asn I let Gly Tyr Arg Gly Glu 

4 35 440 4 45 

Met Lys Leu Asn Glu Lys His Arg Leu Gin Ala Gly Val Ser Ala Gin 
35 450 455 460 

Met Glu Lys He Ala Asp His He Ser Glu Trp Glu Arg Arg Asp Ser 
465 470 475 480 

Val Gly Tyr Asn Leu Pro His Ser Glu Thr Val Leu Leu Met Tyr Asn 
485 490 495 

40 Asn Leu Tyr Ala Asp Thr Gin Met Arg Gly Thr Arg Leu Ser Ala Phe 
500 505 510 

Val Gin Asp Arg Phe Asn Phe Ser I let Gly Gly Gly Thr Phe Ser Leu 

515 520 525 

He Pro Gly He Arg Ala Ser Trp Trp Ser Phe Asn Lys Glu Leu Leu 
45 530 " 535 540 

Val Ser Pro Arc He Ser Val Gly Tyr Ser Pro Glu Ser Asn Pro Ala 
545 " 550 555 560 

Leu Val Leu Arg Ala Ala Ala Giy Leu Tyr Tyr Gin Ala Pro Phe Tyr 
5 65 57 0 57 5 

50 Lys Glu Leu Ara Gin Thr His Lys Asp Ala Glu Gly Asn Asn Val Val 
580 585 590 

Val Leu Asn Glu Lys He Arg Ser Gin Gly Ala Phe His He Leu Ala 

595 600 605 

Gly Ala Asp Tyr Thr Phe Glu Het Gly Gly Arg Lys Tyr Lys Fhe Thr 
55 610 615 620 

Ala Glu Ala Tyr Tyr Lys Ser Leu Phe Asn He Asn Pro Tyr He He 
625 630 635 640 

Glu Asn Val Lys He Arg Tyr Leu Gly Glu Asn He Gly Ser Gly Tyr 
64 5 650 65 5 

60 Ala Ala Gly He Asp Leu Lys Leu Phe Gly Glu Leu Val Pro Gly Val 
660 665 670 

Asp Ser Trp Leu Thr Ala Ser He He Lys Ala Arg Gin Lys Leu Asp 

675 6e0 685 

Gly Tyr Gly Ser Leu Pro Leu Met Asn Ala Pro Thr Tyr Asn Phe Ser 
65 690 695 700 

Phe Phe Leu Gin Glu Tyr Val Pro Gly Asn Lys Arg He Thr Ala Thr 
7 05 710 715 7 20 

Leu Arg Ala Ala Leu Ser Gly Gly Leu Pro Gin Leu Asn Pro Ser Lys 
7 25 7 30 7 35 

70 Gly Leu Ser Ser Pro Ala Phe Thr Ala Pro Ala Tyr Lys Arg Val Asp 
740 745 750 

Leu Gly Val Met Tyr Lys Trp Leu Asp Pro Asp Asp Ser Phe Ala Gly 

755 760 765 

Arg Ser Lys Trp Leu Met Gly Val Ly^ Gly Ala Tyr He Gly Ala Asp 
75 77 0 775 780 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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Leu Phe Asn Leu ['he Acp Met Thr Asn Val Asn Ser Tyr Tyr Trp Val 
785 790 795 800 

Ser Asp Ala Tyr Gin Gin Gin Tyr Ala Val Pro A^n Tyr Leu Thr Arg 

805 810 815 

5 Arg Gin Phe Asn Leu Arg Leu Leu Val Glu Phe 
820 825 

(2) llIPORlIATIOtI FOR SEQ ID 1^0:4 47 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 67 2 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: protein 

(iii) HVPOTHETICAL: YES 

(vi) ORIGIMAL SOURCE: 
20 (A) ORGANISM: Porphyromonas gingiva lis 

(ix) FEATURE: 

( A ) 1 JAI IE/ KEY : mi s c_f ea t u r e 

(B) LOCATION 1 ... 672 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(::i) SEQUENCE DESCRIPTION: SEQ ID MO: 4^7 



I let 


Tyr 


Ser 


Gly 


Hi s 


His 


Lys 


He 


His 


Tyr 


Pro 


Phe 


Leu 


He 


Leu 


Leu 


.1 


















1 C* 










1 5 




\' D ] 


C , - 


Leu 


n 1 -1 
Al a 


Phe 


Al a 


Al a 


Cys 


Lj^s 


Ser 


Va 1 


Lys 


Leu 


Lys 


Asp 


Al a 








20 










25 










30 






Glu 


Lyr 


Al d 


His 


Asp 


Arg 


Gl n 


Gl u 


Tyr 


Thr 


Lys 


Al a 


Al a 


Asp 


Met 


Tyr 






"J c 
Ji ^ 










4 0 










4 5 








Afc" n 


Thr 


Leu 


Tyr 


Al g 


Arg 


Thr 


Arg 


Arg 


Lys 


Gin 


Val 


Glu 


Me t 


Lys 


Al a 




R n 
o u 




















6 0 










i 1 


Th r 


Ala 


Phe 


Arg 


S'^^r 


Liiy 


Lii u 


Asn 


Tyr 


Arg 


Mia 


Mr a 


Gly 


Arg 


Gin 


65 










70 










75 










80 


Ala 


Lys 


Ala 


Leu 


Arg 


Gly 


Tyr 


Leu 


Asn 


Ala 


Arg 


Arg 


Tyr 


Gly 


Tyr 


Pro 










85 










90 










95 




Asp 


Ser 


Val 


Val 


Leu 


Leu 


Arg 


Leu 


Ala 


Gin 


Thr 


Tyr 


Gin 


Gin 


Gly 


Gly 








100 










105 










110 






Asn 


Tyr 


Lys 


Glu 


Ala 


Glu 


Val 


Leu 


Phe 


Arg 


Gly 


Tyr 


Leu 


Glu 


Ala 


Tyr 






115 










120 










125 








Pro 


Lys 


Ser 


Tyr 


Phe 


Ala 


Ala 


He 


Gly 


Leu 


Glu 


Gly 


Cys 


Leu 


Phe 


Ala 




130 










135 










140 










Arg 


Gin 


Gin 


Lys 


Glu 


Tyr 


Pro 


Thr 


Arg 


Tyr 


Arg 


He 


Arg 


Arg 


Ala 


Ala 


145 










150 










155 










160 


Glu 


Trp 


Asn 


Ser 


Ala 


Arg 


Gl y 


Asp 


Phe 


Gly 


Pro 


Ala 


Tyr 


Ala 


Pro 


Asp 










165 










17 0 










175 




Ala 


Ser 


Ala 


Leu 


Tyr 


Phe 


Thr 


Ser 


Ser 


Arg 


Ser 


Lys 


Asp 


Asp 


Gly 


Leu 








180 










185 










190 






Asp 


Asn 


Ser 


Ser 


He 


Thr 


Gly 


Leu 


Lys 


Pro 


Asn 


Asp 


He 


Tyr 


He 


He 






195 










200 










205 










Arg 


Asp 


Ala 


Gin 


Gly 


Arg 


Trp 


Gly 


Arg 


Pro 


Asp 


Ser 


Val 


Ser 


Gly 




210 










215 










220 










Gly 


lie 


Asn 


Thr 


Pro 


Trp 


Asp 


Glu 


Gly 


Val 


Pro 


Thr 


He 


Thr 


Pro 


Asp 


225 










230 










235 










240 


Gly 


Ser 


Thr 


lie 


Tyr 


Tyr 


Thr 


Leu 


Ala 


Gin 


Gin 


Gly 


Ala 


Asp 


Tyr 


Asp 










245 










2 50 










255 




Arg 


Thr 


Val 


Gin 


He 


Tyr 


Ser 


Ala 


Ala 


Arg 


Ser 


Gly 


Glu 


Gly 


Gly 


Trp 








260 










265 










270 






Ser 


Asn 


Gly 


Ser 


Leu 


Val 


Asp 


He 


Met 


Arg 


Asp 


Ser 


Leu 


Arg 


Met 


Ala 






275 










280 










285 








Ala 


His 


Pro 


Ser 


Met 


Ser 


Ala 


Ser 


Gly 


Asp 


Tyr 


Leu 


Tyr 


Phe 


Val 


Ser 




290 










295 










300 










Asn 


He 


Gly 


Gly 


Ser 


Tyr 


Gly 


Gly 


Lys 


Asp 


He 


Tyr 


Arg 


Val 


Lys 


Val 


305 










310 










315 










32 0 


Ser 


Asp 


Arg 


Ser 


Tyr 


Gly 


Ser 


Pro 


Glu 


Asn 


Leu 


Gly 


Pro 


Asp 


He 


Asn 










325 










330 










335 




Tlir 


Pro 


Gly 


Asp 


Glu 


Met 


Phe 


Pro 


Phe 


He 


Asp 


Gly 


Asp 


Ser 


Thr 


Leu 








340 










345 








350 






Phe 


Phe 


Ala 


Ser 


Asp 


Gly 


His 


Ala 


Gly 


Leu 


Gly 


Gly 


Leu 


Asp 


He 


Phe 






355 










360 










365 








Lys 


Ala 


Thr 


Leu 


Asp 


Ser 


Thr 


Gly 


Gin 


Trp 


His 


Val 


Val 


Asn 


Met 


Gly 




37 0 










37 5 








380 
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Gin 


Pro 


Val 


Asn 


Ser 


Ser 


Ala 


Asp 


Asp 


Phe 


Gly 


Leu 


Ala 


Val 


Glu 


Pro 




385 










390 










395 










4 00 




Lys 


Gly 


Lys 


Asn 


Lys 


Glu 


Giu 


Ala 


Leu 


Pro 


Asp 


Asn 


Gly 


Val 


Lys 


Gly 








*105 










410 










415 




5 


Val 


Phe 


Cys 


Ser 


Asn 


Arg 


Gly Asp 


Ala 


Arg 


Gly 


Trp 


Pro 


His 


Leu 


Phe 










420 










425 










430 








His 


Phe 


Glu 
435 


Leu 


Pro 


Ala 


He 


Tyr 
440 


Thr 


Glu 


He 


Gin 


Gly 
445 


Tyr 


Val 


Met 




Asp 


Arg 


Glu 


Glu 
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(2) IIIFORI lATIOH FOR SEQ ID MO: 448 

40 (i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 7 08 amino acids 

( B) TYPE: amino acid 
( D) TOPOLOGY : linear 

45 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORlGItlAL SOURCE: 
50 (A) ORG A! II SM: Porphyromonas gingival is 

(iM) FEATURE: 

(A) HAME/KEY: niisc_f eature 

(B) LOCATIOIl 1 . . .7 08 



55 



60 



65 



70 
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(xi) SEQUENCE DESCRIPTIOU: SEQ ID lIO:448 
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(2) lllFORMATIOM FOR SEQ ID 110:449 
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{j) SEQUEIICE CHARACTERISTICS: 

(A) LEDGTH : A 6Z amino acid^ 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

10 (vi) ORIGINAL SOURCE: 

(A) ORGADISIl: Porphyromonas gingivali^ 

(ix) FEATURE: 

(A) NAIIE/KEY: misc_feature 
15 (B) LOCATIOII 1. . .462 

{::i) SEQUEIICE DESCRIFTIOM: SEQ ID MO: 4^9 

Met Asn Arci Phe Ser hsn His Trp Pro Cy^ lie Leu Val Gly Phe Val 
20 1 5 10 15 

Leu Trp Phe Val Ser Ala Ser Arg Thr Val Ala Gin ?\sn Ala Ser Glu 

20 ^5 30 

Thr Thr Val Ser Tyr Asp Thr Asp Thr Ala Val Leu Ser Qlu Ala Asp 
35 40 45 

25 Val Leu Arg He Ala Leu Ser Glu Asn Ala Thr Val Lys Val Ala Asp 
SO ' SS 60 

Met Asp Val Arg Lys Gin Glu Tyr Ala Arg Arg Ala A.la Arg Ala Asp 
65 "^O 75 80 

Leu Fhe Pro Lys Val Asp Leu Asn Gly Val Tyr Ser His Thr Leu Lys 
30 85 90 95 

Lys Gin Val Leu Tyr He Asp Met Pro Gly Phe Ser Ser Ser Glu Gly 

100 105 110 

He Glu Met Gly Arg Thr His Asn Thr Gin Gly Gly Val Asn Val Ser 
115 120 125 

35 Met Pro Leu Val Ser Ala Gin Leu Trp Lys Ser He Ala Met Thr uly 
130 135 140 

Glu Gin Leu Asp Leu Ala Leu Glu Lys Ala Arg Ser Ser Arg He Asp 
145 150 155 160 

Leu Val Ala Glu Val Lys Lys Ala Tyr Leu Ser Val Leu Leu Ala Glu 
40 165 170 175 

Asp Ser Tyr Gly Val Phe Lys Arg Ser Tyr Asp Asn Ala Leu Ala Asn 

180 185 190 

Tyr Lys Asn He Ser Asp Lys Phe Asp Arg Gly Leu Val Ala Glu Tyr 
195 200 205 

45 Asp Lys He Arg Ala Asn Val Gin Val Arg Asn He Glu Pro Asn Leu 
210 215 220 

Leu Gin Ala Gin Asn Ser Val Ala Leu Ala Leu Trp Gin Leu Lys Val 
225 230 235 2\0 

Leu i-iet Ser Met Glu Val Glu Thr Pro He Arg Leu Ser Gly Ser Leu 
50 245 250 255 

Ser Asp Tyr Lys Glu Gin Val Tyr Thr Gly Tyr Phe Ala Ala Asp Thr 

260 265 270 

Leu He Ser Asn Asn Ser Ser Leu Arg Gin Leu Asp He Gin Arg Arg 
275 280 285 

55 Leu Ala Val Ser Ala Asp Lys Leu Asn Lys Tyr Ser Phe Leu Pro Thr 
290 295 300 

Leu Asn Leu Gly Gly Gin Tyr Thr Tyr Ser Leu Asn Ser Asn Asp He 
305 310 315 320 

Lys Pile Trp Gly Glu Gly Gin Arg Trp Thr Pro Phe Ser Thr He Ser 
60 325 330 335 

Leu Ser Leu Tyr He Pro He Phe Asn Gly Gly Lys Arg Leu Tyr Asn 

340 345 350 

Val Lys Gin Ser Ala Leu Ser He Arg Gin He Asp Leu Gin Arg Arg 
355 360 365 

65 His He Glu Gin Ser He Arg Net Gly He Lys Asn Gin Asn Asp Arg 
370 375 390 

Leu Arg Thr Cys Met Gin Arg Phe Val Ala Ser Glu Glu Ala Val Arg 
385 " 390 395 400 

Ser Ala Glu Lys Gly Tyr Gin He Ala Glu Lys Arg Tyr Gin Thr Gly 
70 4 05 410 415 

Glu Gly Thr Leu Val Glu Leu Asn Asp Ala Asp Val Ala Leu Leu Gin 

420 425 430 

Ala Arg Leu Asn Tyr Asn Gin Ala He Phe Asp Phe Met Thr Ala Lys 
435 440 445 

75 Ala Glu Leu Asp Lys Met Asn Gly Met Gly He Pro Glu Gin 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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450 455 4 60 

(2) IIIFORIIATIOM FOR SEQ ID MO: 4 50 

5 (i) SEQUENCE CHARACTERISTICS : 

(A) LEtlGTH: 4 92 amino acids 

(B) TYPE: amino acid 
(D) TOFOLOGV: linear 

10 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIIIAL SOURCE: 
15 {A) ORGAMISM: Porphyromonas gingivali^ 

(ix) FEATURE: 

{A) MTMIE/KEY: misc_feature 
(B) LOCATION 1 . . .492 
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30 
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70 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 50 
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30 



35 



40 



45 



50 



55 



60 



65 
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420 425 430 

Leu Phe A?p His Val Ser Gly Arg Trp Ser Ala Pro Val Val Val Asn 

435 4A0 445 

Ser Gly Ser Ser Ala Tyr Ser Asp Met Thr Leu Leu Ala Asp Gly Thr 

4 50 4 55 4 60 

He Gly Tyr Phe Val Glu Giu Gly Asp Glu He Ser Leu Val Phe He 
465 470 475 480 

Ara Phe V^ll Leu Asp Asp Leu Phe Asp Val Arg Gin 
485 490 

(2) IMFORlIATIOl! FOR SEQ ID MO: 451 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH: 24 5 amino acids 
15 (B) TYPE: amino acid 

{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

20 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAMISII: Porphyromonas gingivalis 

25 (ix) FEATURE: 

(A) IJAI IE/KEY: misc_f eature 
(E) LOCATION 1 . . .24 5 



(::i) SEQUENCE DESCRIPTION: SEQ ID NO: 4 51 



Met 


Lys 


Lys 


Gl u 


Lys 


Leu 


Trp 


He 


Ala 


He 


Val 


Ala 


Gly Leu 


Ala 


Phe 


1 








5 










10 










15 




Val 


Leu 


Gly 


Leu 


Tyr 


Ala 


Leu 


Gly 


Arg 


Ser 


Val 


Ala 


Gin 


Leu 


Arg 


Arg 








2 0 










25 










30 






Ser 


Gin 


Pro 


Ser 


Val 


Thr 


Val 


Thr 


Gly 


net 


Ala 


Gl u 


Arg 


Asn 


Phe 


Lys 






35 










40 










45 








Ser 


Asp 


Leu 


He 


Val 


Trp 


Thr 


Ala 


Ser 


Tyr 


Gin 


Leu 


Gin 


Met 


Met 


Asp 




50 










55 










60 










Leu 


Glu 


Ser 


Ala 


Tyr 


Lys 


Ala 


Leu 


Lys 


Glu 


Lys 


Gin 


He 


Leu 


Val 


Ala 


65 










70 










75 










80 


Asp 


Tyr 


Leu 


Lys 


Asn 


Lys 


Gin 


Leu 


Pro 


Asp 


Ser 


Ser 


Tyr 


He 


Phe 


Ser 










85 










90 










95 




Se r 


Val 


Ala 


He 


Ser 


Lys 


Glu 


Tyr 


Asn 


Tyr 


Tyr 


Tyr 


Asp 


Pro 


Arg 


Gin 








100 










105 










110 






Glu 


Gin 


Asn 


Val 


Arg 


Thr 


Phe 


Ala 


Gly 


Tyr 


Leu 


Leu 


Ser 


Gin 


Thr 


Val 






115 










120 










125 








Thr 


Val 


Thr 


Se r 


Gin 


Asp 


He 


Glu 


His 


Val 


Glu 


Lys 


He 


Ser 


Arg 


Asp 




130 










135 










140 










He 


Thr 


Glu 


Leu 


He 


Asn 


Gin 


Gly 


Val 


Glu 


He 


Thr 


Ser 


Asp 


Arg 


Pro 


145 










150 










155 










160 


Ala 


Tyr 


Tyr 


Tyr 


Thr 


Lys 


Leu 


Asn 


Asp 


Leu 


Lys 


Val 


Glu 


Met 


Leu 


Arg 










165 










170 










175 




Asn 


Ala 


Ser 


Glu 


Asp 


Ala 


Phe 


Asn 


Arg 


Ala 


Ser 


Val 


He 


Ala 


Glu 


Gly 








180 










185 










1 90 






Ser 


Gly 


Ser 


Ser 


Val 


Gly 


Lys 


llet 


Leu 


Ser 


Ser 


Ser 


Met 


Gl y 


Val 


Phe 






195 










200 










205 








Gin 


He 


Val 


Gly 


Leu 


Asn 


Ser 


Asn 


Glu 


Asp 


Tyr 


Ser 


Trp 


Gly 


Gly 


Ser 




210 










215 










22 0 










Phe 


Asn 


Thr 


Ser 


Ser 


Lys 


Met 


Lys 


Thr 


Ala 


Ser 


He 


Thr 


Vil 


Lys 


Ala 


22 5 










230 










235 










24 0 


Ser 


Pho 


Ala 


Leu 


Lys 
































245 
























(2) 


IN FORI lATION 


FOR 


SEQ 


ID NO: 452 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 amino acids 
(D) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 
75 (vi) ORIGINAL SOURCE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929a70A1_l_> 



wo 99/29870 



PCT/AU98/01023 



392/490 



(A) ORGAIIISM: Po rphyromonas gingiva li^ 

(ix) FEATURE: 

(A) MAI IE/ KEY: mi ^c_f eat ure 
5 (B) LOCATIOJl 1 ... 276 

SEQUENCE DESCRIPTION: 3EQ ID NO: 452 

I let Lys Lys Thr He Ala He He Ala Ser Ala Leu Leu Ala Leu Gly 
10 1 5 10 15 

Ala Val Gly Cys Lys Lys Asn Ala Asp Thr Thr Ala Val Ser Glu Lys 

2 0 2 5 30 

As-p Ser He Ala Leu Ser Met Gly He Leu Tyr Gly Gin Asp Phe Ala 
35 40 45 

15 Asn Gin Phe Glu I let Ser Arg Leu Gin Gly Gin Pro He Asp Ser Val 
50 55 60 

Ala Phe Leu Asp Gly Phe Lys Tyr Gly He Asp Thr Thr Arg Phe Ser 
65 7 0 7 5 80 

Tyr Asn Leu Gly Ala He Tyr Ala Ser Asn He Ala Arg Gin Leu Ala 
20 85 90 95 

Mis Asp Ser He Asp He Asp Lys Phe Tyr Ala Ala Met Arg Ala Ala 

100 105 110 

Leu Leu Lys Asp Thr Val Ser He Ala I let Lys Pro Ala Asp Ala Gin 
115 120 125 

25 Ala Phe I let Gin Arg lie Gin Ala Lys Lys Gin Arg Glu Asn Asn Met 
130 135 140 

Lys Gin Phe Gly Gin Asn He Glu Lys Gly Asn Glu Tyr He Asp Thr 
IAS 150 155 160 

Phe Lys Lys Glu Asp Gly Val Thr Val Thr Thr Thr Gly Leu Ala Tyr 
30 165 17u 175 

Lys Thr Leu Gin Glu Gly Thr Gly Ala Thr Pro Ser Leu Ala Asp Thr 

180 185 190 

Val Arg Val Lys Tyr Val Gly Thr Leu Val Asp Gly Lys Glu Phe Asp 
195 200 205 

35 Lys Asn Glu Glu Gly He Glu Phe Ala Val Thr Gly Val He Lys Gly 
210 215 22 0 

Trp Thr Glu Met Leu Gin Leu Met Lys Val Gly Gin Lys Val Arg Val 
225 230 235 240 

Val He Pro Gin Glu Leu Ala Tyr Gly Glu Thr Gly Asn Tyr Thr He 
40 245 250 255 

Glu Pro Phe Ser Thr Leu Thr Phe Glu Met Glu Leu He Gly He Lys 

260 265 270 

Pro Gly Lys Lys 
275 



45 



65 



70 



(2) INFORMATION FOR SEQ ID NO: 453 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 amino acids 
50 (E) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 

55 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

60 (ix) FEATURE: 

(A) NAllE/KEY: mi sc_^ f eat u re 

(B) LOCATION 1 . . .77 5 



(::H SEQUENCE DESCRIPTION: SEQ ID 110:453 



Met 


Lys 


Val 


Leu 


Arg 


Gin 


Val 


Phe 


Leu 


Pro 


He 


Leu 


Phe 


Val 


Leu 


Leu 










5 










10 










15 




Thr 




Ala 


Cys 
20 


Ser 


Thr 


Thr 


Lys 


Asn 
25 


Leu 


Pro 


Glu 


Gly 


Glu 
30 


Gin 


Leu 


Tyr 


H e 


Gly 

35 


Met 


Gly 


Lys 


Thr 


Gin 
40 


He 


Leu 


Arg 


Gin 


Asp 
4 5 


Lys 


Ser 


His 


Ala 


Gly 


Gin 


Gin 


Ala 


Leu 


Thr 


Glu 


Val 


Glu 


Ser 


Thr 


Leu 


Lys 


Val 


Thr 




50 










55 










60 








Pro 


Asn 


Gly 


Ala 


He 


Phe 


Gly 


Ser 


Ala 


Ser 


Ala 


Ser 


Leu 


Pro 


Lys 


He 



75 65 70 



75 80 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID; <WO 9929670A1 J_> 



• 




wo 99/29870 



PCT/AU98/01023 



393/ 490 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



Pro Phe Gly Leu Trp Leu Tyr Asn Ser Phe Val Gly Asp Ser Thr Val 

85 90 ?5 

He Ser Lyc Trp He Fhe Asp Lys Phe Ala Ala Lys Pro Val Phe He 

100 105 110 

Ser Gin Val Ly^ Ser Asp Ser Arg Ala Lys Val Ala Thr Asn He Leu 

lis 120 125 

Arg Glu His Gly Tyr Phe Asp Ala Lys Val Lys Ser Ser Val Thr Thr 

130 135 140 

Leu Lys Lys Asp Ser Leu Lys Ala Lys He Ser Tyr Thr Val Asp Met 
1.15 150 155 160 

Ala Ser Pro Tyr His Tyr Asp Ser He He Pro Leu Pro He Ser Thr 

165 170 1*75 

Phe Pro Asp Ser He Leu Ala Tyr Arg Gin Thr Pro Ser Leu He Arg 

180 185 190 

Lys Gly Asp Gin Phe Asn Leu Ala Lys Leu His Giu Glu Arg Gin Thr 

195 200 205 

He Ser Ala Leu Leu Arg Asp Asn Gly Tyr Tyr Tyr Phe Arg Pro Gin 

210 215 220 

Asp He He Tyr Glu Ala Asp Thr Leu Leu Val Arg Gly Ala Val Cys 
225 230 235 240 

Leu Ara Ala Lys Leu Ser Glu Asp Thr Pro Pro Gin Ala llet Arg Pro 

245 250 255 

Trp Arg He Gly Lys Arg Thr Ala Val Leu Leu Gly Met Asn Gly Glu 

260 265 270 

Ser Pro Thr J\sp Ser Leu Glu Val Glu Asp ! let Lys Val Leu Tyr Tyr 

275 280 285 

Arg Lys llet Pro Val Arg Pro Lys He Leu Ala Lys Arg Phe Arg Fhe 

290 295 300 

Phe Ser Gly Asn Leu Tyr Arg Gin Lys Asp Asp Glu Thr Thr Arg Lys 
305 310 315 320 

Ser Leu Ala Arg Leu Gly Ala Phe Ser Val lie Asp Leu Asn Phe Leu 

325 330 335 

Gin Arg Asp Ser He Ser Gly Leu Leu Asp Val Arg Leu Leu Thr Thr 

340 345 350 

Leu Asp Lys Pro Trp Asp Ala Ser Leu Glu Thr Leu Phe Thr Ser Lys 

355 360 365 

Ser Asn Asp Fhe He Gly Pro Gly Leu Asn Phe Ala Leu Ala Arg Arg 

370 375 380 

Asn Val Phe Gly Gly Gly Glu Asn Leu Ser Trp Asn He Gly Gly Ser 
385 390 395 4 00 

Tyr Glu Trp Glu Thr Gly Asn Arg Pro Glu Asn Ser Ser Asn Arg Leu 

^05 410 415 

He Asp He Asn Ser Tyr Asn Met Asn Thr Ala Val Asn Leu Ser Phe 

420 425 430 

Pro Ser He Val Phe Pro Gly Leu Leu Asp Lys Tyr Tyr Tyr Tyr Pro 

4 35 44 0 4 45 

Thr Thr Thr Thr Phe Gin Ala Ser Ala Thr Ala Leu Asn Arg Ala His 

4 50 4 55 4 60 

Tyr Phe Ser Het Tyr Ser Phe Gly Phe Ser Thr Thr Tyr Glu Phe Gin 
4 65 47 0 4 75 4 SO 

Pro Ser Lys Giu His Arg His Ala He Phe Pro Leu Lys Leu Asn Tyr 

485 490 495 

Asn Leu Leu Gly His Gin Thr Glu Thr Phe Gin Ala He Thr Ala Asn 

500 505 510 

Asn Pro Pro Leu Leu Leu Ser Leu Gin Ser Gin Phe Leu Ala Gin Met 

515 520 525 

Gly Tyr He Tyr Thr Phe Asn Lys Ser Val Ser Glu Lys Ser Pro His 

530 535 540 

His Leu Trp Met Gin Phe Gly Leu Ser Glu Ala Gly Asn Leu Leu Asn 
545 550 555 560 

Leu He Tyr Leu Ala Ala Gly Lys Lys Tyr Ser Asp Thr Lys Asn Phe 

565 570 575 

Val Gly Val Pro Phe Ser Gin Phe He Lys Ala Thr Gly Glu Leu Arg 

580 585 590 

Tyr Ser Tyr Thr He Asp Arg Asn Gin Ser Leu Ala Thr Arg Phe Gly 

595 600 605 

Thr Gly Val He Tyr Ser Tyr Gly Asn Met Arg Val Ala Pro Tyr Ser 

610 615 62 0 

Glu Gin Phe Tvr Val Gly Gly Ala Asn Ser He Arg Ala Phie Thr Val 
625 ' 630 635 64 0 

Arg 3er He Gly Pro Gly Ary Phe Asn Pro Asp Ser Asp Asn Gin Tyr 

645 650 655 

Ser Tyr Leu Asp Gin Val Gly Glu Fhe Lys Leu Glu Ala Asn Val Giu 

660 665 6"0 

Tyr Arg Gly Lys Leu Phe Gly Asp Leu His Ala Ala Val Phe Leu Asp 
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WO 99/29870 PCT/AU98/01 023 

394/ 490 







675 










680 










685 








Ala 


Gly 

690 


Asn 


Va 1 


Trp 


Leu 


Leu 
695 


Arg 


Glu 


Asp 


Ser 


Ser 
7 00 


Arg 


Pro 


Gly 


Gly 


Ala 


Leu 


Ser 


Glu 


Val 


Gly 


Ser 


Val 


Ser 


Asn 


Phs 


Leu 


hsn 


Ser 


lie 


Ala 


705 










710 










715 










120 


Leu 


Gly 


Thr 


Gly 


Val 

725 


Gly 


Leu 


Arg 


Tyr 


A^p 
730 


Leu 


Ala 


Phe 


Leu 


Val 
735 


Val 


Arg 


V3l 


A^p 


Val 
7^0 


Gly 


Phe 


Gly 


Leu 


Hi^ 
7^5 


Leu 


Pro 


Tyr 


Asn 


Thr 
7 50 


Gly 


Ly^ 


Lys 


Gly 


Tyr 
755 


Tyr 


Asn 


He 


Pro 


Arg 
760 


Phe 


Lys 


A^p 


Ala 


He 
765 


Gly 


Phe 


His 


Leu 


Ala 

770 


Val 


Gly 


Tyr 


Pro 


Phs 
775 





















10 



15 (2} IIJFORMATIOM FOR SEQ ID MO: 4 54 

(i) SEQUEIJCE CHARACTERISTICS: 

(A) LEMGTH: 774 amino acids 

(B) TYPE: amino acid 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORTGIIJAL SOURCE: 

(A) ORGAllISIl: Po rphyromonas gingiva Hs 



(i::) FEATURE: 
30 (A) HAJIE/KEY: misc_feature 

(E) LOCATION 1 ... 774 



(;-:a) SEQUEIJCE DESCRIPTIOM: SEQ ID HO: 454 



Met 


Ser 


Ser 


His 


Ser 


Val 


Arg 


Tyr 


Leu 


He 


Gly 


He 


Ala 


G I y 


Cys 


Leu 


1 








5 










10 










15 




Leu 


Leu 


Met 


Leu 


Ala 


Ser 


Ser 


Cys 


Ser 


Val 


Thr 


Arg 


Tyr 


Val 


Pro 


Asp 








20 










25 
















Gly 


Ser 


Arg 


Leu 


Leu 


Asp 


Arg 


Val 


Thr 


He 


Ala 


Ser 


Glu 


Thr 


Gly 


Ser 






35 










4Q 










45 








He 


Ala 


Leu 


Pro 


Glu 


Asp 


He 


Arg 


Asp 


Tyr 


Thr 


Leu 


Gin 


Gin 


Pro 


Asn 




50 










55 










60 










Tyr 


Arg 


Leu 


Phe 


Gly 


Met 


Thr 


Arg 


Trp 


Leu 


Leu 


Arg 


Val 


Tyr 


Ser 


Ser 


65 










70 










75 










80 


Ser 


Asn 


Pro 


Asn 


Ser 


Asn 


Ser 


Trp 


Trp 


Asn 


Arg 


Ser 


Leu 


Arg 


Lys 


Met 










85 










90 










95 




Gly 


Glu 


Pro 


Pro 


Val 


Leu 


He 


Asp 


Ser 


Val 


Leu 


Thr 


Asp 


Arg 


Thr 


Ala 








100 










105 
















Asn 


Arg 


Leu 


Ala 


Lys 


Ala 


Met 


Ala 


Gly 


Asp 


Gly 


Phe 


Leu 


Asp 


Ala 


Thr 






115 










120 










125 








Ala 


Arg 


Ala 


Val 


Val 


Asp 


Thr 


Gly 


Leu 


Tyr 


Lys 


Lys 


Ala 


Arg 


He 


Thr 




130 










135 










140 










Tyr 


Leu 


He 


Gin 


Fro 


Gly 


Ser 


Arg 


Tyr 


Tyr 


He 


Arg 


Asn 


Met 


Ala 


Leu 


1^5 










150 










155 










160 


Asp 


Va 1 


Lys 


Asn 


Pro 


Leu 


Leu 


Pro 


Pro 


Val 


Ala 


Leu 


Gly 


Asn 


Ser 


Leu 










165 










170 










175 




Pro 


Ser 


Ala 


Tyr 


Lys 


Val 


Gly 


He 


Ser 


Glu 


Gly 


Ser 


Pro 


Leu 


Ser 


Pro 








180 










185 










190 






He 


Val 


Leu 


Asp 


Glu 


Glu 


Arg 


Lys 


Ala 


He 


Ala 


Arg 


His 




Arg 


Asn 






195 










200 










2 05 








Asn 


Gly 


Phe 


Trp 


Lys 


Phe 


Ser 


Ala 


Glu 


Asp 


Val 


Tyr 


Tyr 


Glu 


Ala 


Asp 




210 










215 










220 










Thr 


Thr 


Val 


Ser 


Gly 


Gly 


Ser 


Gly 


Thr 


Lys 


Ser 


Ala 


Asp 


Leu 


Lys 


Leu 


2 2 5 










230 








235 










2 4 0 


Val 


VaJ 


Asn 


Gly 


He 


Gly 


Arg 


Tyr 


Pro 


Tyr 


Arg 


He 


Gly 


Arg 


Val 


Phe 










245 










250 














Phe 


His 


Ala 


Asp 


Tyr 


Asp 


Pro 


Leu 


Glu 


Ser 


Asp 


Phe 


Arg 


Val 


Gin 


Glu 








260 










265 
















Leu 


Pro 


Arg 


He 


Asp 


Ser 


He 


Ser 


Arg 


Gly 


Asp 


Tyr 


Thr 


Val 


Tyr 


Tyr 






275 










280 








285 








Gly 


Ser 


Arg 


Gly 


Arg 


Tyr 


He 


Arg 


Ala 


Ser 


Ala 


Leu 


Thr 




Ser 


Val 




2 90 










295 










300 










Ser 


Val 


Thr 


Pro 


Gly 


Ala 


Phe 


Phe 


Cys 


Glu 


Asp 


Asp 


Val 


G _ 


Arg 


Ser 


305 










310 










315 








320 


Tyr 


He 


Lys 


Leu 


Asn 


Ala 


Leu 


Pro 


He 


Val 


Arg 


Asn 


Val 


As r. 


He 


Arg 
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325 



330 



335 



5 
10 
15 
20 
25 
30 
35 
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45 
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60 
65 
70 
75 



Phe Val G]u His Asn Gly Lys Asp Glu lie Ala Leu Ala Asp Ser Ser 

340 345 350 

Ara Leu Vsl Asp Cys Tyr lie Leu Thr Val Pro Ala Lys Ser Lys Ser 

355 360 365 

Phe Glu Ala Glu Val Leu Gly Thr Asn Ser Ala Gly A?p Phe Gly Ala 

370 375 300 

Ala Leu Ser Leu Gly Phe Thr As-p Arg Asn Leu Phe Arg Gly Ala Glu 
385 390 395 400 

M-t Phe Asn He Lys Leu Lys Gly Ala Tyr Glu Ala He Arg Lys Gly 

405 410 415 

Ser His Ser Phe Met Glu Tyr Gly Val Glu Ser Ser Leu Arg Phe Pro 

420 425 430 

Arq Leu Leu Phe Pro Phe He Ser Asp Glu Thr Arg Arg Arg Leu Arg 

435 440 445 

Ala Ser Thr Glu Trp Lys He Gly Tyr Asn Tyr Gin Thr Arg Pro Glu 

4 50 4 55 4 60 

Phe Asp Arg Val He Leu Ser Ala Gin Leu Asn Tyr Ser Trp Gin Thr 
465 470 475 480 

Tyr Leu His Asn Arg Leu Arg His Thr He Arg Leu Leu Asp Val Asp 

4 85 4 90 4 95 

Tyr Leu His Leu Pro Tyr He Asp Pro Asp Phe Ala Gin Ser Leu Pro 

500 505 510 

Pre Thr Tlir Ala Leu Tyr Asn Tyr Thr Glu Gin Phe He Leu Gly Ser 

515 520 525 

Ala Tyr He Leu Asn Tyr Thr Thr Ala Ser Ser i let Glu Arg Thr Val 

53 0 535 54 0 

Ser Asn Pro Phe Thr Ala Arg Phe Ser He Gin Thr Ala Gly Asn Leu 
54 5 550 555 560 

Leu Gin Ala He Ser Tyr Leu Thr Asp Ser Pro Lys Asp Glu His Gl-y 

565 570 575 

Leu Tyr Lys Met Phe Gly Leu His Tyr Ala Gin Phe Val Lys Leu Asp 

580 585 590 

Leu Asp Leu Ala Lys Thr Val Leu Leu Glu Lys Asp Asn Thr Leu Ala 

595 600 605 

Leu His Leu Gly Phe Gly Leu Ala Phe Pro Tyr Gly Asn Ala Arg His 

610 615 620 

He Pro Phe Glu Leu Arg Tyr Phe Ala Gly Gly Ser Asn Ser Val Arg 
625 630 635 640 

Gly Trp Ser Val Arg Thr Leu Gly Pro Gly Ser Met Lys Met Thr Pro 

645 650 655 

Asp Lys Thr Phe Phe Asp Gin Met Gly Asp He Arg Leu Asp Leu Asn 

660 665 ^70 

Val Glu Tyr Arg Thr Lys Leu Phe Trp Lys Phe Arg Ala Ala Ala Phe 

675 680 685 

Val Asp Ala Gly Asn Val Trp Thr He Lys Glu Tyr Glu Asn Gin Glu 

650 695 "'OO 

Asp Gly Leu Phe Arg Phe Asp Arg Phe Tyr Lys Glu He Ala Leu Ala 
705 710 715 "^20 

Tyr Gly Leu Gly Leu Arg Leu Asp Phe Asp Tyr Phe Leu Val Arg Leu 

725 730 735 

Asp Ala Gly Leu Lys Ala Tyr Asp Pro Gin Gin Thr Gly Arg Tyr Lys 

740 745 750 

Trp Ala He Thr Arg Pro Asn Leu Ser Ser Asn Phe Ala Trp His He 

755 760 765 

Ala Val Gly Tyr Pro Phe 



(2) lUFORMATIOII TOR SEQ ID NO: 4 55 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 867 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

( i i i ) ^i Y POT H ET I C AL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pcrphyromonas gingiva lis 

(i::) FEATURE: 

(A) NAME/KEY: inisc_f eature 

(B) LOCATION 1. . .867 



BNSDOCID: <WO 9929e70A1J_> 
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SEOUEIICE DESCRIPTION: SEQ ID IJO:*355 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



net 


Arg 


Lys 


Arg 


He 


Leu 


Gin 


Leu 


Phe 


Leu 


Thr 


Ala 


Leu 


Leu 


Leu 


Ala 


1 








5 










10 










15 




Leu 


Gly 


Ser 


Ser 


Leu 


Ala 


He 


Al a 


Gl n 


Thr 


Val 


Val 


Thr 


Gly 


Lys 


Val 








2 0 










25 










30 






He 


Arp 


Ser 


Gl u 


Thr 


Ser 


Glu 


Pro 


Leu 


He 


Gly 


Val 


Ser 


Val 


Ser 


Thr 






35 










4 C' 










4 5 








Gly 


Gin 


Gl-^- 


Ala 


Ser 


Leu 


Arg 


Gly 


Val 


Thr 


Thr 


Asp 


Met 


Asp 


Gly 


Gly 




50 










55 










60 










Phe 


Arg 


Phe 


Glu 


Val 


Pro 


Ala 


Ly^ 


Ser 


Val 


Leu 


Thr 


Phe 


Arg 


Cys 


Val 


65 










7 0 










75 








8 0 




Tyr 


Ala 


Thr 


Val 


Thr 


Arq 


Ser 


He 


Gly 


Arg 


Gly 


Ser 


Gin 


Glu 


Asp 










85 










90 










95 




Leu 


Gly 


Thr 


He 


Leu 


Leu 


Asp 


Pro 


Gin 


Ala 


He 


Gly 


Leu 


Asp 


Glu 


He 








100 










105 










110 






Gin 


WhX 


He 


Ala 


Ser 


Val 


Val 


Pro 


Lys 


Asp 


Arg 


Met 


Thr 


Pro 


Val 


Pro 






115 










120 










125 








Val 


Ser 


Asn 


He 


Arg 


Va: 


Ala 


Asp 


He 


Gin 


Ala 


Ala 


Ser 


Leu 


Asn 


Val 




130 










135 










140 










Glu 


Phe 


Pro 


Glu 


Leu 


Val 


Lys 


Ser 


Thr 


Pro 


Ser 


Thr 


Tyr 


Thr 


Thr 


Lys 


145 










150 










155 










160 


Gly 


S&r 


Gly 


Gly 


Phe 


Gly 


Asp 


Gly 


Arg 


Thr 


Asn 


Val 


Arq 


Gly 


Fhe 


Asp 










165 










170 










175 




Thr 


Tyr 


Asn 


Pile 


Gly 


Val 


Leu 


He 


Asn 


Gly 


V h 1 


Pro 


Val 


Asn 


Gly 


Met 








ICO 










185 










190 




Glu 


Asp 


Gly 


Lys 


Val 


Tyr 


Trp 


Ser 


Asn 


Trp 


Ser 


Gly 


Leu 


i'let 


Asn 


Gin 






195 










2 00 










2 05 








Ala 


Ser 


Thr 


He 


Gin 


He 


Gin 


Arg 


Gly 


Leu 


Gly 


Ala 


Ser 


Lys 


Leu 


Gly 




210 










215 










220 










lie 


Ser 


Ser 


Val 


Gly 


Gly 


Thr 


Met 


Asn 


He 


He 


Thr 


Lys 


Thr 


Thr 


Asp 


225 










230 










235 










240 


Ala 


Asn 


Thr 


Gly 


Gly 


Ser 


Ala 


Tyr 


Val 


Gly 


1 let 


Gly 


Asn 


Asp 


Gly 


Leu 










245 










250 










255 




His 


Ly^ 


Glu 


Ser 


Phe 


Ser 


He 


Ser 


Thr 


Gly 


Met 


Asn 


Asp 


Gly 


Trp 


Ala 








260 










265 










270 






He 


Thr 


He 


Ala 


Gly 


Ser 


His 


Met 


Thr 


Gly 


Leu 


Gly 


Tyr 


Val 


Lys 


Gly 






27 5 










2 80 










285 








Leu 


Lys 


Gly 


Arg 


Ala 


Phe 


Ser 


Tyr 


Phe 


Phe 


Asn 


Val 


Ser 


Lys 


Lys 


Phe 




2 90 










295 










300 










As-n 


Glu 


Arg 


His 


Thr 


Leu 


Ser 


Leu 


Thr 


Gly 


Phe 


Gly 


Ala 


Pro 


Gl n 


Trp 


305 










310 










315 










320 


Hif? 


Asn 


Gin 


Arg 


Ser 


Ser 


Lys 


Tyr 


Ser 


Val 


Ala 


Asp 


Tyr 


Asp 


Lys 


Tyr 










32 5 










330 










335 




Gly 


He 


Arg 


His 


Asn 


Gin 


Ser 


Phe 


Gly 


Tyr 


Leu 


Arg 


Gly 


Glu 


Leu 


Thr 








34 0 










345 










350 






Pro 


Thr 


Al5 


Tyr 


Ala 


Tyr 


Asn 


Thr 


Tyr 


His 


Lys 


Pro 


Gin 


Phe 


Ser 


Leu 






355 










360 










365 








Asn 


His 


Phe 


Trp 


Lys 


Met 


Asp 


Glu 


Asn 


Thr 


Ser 


Leu 


Tyr 


Thr 


Ala 


Unk 




370 










375 










380 










Tyr 


Ala 


Ser 


Leu 


Ala 


Thr 


Gly 


Gly 


Gly 


Arg 


Arg 


Ala 


Tyr 


Gly 


Lys 


Asn 


385 










390 










395 










400 


Ser 


Lys 


Trp 


Val 


Leu 


He 


Asn 


Tyr 


Asn 


Thr 


Gly 


Gin 


Pro 


Tyr 


Glu 


Gin 










4 05 










410 










415 




Thr 


Lys 


Val 


Thr 


Pro 


Asp 


Gly 


Leu 


He 


Asp 


Tyr 


Asp 


Ala 


Val 


Leu 


Ala 








4 20 










4 25 










430 






Al a 


Asn 


Al a 


Ala 


Ala 


Ser 


Asn 


Gly 


Ser 


Glu 


Ala 


He 


Phe 


Ala 


Leu 


Gly 






^35 










4 4 0 










4 4 5 








Se r 


Asn 


Ser 


His 


Lys 


Trp 


Phe 


G 1 y 


Leu 


Leu 


Ser 


Ser 


Phe 


Lys 


Lys 


Lys 




4 50 










455 










4 60 






Leu 


Asn 


Ser 


Ser 


Leu 


Thr 


Leu 


Thr 


Ala 


Gly 


Tyr 


Asp 


Gly 


Arg 


Tyr 


Tyr 


465 










470 










475 




4 80 




Gly 


Asp 


His 


Tyr 


Asp 


Ly^ 


He 


Thr 


Asp 


Leu 


Leu 


Gly 


Gly 


Ser 


Tyr 










485 










4 90 










4 95 




Tyr 


He 


Glu 


Asp 


Pro 


Lys 


Thr 


Ly5 


Leu 


Ala 


Tyr 


His 


Ala 


Glu 


Gly 


Gin 








500 










505 










510 






Gin 


Leu 


Lys 


Val 


Gly 


Asp 


He 


Va 1 


Asn 


Arg 


Asp 


Tyr 


Thr 


Gly 


Glu 


He 






515 










52 0 










525 








net 


Trp 


His 


Gly 


Leu 


Phe 


Ala 


Gin 


Met 


Glu 


His 


Ser 


Ser 


Glu 


Trp 


He 




530 










535 










540 










Asp 


Ala 


Phe 


Val 


Ser 


Gly 


Ser 


He 


Asn 


Tyr 


Glu 


Leu 


Tyr 


Arg 


Asn 


His 


545 










550 










555 






560 


Asn 


Tyr 




'31 y 


Ser 


Lys 


Ser 


Thr 


Gly 


Tyr 


Leu 


Pro 


Gly 


Val 


Ser 


Pro 










565 










57 0 










575 





SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID'. <WO 9929S70A1 J_ 



wo 99/29870 



PCT/AU98/01023 



397/ 490 

Trp Ly^ S-sr Phe Leu Pro Trp Ser Giy Lys Ala Giy Leu Ser Tyr Lys 

580 585 590 

Phe Ala Gin Giy His Asn Val Phe Ala A-n Giy Giy Fhe Phe Thr Arg 
595 600 605 

5 Ala Pro Leu Phe Giy Asn lie Tyr Ala Ala Giy Ala He He Pro Am 
610 615 620 

ha-p Lys Ala Asn I let Glu Lys: Val Leu Thr Giy Glu Val Giy Tyr Giy 
625 630 635 6-10 

P)ie Thr Am His Lys Asn Phe Glu Phe Asn He Asn Giy Tyr Tyr Thr 
10 645 650 655 

Ly£^ Trp Met A^p Arg Val Thr Ser Lys Arg He Giy Asn Glu Tyr Val 

660 665 670 

Tyr Leu Asn Giy Val Asp Ala Val His Cys Giy Val Glu Ala Glu Val 
675 680 685 

15 Ser Tyr Arg Pro He Arg Gin He Asp Leu Arg Giy Met Phe Ser Leu 
690 695 700 

Giy Asp Trp Thr Trp Gin Asn Asn Val Ser Tyr Thr Ser Tyr Asp Glu 
705 710 715 720 

Ala Giy Asn Glu Thr Giy Gin Asp He Thr Tyr He Lys Giy Leu His 
20 725 730 735 

Val Giy Asp Ala Ala Gin Met Thr Ala Ala Val Ser Ala Asp He Glu 

740 745 750 

Leu Plie Lys Giy Phe His Val He Giy Lys Tyr Asn Phe Leu Giy Lys 
755 760 765 

25 Asn Tyr Ala Giy Phe Asn Pro Ala Thr Arg Asn Ala Gin Gin Tyr Glu 
77 0 775 78 0 

Ala Asp Giy Lys Glu He Val Glu Ser Trp Lys Leu Pro Asp Val Giy 
78 5 7 90 7 95 800 

Leu Phe Asp Leu Ser Ala Ser Tyr Asn Phe Lys Leu Giy Ser Leu Ser 
30 305 810 815 

Thr Thr Phe Tyr Phe Asn Met Asp Asn Val Ala Asp Lys Arg Tyr Val 

820 825 830 

Ser Asp Ala Asp Asp Asn He He Giy Lys Lys His Asp Glu Ala Ser 
835 840 845 

35 Ala Leu Val Trp Tyr Giy Phe Giy Arg Thr Trp Ser Thr Giy He Arg 
850 855 860 

Val Asn Phe 
865 

40 (2) ItlFORIIATIOn E'OR SEQ ID 110:4 56 

(i) SEQUEIICE CHARACTERISTICS: 

(A) LEIIGTH: 431 amino acids 

(B) TYPE: amino acid 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(lii) HYPOTHETICAL: YES 



50 



65 



70 



75 



(vi) ORIGIIIAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 



(ix) FEATURE: 
55 (A) tiAME/KEY: misc_feature 

(B) LOCATION 1 ... 431 



60 





(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 456 










Met 


Lys 


Phe 


;-er 


He 


Arg 


Leu 


Phe 


Leu 


Cys 


He 


He 


Phe 


Leu 


Leu 


Ser 


1 








5 








10 










15 




Ala 


Phe 


He 


Leu 


Pro 


Ala 


Leu 


Giy 


Gin 


Lys 


Ser 


Lys 


Gin 


Val 


Gin 


Arg 








2 0 










2 5 










30 






Leu 


Glu 


Lys 


Gin 


Arg 


Lys 


Glu 


Ala 


Leu 


Lys 


Ala 


He 


Glu 


Lys 


Thr 


Asp 






35 










40 










45 








Arg 


Glu 


Leu 


Arg 


Asn 


Thr 


Lys 


Lys 


Asp 


Lys 


Gin 


Asp 


Lys 


Gin 


Lys 


His 


50 










c c 
^ ■~> 










60 










Leu 


Asn 


Leu 


Leu 


Asn 


Lys 


Gin 


Val 


Ala 


Gin 


Arg 


Lys 


Gin 


Met 


Val 


Gin 


65 










70 










75 










80 


Leu 


Leu 


Asp 


Asn 


Glu 


Val 


Lys 


Glu 


Leu 


Gin 


Ser 


Asp 


He 


Asp 


Ser 


Met 








85 










90 










95 




Thr 


Giy 


Val 


^ J *- 


His 


Gin 


Leu 


Ser 


Val 


Glu 


Glu 


Lys 


Ala 


Arg 


Ser 


Asp 






1 0 0 










105 










110 






Glu 


Tyr 


Ala 


Gin 


Ala 


Leu 


Gin 


Ser 


Met 


Gin 


Lys 


Arg 


Lys 


Arg 


Ser 


Leu 




115 










120 










125 









SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNISDOCID: <WO 9929B70A1 J_> 



wo 99/29870 



PCT/AU98/01023 



398/490 



50 



60 



65 



70 



75 



Asp 


Arg 


He 


Leu 


Phe 


He 


Ser 


Ser 


Al a 


Lys 


Ser 


Phe 


Asp 


Gl u 


Gly 


hJe t 




130 










135 










14 0 










Arg 


Arg 


Met 


Arg 


Phe 


Leu 


Glu 


Gin 


Tyr 


Al a 


Ser 


Al a 


Tyr 


Lys 


Leu 


Ala 


1^5 










150 










155 










1 60 


Ser 


Va 1 


Arg 


Leu 


Arg 


Asp 


Thr 


Arg 


Ser 


Lys 


Leu 


Glu 


Thr 


Glu 


Arg 


Ala 










1 65 










170 










17 5 




Thr 


Val 


Glu 


Asp 


Ala 


Lys 


Lys 


Glu 


Lys 


Gly 


His 


Leu 


Leu 


Val 


He 


Arg 








180 










185 










190 






Glu 


Gl u 


Gl u 


Lys 


Lys 


Lys 


Leu 


Glu 


Gly 


Gin 


Gin 


Ala 


Glu 


Gin 


Arg 


Arg 






1 95 










200 










205 








Gin 


Val 


Gin 


A] a 


Leu 


Gly Ala 


Lys 


Gin 


Lys 


Asp 


Leu 


Glu 


Ala 


Gin 


Leu 




210 










215 










220 










Arg 


Lys 


Gin 


Lys 


Lys 


Gin 


Ala 


Glu 


Ala 


Leu 


Asn 


Arg 


Lys 


He 


Glu 


Lys 


225 










230 










235 










240 


Gin 


He 


Ala 


Lys 


Glu 


He 


Glu 


Ala 


Ala 


Glu 


Arg 


Arg 


Ala 


Arg 


Glu 


Glu 




















250 










255 




Arg 


Glu 


Arg 


Leu 


Ala 


Arg 


Glu 


Ala 


Lys 


Ala 


Lys 


Gly 


Lys 


Pro 


Val 


Pro 








2 60 










265 










270 






Ala 


Glu 


Pro 


Gl u 


Arg 


Lys 


Ala 


Glu 


Thr 


Lys 


Gly Gly 


Tyr 


Ala 


Met 


Asp 






275 










280 










285 








Ala 


Ser 


Gl u 


Arg 


Ala 


Leu 


Ser 


Gly 


Ser 


Phe 


Ala 


Gin 


Asn 


Lys 


Gly 


Arg 




2 90 










295 










300 










Leu 


Pro 


Gly 


Pro 


Val 


Arg 


Gly 


Arg 


Tyr 


Arg 


He 


Val 


Ser 


Asp 


Phe 


Gly 


305 










310 










315 










32 0 


Val 


Hi s 


Gin 


His 


Ser 


Glu 


Leu 


Lys 


Lys 


Val 


Gin 


Val 


Asn 


Asn 


Gly 


Gly 










325 










330 










335 




He 


A^p 


He 


Ala 


Val 


Ala 


Thr 


•31 y 


Ser 


Asp 


Ala 


Thr 


Ser 


Val 


Phe 


Asp 


















345 










350 






Gly 


Val 


Val 


Ser 


Ser 


Val 


Phe 


Val 


He 


Pro 


Gly 


Tyr 


Asn 


Ser 


Ala 


Val 






355 










360 










365 








Met 


Val 


Arg 


His 


Gly 


Asn 


Tyr 


He 


Thr 


Val 


Tyr 


Ala 


Asn 


Leu 


Ser 


Lys 




370 










375 










330 










Val 


Tyr 


Val 


Asn 


Ser 


Gly 


Thr 


Arg 


Val 


Lj's 


Thr 


Gly 


Gin 


Ala 


Leu 


Gly 


385 










390 










395 










400 


Arg 


Ala 


Tyr 


Thr 


Asp 


Pro 


Ser 


Asn 


Asn 


Gin 


Thr 


He 


He 


His 


Phe 


Glu 










405 










410 










415 




He 


Trp 


Lys 


Glu 


Arg 


Ser 


Lys 


Gin 


Asn 


Pro 


Arg 


Leu 


Trp 


Leu 


Arg 










420 










425 










430 







10 



15 



20 



25 



30 



35 



40 (2) lUFORMATIOIJ FOR SEQ ID MO: 457 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 333 amino acidE 

(B) TYPE: amino acid 
45 (D) TOPOLOGY: linear 

(ii) noLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGItlAL SOURCE: 

(A) ORGAIIISfl: Porphyromonas gingivalic 



a 'A) FEATURE: 
55 (A) MAIIE/KEY: misc^f eature 

{E} LOCATIOIJ 1 . . . 333 



SEQUEHCE DESCRIPTION: SEQ ID HO: 4 57 



Met 


Lys 


Lys 


Tyr 


Leu 


Leu 


Tyr 


Ala 


Ser 


Leu 


Leu 


Thr 


Ser 


Val 


Leu 


Leu 


1 








5 










10 










15 




Phe 


Ser 




Ser 
2 0 


Lys 


Asn 


Asn 


Pro 


Asn 

25 


Glu 


Pro 


Val 


Glu 


Asp 
30 


Arg 


Ser 


He 


Glu 


He 
35 


Ser 


He 


Arg 


Val 


Asp 
40 


Asp 


Phe 


Thr 


Lys 


Thr 
45 


Gly 


Glu 


Ala 


Val 


Arg 
50 


Tyr 


Glu 


Arg 


Asn 


Gin 
55 


Gly 


Ser 


Ala 


Ala 


Glu 

60 


Arg 


Leu 


He 


Thr 


Asn 


Leu 


Tyr 


Leu 


Leu 


Leu 


Phe 


Asp 


Gin 


Ser 


Gly 


Ala 


Asn 


Pro 


Ala 


Lys 


65 










70 










75 










80 


Tyr 


Tyr 


He 


Thr 


Gly 
85 


Asn 


Thr 


Phe 


Thr 


Gly 
90 


Gly 


Thr 


Trp 


Leu 


Pro 
95 


Asp 


Asp 


Met 


Lys 


Val 


Lys 


Leu 


Asp 


Met 


Thr 


Gin 


Ser 


Glu 


Ala 


Gly 


Glu 


Arg 








100 










105 










110 




Lys 


Val 


Tyr 
115 


Val 


Val 


Ala 


Asn 


Val 
120 


Asp* 


Asn 


Ala 


Val 


Lys 
125 


Thr 


Ala 


Leu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO_9929e70AlJ_> 



wo 99/29870 



PCT/AU98/01023 



399/490 





Asp 


AJa 


Val 


Ala 


Asn 


Glu 


Ser 


Asp 


Leu 


GJn 


Thr 


Val 


Lys 


Arg 


Thr 


Thr 




130 










135 










140 








Gly 




Ala 


Met 


Pro 


Trp 


Ser 


Thr 


Asp 


He 


Ala 


Ser 


Pro 


Phe 


Leu 


I let 


Ser 




1 45 








150 










155 








Val 
175 


160 


5 


Asn 


Lys 


Thr 


His 


Asp 
165 


Phe 


Leu 


Ala 


Asn 


Arg 
170 


Leu 


Leu 


Asp 


Asn 


Pro 




Leu 


Val 


Arg 


Ala 


He 


Ala 


Lys 


Val 


Glu 


Leu 


Asn 


He 


Ser 


Leu 


Ser 


Glu 








180 










185 










1 ?0 








Lys 


Phe 


Gin 


lie 


Val 


Pro 


He 


He 


Val 


Asn 


Gly 


Ser 


Leu 


Ser 


Glu 


Phe 


10 






195 










2 00 










205 








Lys 


Phe 


Arg 


Tyr 


Val 


Asn 


Phe 


Asp 


Lys 


Glu 


Thr 


Tyr 


Val 


Val 


Lys 


Pro 






210 








215 










220 












Thr 


Thr 


Lys 


Pro 


Asp 


Asn 


Leu 


He 


Ser 


Ser 


Ala 


Asn 


Gly 


Val 


Trp 


Pro 




o o C 










230 










5 










2.10 


15 


Gin 


lie 


Thr 


Asp 


Trp 
2^15 


Thr 


Val 


Trp 


Gly 


Ala 
250 


Ser 


Leu 


Asn 


Thr 


Ser 
255 


Pro 




Ala 


Pro 


Asp 


Ala 

260 


Gly 


Thr 


Gly 


Tyr 


Thr 

265 


Leu 


Asp 


Ala 


Asn 


^iv 

270 


Lys 


Val 




Thr 


Ala 


Leu 


Arg 


He 


Val 


Thr 


Tyr 


Leu 


Asn 


Glu 


Arg 


Asp 


Ser 


Lys 


Gly 


20 






275 








280 










285 








Ala 


Thr 

290 


Va 1 


Glu 


Val 


Ala 


Leu 
295 


Pro 


Arg 


Val 


Asp 


Asp 
300 


Gly 


Thr 


Leu 


Pro 




Pro 


Pro 


Glu 


Phe 


Giy 


Pro 


Glu 


Leu 


Tyr 


Arg 


Leu 


Pro 


Leu 


Pro 


Asp 


Lys 




305 










310 










315 










320 


25 


He 


Leu 


Arg 


Asn 


His 
325 


Trp 


Tyr 


Lys 


Tyr 


Glu 

3 30 


Val 


Glu 


He 









(2) IIIFORIIATIOH FOR SEQ ID IIO:458 

30 (i) SEQUEIICE CHARACTERISTICS: 

(A) LEUGTH: 315 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
40 (A) ORGAMISM: Porphyromonas gingiva lis 

(i:0 FEATURE: 

(A) IIAME/KEY: mi sc_f ea t ure 

(B) LOCATIOtI 1 . . .315 



45 



SEQUENCE DESCRIPTION: SEQ ID NO: 458 





Met 


He 


Arg 


Thr 


He 


Leu 


Ser 


Arg 


Tyr 


Val 


Ser 


Ser 


Asn 


Phe 


Trp 


Ser 




1 






5 










10 










15 


Ala 


50 


Arg 


Gly 


Ala 


Thr 


Phe 


Phe 


E-'he 


Thr 


He 


Phe 


Pro 


Ala 


Phe 


He 


Leu 








2 0 










25 










30 








Ala 


Thr 


Ala 
35 


Leu 


Pro 


Ala 


Cys 


Gly 
40 


Gly 


Gly 


Thr 


Ala 


Ser 
45 


Gly 


Ser 


Asp 




Arg 


Thr 


Leu 


Ala 


Val 


Thr 


He 


Glu 


Pro 


Gin 


Lys 


T^-r 


Phe 


He 


Glu 


Ser 


55 




50 










55 










60 












li- 


Ala 


Asp 


Lys 


Ser 


Val 


Gin 


Val 


Val 


Ala 


Leu 


Val 


Pro 


Ala 


Gly 


Ser 




es 










70 










75 










80 




Asn 


Pro 


Glu 


Glu 


Tyr 
85 


Asp 


Pro 


Ser 


Pro 


Thr 
90 


Val 


Met 


Lys 


Arg 


Leu 
9 5 


Ser 


60 


Glu 


Ala 


Asp 


Ala 


Tyr 


Phe 


Tyr 


He 


Gly 


Gly 


Leu 


Gly 


Phe 


Glu 


Gin 


Arg 










100 








105 










110 








Asn 


Leu 


Ala 
115 


Ala 


He 


Arg 


Asp 


Asn 
120 


Asn 


Pro 


Lys 


Leu 


Pro 
125 


Leu 


Phe 


Gl u 




Met 


Gly 


Lys 


Ala 


Leu 


Ala 


Asp 


Ala 


Gly 


Ser 


Ala 


Asp 


Leu 


His 


Giy 


Ser 


65 




130 








135 










140 












Cys 


Thr 


Asp 


His 


Ser 


His 


Thr 


Asp 


Leu 


His 


Ala 


His 


Asp 


Pro 


His 


Tyr 




145 










150 










155 










160 




Trp 


Ser 


Ser 


Val 


Val 


Gly 


Ala 


Lys 


Ala 


Leu 


Ser 


Arg 


Ala 


Ala 


Tyr 


Asp 










165 










170 










175 




70 


Ala 


Leu 


Val 


Glu 
180 


Leu 


Tyr 


Pro 


Asn 


Glu 
105 


Lys 


Asp 


Lys 


Trp 


Asp 
190 


Lys 


Gly 




His 


Asp 


Arg 


Leu 


Asn 


Gly Arg 


He 


Asp 


Ser 


Val 


Lys 


Arg 


Leu 


Val 


Afrp 








195 










200 










205 










Thr 


Met 


Phe 


Ala 


Asn 


Gly 


Lys 


Ala 


Asp 


Lys 


Ala 


Phe 


Val 


He 


Tyr 


His 


75 




210 










215 










220 











SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1J_> 



wo 99/29870 



PCT/AU98/01023 



400/ 490 



20 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Pro 


Ser 


Leu 


Ser 


Phe 


Phe 


Ala 


Gin 


Glu 


Phe 


Gly 


Leu 


Arg 


Gin 


He 


Val 


225 










230 










235 










2^10 


li- 


Glu 


Glu 


Asp 


Gly 


Lys 


Glu 


Pro 


Thr 


Ala 


Ala 


His 


Leu 


Arg 


Arg 


Val 










2-^5 










250 










255 




ne 


Asp 


Gin 


Ala 


Arg 


Ala 


As-p 


Gly 


Val 


Arg 


He 


Val 


Phe 


He 


Gin 


Pro 








260 










265 










270 






Glu 


Fhe 


Ga u 


Thr 


Arg 


Gin 


Ala 


Glu 


A-p 


He 


Ala 


Arg 


Gl u 


He 


Gly 


Ala 






27 5 










280 










285 








Arg 


Pro 


Val 


Arg 


He 


Aeh 


Pro 


Leu 


Arg 


Ser 


Ser 


Trp 


Glu 


Glu 


Glu 


He 




2 90 










255 










30D 










Leu 


Wis 


He 


Ala 


Arg 


Ala 


Leu 


Ala 


His 


Glu 


Arg 












305 










310 










315 












(2) 


I! I FOR! lATIOI J 


FOR 


SEQ 


ID DO: 459 

















10 



15 

li) SEQUEIICE CHARACTERISTICS; 

(A) LEIJGTH: B4Q amino acidc 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

25 (vH ORIGIUAL SOURCE: 

{A) ORGAJJISil: Porphyromonas gingival is 

(ix) FEATURE: 

(A) HAIdE/KEY: mis cofeature 
30 (B) LOCATION 1. . .84 8 

(xi) SEQUEHCE DESCRIPTION: SEQ ID NO: ^59 



Met 


He 


Gly 


Lys 


Lys 


He 


Phe 


Phe 


He 


Leu 


Leu 


Ala 


Leu 


He 


Ala 


Phe 


1 








5 










10 










15 




Ser 


Gly 


Leu 


Asn 
20 


Ala 


Ala 


Thr 


Asp 


Thr 

25 


Glu 


Phe 


Lys 


Tyr 


Pro 
30 


Thr 


Asp 


Ala 


Asn 


He 
35 


He 


Gly 


His 


Val 


AO 


Asp 


Ser 


Lys 


Thr 


Gly 
45 


Glu 


His 


Leu 


Val 


Gly 


He 


Thr 


He 


Ala 


He 


Lys 


Gly 


Thr 


Thr 


Phe 


Gly 


Thr 


Ser 


Thr 




50 










55 








60 










Asp 


Ala 


Thr 


Gly 


His 


Tyr 


Tyr 


Leu 


Arg 


Asn 


Leu 


Arg 


Pro 


Gly 


Glu 


He 


65 










70 










75 










80 


Thr 


Leu 


He 


1 let 


Arg 
85 


Gly 


Flet 


Gl- 


Tyr 


Lys 
90 


Ser 


Gin 


Glu 


Arg 


Val 
95 


Va 1 


Arg 


V a 1 


Glu 


Lys 
100 


Asp 


Ly- 


Thr 


ue 


Glu 
105 


Val 


A 5 n 


Phe 


Glu 


Ala 
110 


Glu 


Glu 


Asp 


Ala 


He 


Asn 


Leu 


Asp 


Glu 


Val 


Val 


He 


Ser 


Ala 


Asn 


Arg 


Gl u 


Leu 






115 










120 










125 






Thr 


Leu 


Arg 


Arg 


Leu 


Ala 


Pro 


Thr 


Leu 


Val 


Asn 


Val 


Leu 


Asn 


Glu 


Lys 




130 










135 










140 








Val 


Phe 


Ser 


Gin 


Val 


Asn 


Ala 


Ser 


Asn 


Leu 


Ala 


Gin 


Gly 


Leu 


Ser 


Phe 


145 










150 










155 








160 


Gin 


Pro 


Gly 


Val 


Arg 
165 


Val 


Glu 


Asn 


Asn 


Cys 
170 


Gin 


Asn 


Cys 


Gly 


Phe 
175 


Asn 


Gin 


Val 


Arg 


He 
180 


Asn 


Gly 


Leu 


Asp 


Gly 
185 


Arg 


Tyr 


Ala 


Gin 


He 
1 90 


Leu 


He 


Asp 


Ser 


Arg 
195 


Pro 


He 


Met 


Ser 


Ala 
200 


Leu 


Ala 


Gly 


Val 


Tyr 
205 


Gly 


Leu 


Gj u 


Gin 


He 


Pro 


Ala 


Asn 


Met 


He 


Glu 


Arg 


Val 


Glu 


Val 


Val 


Arg 


Gly 


Gly 




210 










215 










220 






Gly 


Ser 


Ala 


Leu 


Tyr 


Gly 


Ser 


Ser 


Ala 


He 


Ala 


Gly 


Val 


Val 


Asn 


He 


225 










230 










235 










24 0 


He 


Thr 


Lys 


Glu 


Pro 
24 S 


Ser 


His 


Asn 


Ser 


Phe 
250 


Thr 


Phe 


Asn 


Glu 


Ser 
255 


Leu 


Ser 


Phe 


Thr 


Gly 
260 


Phe 


Ser 


Lys 


Leu 


Asp 

265 


Asn 


Asn 


Thr 


Asn 


Phe 
270 


Asn 


Ala 


Ser 


He 


Val 


Ser 


Asp 


Asp 


Asn 


Arg 


Ala 


Gly 


Ala 


Met 


Val 


Phe 


Gly 


Gin 






275 










280 










285 






Ala 


Arg 


Tyr 


Arg 


Asn 


His 


Trp 


Asp 


Ala 


Asn 


Asn 


Asp 


Gly 


Tyr 


Ser 


Glu 




2 90 




















300 








Leu 


Gly 


Lys 


He 


Asp 


Ala 


Arg 


Ser 


Leu 


Gly 


Ala 


His 


Ser 


Tyr 


Leu 


Arg 


305 










310 










315 








320 


Leu 


Ser 


Asp 


Tyr 


Ser 
325 


Lys 


Leu 


Thr 


Gly 


Glu 
330 


Phe 


His 


Thr 


He 


Ser 
335 


Glu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



401/ 490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



Phi- 


Arg 


Arg 


Gly 


Gly 


Asp 


Arg 


He 


Asp 


Leu 


Pro 


Pro 


His 


Val 


Val 


Gly 




3 'JO 










345 










350 






Val 


Ala 


Glu 


Gin 


Thr 


Asp 


His 


Ser 


Val 


Phe 


Ser 


G 1 y 


Asn 


Leu 


Lys 


Tyr 






355 










360 










365 








Asp 


Leu 


Phe 


Ser 


Ser 


Asn 


Tyr 


Lys 


His 


His 


Phe 


Gl n 


Ala 


Tyr 


Thr 


Ser 


370 










375 










380 










Gly 


Gin 


lie 


Val 


Asn 


Arg 


Lys 


Ser 


Tyr 


Tyr 


Gly 


Giy 


He 


Gly 


Gl u 


He 


385 










390 










395 










400 


Asp 


Val 


Asn 


Gly 


His 


Pro 


Gly 


Gly 


Thr 


Glu 


Gly 


Tyr 


Pro 


He 


Pro 


Gin 








405 










4 10 










415 




Asp 


Gin 


Tj-r 


Gly 


Asn 


Asn 


Tyr 


Giy 


Val 


Thr 


Lys 


Gly 


Lys 


Thr 


Tyr 


Met 






'J 2 0 










425 










4 30 






Giy 


Gly 


lie 


Gin 


Tyr 


Ser 


Tyr 


Asp 


Leu 


Asp 


Lys 


Phe 


Leu 


Leu 


Met 


Pro 




'135 










4 40 










44 5 








Ser 


Gin 


Leu 


Leu 


Phe 


Gly 


Ala 


Glu 


Tyr 


Thr 


Arg 


Asp 


Glu 


Leu 


Asn 


Asp 




4 50 










4 55 










460 










Val 


I let 


Pro 


lie 


Leu 


Ser 


Trp 


Gin 


Th r 


Gly 


Glu 


Asp 


Ala 


Asn 


Gly 


Asn 


^65 










470 










475 










4 8 0 


Thr 


lie 


Pro 


Leu 


Tyr 


Pro 


Glu 


Leu 


Asp 


Gl n 


Asn 


He 


Asn 


Asn 


Tyr 


Ser 










-185 










4 90 










4 95 




Leu 


Phe 


Gly 


Gin 


Asn 


Glu 


Trp 


Lys 


Asn 


Asp 


Arg 


Trp 


Ser 


He 


Leu 


Va 1 








500 










505 










510 






Gly 


Ala 


Arg 


Leu 


Asp 


Lys 


His 


Ser 


Glu 


Val 


Lys 


Asp 


Met 


He 


Leu 


Ser 






515 










52 0 










525 








Fro 


AL-g 


Thr 


Thr 


Leu 


Arg 


Phe 


As]i 


Val 


Asn 


Pro 


Asp 


He 


Asn 


Leu 


Arg 




530 










535 










54 0 










Ala 


Tlir 


Tyr 


Ala 


Lys 


Gly 


Phe 


Arg 


Ala 


Pro 


Gin 


Val 


Phe 


Asp 


Gl u 


Asp 


545 










550 










555 










560 


Leu 


His 


Val 


Gly 


Val 


Val 


Gly 


Gly 


Gl u 


Ala 


Gin 


Lys 


Val 


Phe 


Asn 


Asp 










565 










57 0 










575 




Pro 


Asn 


Leu 


Lys 


Pro 


Glu 


He 


Ser 


His 


Ala 


Phe 


Ser 


Leu 


Ser 


Ala 


Asp 








580 










585 










590 






Met 


Tyr 


His 


Arg 


Phe 


Gly 


Asn 


Val 


Gin 


Thr 


A^n 


Phe 


Leu 


Val 


Glu 


Gly 






595 










600 










605 








Phe 


Tyr 


Thr 


Arg 


Leu 


Leu 


Asp 


Val 


Phe 


Thr 


Asn 


Glu 


Glu 


Gin 


Pro 


Asp 




610 










615 










620 










Gin 


His 


Asp 


Gly 


He 


Lys 


Arg 


Tyr 


Thr 


Arg 


He 


Asn 


Gly 


Ser 


Gly 


Ala 


625 










630 










635 










64 0 


Lys 


Val 


Phe 


Gly 


Leu 


Asn 


Leu 


Glu 


Gly 


Lys 


Val 


Ala 


Tyr 


Lys 


Ser 


Phe 










645 










650 










655 




Gin 


Leu 


Gin 


Ala 


Gly 


Leu 


Thr 


Leu 


Ala 


Ser 


Asn 


Lys 


Tyr 


Asp 


Glu 


Ala 








660 










665 










67 0 






Gin 


Glu 


Trp 


Gly 


Leu 


Asn 


Thr 


Val 


hyr. 


Asp 


Thr 


Asn 


Gly Ala 


Phe 


Val 






67 5 










680 










685 








Thr 


G 1 u 


Ala 


Asn 


Ala 


Asn 


Gly 


Gin 


Gin 


Glu 


Tyr 


Lys 


Asn 


Glu 


Ser 


Met 




690 










695 










7 00 










Thr 


Asp 


Thr 


Gin 


He 


Ttir 


Arg 


Thr 


Pro 


Ser 


Val 


Tyr 


Gly 


Tyr 


Phe 


Thr 


7 05 










710 










715 










7 2 


Leu 


Ala 


T-yr 


Asn 


Pro 


Ala 


Hlc 


Ser 


Trp 


Asn 


He 


Ala 


Leu 


Thr 


Gly 


Ala 










725 










7 30 










735 




Tyr 


Thr 


Gly 


Gin 


Met 


Tyr 


Val 


Pro 


His 


Ala 


1 le 


Glu 


Tyr 


Gly 


Val 


Lys 








740 










7 4 5 










7 50 






Ser 


Ala 


Glu 


Leu 


Asp 


He 


Met 


Gin 


Asn 


Asn 


Pro 


Glu 


He 


Thr 


Asp 


Glu 






755 










760 










765 








Thr 


Gly 


Lys 


Ala 


Pro 


Arg 


He 


Asp 


Glu 


Leu 


Lys 


Lys 


Thr 


Pro 


Ala 


Phe 




770 










775 










780 










Phe 


Asp 


Leu 


Gly 


Leu 


Lys 


Val 


Gly 


Tyr 


Asp 


Phe 


His 


Val 


Phe 


Gin 


Ala 


785 








790 










7 95 










800 


Thr 


Glu 


Val 


Gin 


Leu 


Tyr 


Val 


Gly 


Met 


Asn 


Asn 


He 


Phe 


Asn 


Ser 


Phe 










805 










810 










315 




Gin 


Lys 


Asp 


Phe 


Asp 


Arg 


Gly 


Ala 


Ala 


Arg 


Asp 


Ser 


Gly 


TyL- 


He 


Tyr 








020 










825 










830 






Gly 


Pro 


Tlir 


Gin 


Pro 


Arg 


Thr 


Gly 


Tyr 


Met 


Gly 


Leu 


Val 


Val 


hys 


Phe 






835 










84 0 










845 









(2) IHFORMATIOIJ FOR SEQ ID 110:460 



(i) SEQUEUCK CHARACTERISTICS: 

(A) LEHGTH: 2 02 amino acids 
70 {B) TYPE: amino acid 

( D ) TO POLOGY : linear 

iii) MOLECULE TYPE: protein 

75 (iii) HYPOTHETICAL: YES 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



402/490 



10 



55 



60 



65 



70 



75 



(vi) ORIGINAL SOURCE: 

(A) ORGAMISM: Porphyromoiias gingival is 

(ix) FEATURE: 

(A) MAI IE/ KEY: misc^feature 

(B) LOCATION 1 . . .202 

(xi) 3EQUEMCE DESCRIPTION: SEQ ID NO: ^60 



Met Thr Val Lys Arg Ala Val Arg lie Ala Leu Leu Thr Leu lie Gly 
1 5 10 15 

lie Leu Phe Ser Ser Pro Ser Leu Val Arg Ala Gin Ser Leu Fhe Ser 
20 2 5 30 

15 Thr Glu His Val Leu Gin Leu Tyr Asn Lys lie Leu Tyr Gly Glu Ser 
35 40 45 

Ala Ala Asp Thr Val Ala Glu Lys Thr Ala Gly Glu Ser Ala Phe Pro 

50 55 60 

Phe He Asp Lys Leu He Asn Leu Gly Arg Thr Phe Leu Gly Lys Fro 
20 65 70 ^75 80 

Tyr Arg Tyr Arg Gly Pro Ser Pro Trp Pro 1 let Asp Cys Ser Gly Tyr 

05 90 95 

Val Ser Tyr Leu Tyr Ser Lys Phe Asp He Lys Leu Pro Arg Gly Ala 
100 105 110 

25 Ala Ala Gin Ser Gin Tyr Thr Asn Pro He Glu Arg Glu Asp Val Arg 
115 120 125 

Pro Gly Asp Leu Leu Phe Phe Lys Gly Arg Asn Ala Arg Ser Asn Arg 

130 135 1^0 

He Gly His Val Ala Leu Val Val Ser Val Asp Glu Asp Asp He Thr 
30 1^5 150 155 160 

Met Met His Ser Arg Asn Ser Arg Gly He Val He Glu Lys Leu Asn 

165 170 175 

Arg Ser Ala Tyr Phe Ser Arg Arg Leu Val Ser Tyr Gly Arg Val Pro 
180 185 1?0 

35 Gly Ala Lys Arg Val He Pro Arg Lys Ser 
195 200 

(2) INFORMATION FOR SEQ ID NO: 4 61 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 55 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

45 Hi) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

( V i ) OR I G I HAL S OU RCE : 
50 (A) ORGANISM: Porphyromonas gingivalis 

{ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 ... 455 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 61 



Met 


Lys 


Arg 


Thr 


He 


Leu 


Leu 


Thr 


Ala 


Leu 


Thr 


Val 


Leu 


Ser 


Ser 


Leu 


1 


















10 










15 




Ser 


Leu 


Leu 


Arg 
20 


Ala 


Gin 


Asn 


Glu 


Ser 
25 


Glu 


Ala 


Ser 


Thr 


Asn 

30 


Pro 


Met 


Ser 


Gly 


Leu 

35 


Ser 


Leu 


Glu 


Asp 


Cys 


He 


Arg 


He 


Ala 


Lys 
45 


Glu 


Arg 


Asn 


Leu 


Asn 


Leu 


Arg 


Arg 


Gin 


Glu 


He 


Glu 


Gin 


Glu 


Asn 


Arg 


He 


He 


Ser 




50 










55 










60 








Leu 


Asp 


Ala 


Ala 


Arg 


His 


Ser 


Phe 


Leu 


Pro 


Ser 


Val 


Asn 


Ala 


Gly 


He 


65 










70 










75 










8 0 


Gly 


His 


Asn 


Tyr 


Ser 
85 


Phe 


Gly 


Arg 


Ser 


90 


Asp 


Lys 


Thr 


Gly 


Val 
95 


Thr 


Val 


Asp 


Arg 


Ser 
100 


Ser 


Met 


Asn 


Thr 


Asn 
105 


Leu 


Ser 


He 


Gly 


Ala 
110 


Ser 


Va 1 


Glu 


Val 


Phe 
115 


Ser 


Gly 


Thr 


Arg 


Arg 

120 


Leu 


His 


Asp 


Leu 


Ly5 
125 


Gin 


Gin 


Lys 


Tyr 


Asn 
130 


Val 


Glu 


Asp 


Gly 


He 
135 


Ala 


Arg 


Leu 


Gin 


Lys 
14 0 


Ala 


Arg 


Glu 


Asp 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOGID; <WO 9929870A1J_> 



wo 99/29870 



PCT/AU98/01023 



403/490 





Leu 


Ser 


Leu 


Gin 


He 


Ala 


Ala 


Leu 


Tyr 


He 


Asn 


Leu 


Leu 


Phe 


Arg 


Gin 




145 










150 










155 










1 60 




Glu 


tiet 


Thr 


Arg 


Thr 


Ala 


Glu 


Thr 


Gin 


Leu 


Ala 


Leu 


He 


Arg 


Gl u 


Gin 










165 










17 0 










175 




5 


Arg 


Asn 


Arg 


Thr 


Ala 


Glu 


Met 


Val 


Arg 


Val 


Gly 


Lys 


Trp 


Ala 


Glu 


Gly 






180 










185 










190 








l.ys 


Leu 


Leu 


Asp 


He 


Asn 


Ala 


Gin 


Met 


Ala 


Lys 


Asp 


Glu 


Gin 


Leu 


Leu 








195 








200 










205 










Val 


Gin 


Tyr 


Arg 


Ser 


Glu 


Glu 


Glu 


Leu 


Ala 


Arg 


Leu 


Asp 


Leu 


Gly 


Gin 


10 




210 








215 










220 












Ala 


Leu 


Glu 


Leu 


Glu 


His 


Pro 


Giu 


Ser 


He 


Ala 


Val 


Lys 


Ala 


Pro 


Asp 














230 










235 










240 




Thr 


Asp 


Val 


Leu 


Val 


Ala 


Gl u 


Arg 


Leu 


Gly 


Ser 


Leu 


Leu 


Ala 


Pro 


Glu 










245 










250 










255 




15 


Glu 


lie 


Tyr 


Arg 


Thr 


Ala 


Leu 


Gly 


Leu 


Lys 


Pro 


Ala 


Leu 


His 


Ser 


Ser 








2 60 










265 










27 0 








Glu 


Leu 


Gin 
275 


He 


Ala 


Ser 


Aia 


Arg 
280 


Glu 


Gly 


Leu 


Ala 


Ser 
285 


Ala 


Arg 


Ala 




Ala 


Tyr 


Phs 


Pro 


Thr 


Leu 


Ser 


Leu 


Ser 


Ala 


Gly 


Tyr 


Ser 


Asn 


Gly 


Tyr 


20 




2 90 










295 










300 












Phe 


Arg 


Asp 


Leu 


Gly 


Ly^ 


Giu 


Tyr 


Ala 


Ala 


He 


Asn 


Pro 


Ser 


Phe 


Ser 




305 








310 










315 










320 




Glu 


Gin 


Trp 


Lys 


Asn 


Asn 


Gly 


Ser 


Tyr 


Ser 


He 


Gly 


Leu 


Ser 


Leu 


Asn 








325 










330 










335 




25 


lie 


Pro 


He 


Phe 
340 


Ser 


Ala 


Met 


Gin 


Thr 
345 


Gin 


Asp 


Arg 


Val 


Arg 
350 


Ser 


Ser 




Arg 


Leu 


Gin 


He 


Arg 


Ser 


Ser 


Glu 


Leu 


Arg 


Leu 


Val 


Gl u 


Glu 


Lys 


Lys 






355 










360 










365 










Ala 


Leu 


Tyr 


Lys 


Glu 


He 


Arg 


Gin 


Ala 


Tyr 


Ser 


Asn 


Ala 


Val 


Ala 


Ala 


30 




370 










375 










380 












Asp 


Lys 


Ala 


He 


Ala 


Ala 


Ala 


Glu 


Asn 


Ser 


Lys 


Ala 


Ala 


Thr 


Leu 


Lys 




385 










390 










395 










400 




Ala 


Tyr 


Glu 


Tyr 


Ala 


Arg 


Asp 


Ser 


Phe 


Glu 


Ala 


Gly 


Arg 


Leu 


Ser 


Aia 








405 










410 










415 




35 


Tyr 


Glu 


Tyr 


Ala 


Glu 


Ala 


Lys 


Thr 


Lys 


Tyr 


Ala 


Leu 


Ser 


Gin 


Val 


Glu 








420 










425 










430 








Glu 


Leu 


Arg 
435 


Ala 


Lys 


Tyr 


Asp 


Phe 
440 


He 


Tyr 


Lys 


Ala 


Lys 
445 


Val 


Leu 


Asp 




Phe 


Tyr 


Gin 


Gly 


Lys 


Asp 


Phe 




















40 




450 










455 





















(2) IMFORHATIOH FOR SEQ ID MO;462 

(i) SEQUEHCE CHARACTERISTICS: 
45 (A) LENGTH: 444 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



50 



55 



60 



65 



70 



75 



Hi) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIUAL SOURCE: 

(A) ORGAHISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) MAJ'1E/?:EY: misc_feature 

(B) LOCATIOM 1 ... 444 





C-.i) 


SEQUEHCE DESCRIPTIOH: SEQ ID MO: 462 










Met 


Arg 


Phe 


Gin 


His 


Tyr 


Leu 


He 


Cys 


Thr 


Ala 


Ala 


Val 


Ala 


Ala 


Leu 


1 






c 










10 










15 




Ala 


Al a 


Asn 


Fro 


Leu 


Thr 


Gly 


Gin 


Ser 


Asn 


1 let 


Thr 


Leu 


Glu 


Glu 


Cys 








20 










5 










30 






He 


Asp 


Tyr 


Ala 


Arg 


Arg 


His 


Ser 


Ser 


Ala 


Va 1 


Ala 


Leu 


Ser 


Ala 


Ala 




35 










40 










45 








Glu 


Leu 


Glu 


Gin 


Ser 


Lys 


Ala 


Asp 


Tyr 


Leu 


Gin 


Ala 


Val 


Gly 


Asn 


Phe 




50 








55 










60 










Leu 


Pro 


Arg 


Val 


Ser 


Ala 


Gly 


Thr 


Gly 


Ala 


Ser 


Trp 


Asn 


Phe 


Gly 


Arg 


65 








70 










75 










8 0 


Gly 


Leu 


Asp 


Ala 


Glu 


Thr 


Asn 


Thr 


Tyr 


Thr 


Asp 


He 


Asn 


Ser 


Phe 


Asn 






85 










90 










95 




Asn 


Ser 


Tyr 


Ser 


He 


His 


Ala 


Thr 


Met 


Thr 


Leu 


Phe 


Asp 


Gly 


Leu 


Gin 








100 










105 










110 







SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929S70A1_I_> 



45 



50 



70 



75 



WO 99/29870 PCT/AU98/01023 

404/ 490 





Ser 


Val 


Tyr 
115 


Arg 


Leu 


Arg 


Met 


Ala 
120 


His 


Ala 


Arg 


Arg 


Glu 
125 


Aia 


ser 


Arg 




Leu 


Ser 
130 


Val 


Arg 


Glu 


Gin 


Gin 
135 


Giu 


Leu 


Ala 


Ala 


Leu 
140 


Gly 


Thr 


Thr 


Glu 


5 


Ala 
1^5 


Tyr 


Tyr 


Asp 


Leu 


Val 
150 


Tyr 


Ala 


Arg 


Gin 


Met 
155 


Gin 


Glu 


Leu 


Ala 


Met 
160 




Gin 


Lys 


Tyr 


Glu 


Glu 
165 


Ser 


Ser 


Arg 


Leu 


His 
170 


Arg 


Gin 


Thr 


Ala 


Arg 
175 


Met 


10 


Glu 


Gl u 


Leu 


Gly 


Met 


Lys 


Ser 


Arg 


Pro 


J\sp 


Val 


Leu 


Giu 


Met 


Gin 


Ser 








180 










185 










190 








Arg 


Met 


Ala 
195 


Gly 


Asp 


Arg 


Leu 


Ala 
200 


Leu 


Thr 


Gin 


Ala 


Asp 

205 


Asn 


Gin 


Cys 




lie 


lie 


Ala 


Leu 


lie 


Arg 


Leu 


Lys 


Giu 


Lys 


Met 


Asn 


Fhe 


Pro 


lie 


Asp 


15 




210 










215 










220 










Asp 

225 


Glu 


Leu 


Val 


Val 


Asp 
230 


Asp 


Met 


Pro 


Ala 


Asp 

235 


Ser 


Leu 


Ser 


Aia 


Asp 
240 




Met 


Ala 


Glu 


Ser 


Asp 
245 


Ser 


Ser 


Ala 


Gly 


Val 

250 


Phe 


Ala 


Arg 


Ala 


Ala 
255 


His 


20 


His 


His 


Pro 


Val 


Leu 


Leu 


Arg 


Ala 


Lys 


Leu 


Asp 


Glu 


Gin 


Ala 


Ala 


Thr 








2 GO 








265 










270 








Asp 


Arg 


Leu 
215 


Arg 


Ala 


Ala 


Arg 


Gly 
280 


Ala 


Phe 


Leu 


Pro 


Ser 
286 


Val 


Ser 


Val 




Ser 


Gly 


Gly 


Trp 


Asn 


Thr 


Gly 


Phe 


Ser 


Arg 


Phe 


Leu 


Asn 


Gly 


Ser 


Asp 


25 




2?0 










2 *^5 










300 










Tyr 
305 


Thr 


Pro 


Phe 


Ser 


Glu 
310 


Gin 


Phe 


Arg 


Asn 


Arg 
315 


Arg 


Gly 


Glu 


Tyr 


Val 
320 




Ser 


Leu 


Asn 


Leu 


Ser 
325 


lie 


Pro 


lie 


Phe 


Ser 
330 


Gly 


Phe 


Ser 


Leu 


Val 

335 


Ser 


30 


His 


Leu 


Acq 


Gin 


Ala 


Arg 


Ala 


Glu 


Arg 


Arg 


Ala 


Ala 


lie 


Vai 


Arg 


Arg 








340 










345 










350 








Gly 


Glu 


Ala 

355 


Glu 


Arg 


Arg 


Leu 


Tyr 
360 


Ser 


Glu 


lie 


Ala 


Gin 

365 


Ala 


Met 


Ala 




Asp 


Arg 


Asp 


Ala 


Ala 


Leu 


Ala 


Ser 


Tyr 


Arg 


Gin 


Ala 


Lys 


Glu 


His 


Thr 


35 




370 










375 










380 










Asp 
385 


Ala 


net 


Gin 


Thr 


Aia 
390 


Tyr 


Glu 


Ala 


Val 


Leu 
3 95 


Gin 


Arg 


Tyr 


Giu 


Glu 
4 00 




Gly 


Leu 


Asn 


Thr 


Ala 
405 


lie 


Asp 


Leu 


Thr 


Thr 
410 


Gin 


Ala 


Asn 


Arg 


Leu 
415 


Leu 


40 


Asp 


Ala 


Arg 


Val 


Gin 


Arg 


Leu 


Arg 


Ala 


Ala 


Jlet 


Thr 


Tyr 


Arg 


Leu 


Lys 








420 










425 










430 








Cys 


Lys 


Leu 
435 


lie 


Aia 


Tyr 


Tyr 


Gly 
440 


Cys 


Leu 


Ser 


Asp 











(2) IMFORMATIOM FOR SEQ ID HO: 463 

(i) SEQUEIICE CHARACTERISTICS: 

(A) LEliGTH: 94 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGV: linear 

(ii) MOLECULE TYPE: protein 

tiii) HYPOTHETICAL: YES 



55 (vi) ORIGINAL SOURCE: 

(A) ORGAIIISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) MAME/KEY: mi sc_f ea ture 
60 {B) LOCATIOl! 1...94 0 

(xi ) SEQUEIICE DESCRIPTION: SEQ ID 110:463 



65 



Met 


Asn 


Li-s 


Pl-)e 


Tyr 


Lys 


Ser 


Leu 


Leu 


Gin 


Ser 


Gly 


Leu 


Ala 


Aia 


Fhe 


1 








5 










10 










15 




Val 


Ser 


Met 


Aia 

20 


Thr 


Ala 


Leu 


Thr 


Ala 

25 


Ser 


Ala 


Gin 


lie 


Ser 

30 


Phe 


'3iy 


Gly 


Giu 


Pro 
35 


Leu 


Ser 


Phe 


Ser 


Ser 
40 


Arg 


Ser 


Aia 


Gly 


Thr 
45 


His 


Ser 


Phe 


Asp 


Asp 
50 


Aia 


Met 


Thr 


lie 


Arg 
55 


Leu 


Thr 


Pro 


Asp 


Phe 
60 


Asn 


Pro 


Glu 


Asp 


Leu 


lie 


Ala 


Gin 


Ser 


Arg 


Trp 


Gin 


Ser 


Gin 


Arg 


Asp 


Gly 


Arg 


Pro 


Val 


65 










70 










7 5 










80 


Arg 


lie 


Gly 


Gin 


Val 

85 


lie 


Pro 


Val 


Asp 


Vai 
90 


Asp 


Phe 


Ala 


Ser 


Lys 
95 


Ala 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 



PCT/AU98/01023 



405/490 



Ser His lie Ser Ser He Gly Asp Val Asp Val Tyr Arg Leu Gin Phe 

100 105 110 

hys Leu Glu Gly Ala Lys Ala He Thr Leu Tyr Tyr Asp Ala Phe Asn 
115 120 125 

5 He Pro Glu Gly Gly Arg Leu Tyr Tie Tyr Thr Pro Asp His Glu He 
130 135 1^0 

Val Leu Gly Ala Tyr Thr Asn Ala Thr His Arg Arg Asn Gly Ala Fhe 
145 150 155 160 

Ala Thr Glu Pro Val Pro Gly Ser Glu Leu He tlet Asp Tyr Glu Val 
10 155 170 175 

Ser Arg Gly Gly Thr Leu Pro Asp He Lys He Ser Gly Ala Gly Tyr 

180 185 190 

He Phe Asp Lys Val Gly Gly Arg Pro Val Thr Asp Asn His Tyr Gly 
195 200 205 

15 He Gly Glu Asp Asp Ser Asp Ser Asp Cys Glu He Asn He Asn Cys 
210 215 220 

Pro Glu Gly Ala Asp Trp Gin Ala Glu Lys Asn Gly Val Val Gin Met 
225 230 235 2A0 

lie llet Val Lys Gly Gin Tyr He Ser Met Cys Ser Gly Asn Leu Leu 
20 245 250 255 

Asn Asn Thr Lys Gly Asp Phe Thr Pro Leu He He Ser Ala Gly His 

260 265 270 

Cys Ala Ser He Thr Thr Asn Phe Gl'y Val Thr Gin Ser Glu Leu Asp 
275 280 285 

25 Lys Trp He Fhe Thr Phe His Tyr Glu Lys Arg Gly Cys Ser Asn Gly 
2 90 2 95 300 

Thr Leu Ala He Phe Arg Gly Asn Ser He He Gly Ala Ser Met Lys 
305 310 315 320 

Ala Phe Leu Pro He Lys Gly Lys Ser Asp Gly Leu Leu Leu Gin Leu 
30 325 330 335 

Asn Asp Glu Val Pro Leu Arg Tyr Arg Val Tyr Tyr Asn Gly Trp Asp 

340 3A5 350 

Ser Thr Pro Asp He Pro Ser Ser Gly Ala Gly He His His Pro Ala 
355 360 365 

35 Gly Asp Ala Met Lys He Ser He Leu Lys Lys Thr Fro Ala Leu Asn 
370 375 380 

Thr Trp He Ser Ser Ser Gly Ser Gly Gly Thr Asp Asp His Fhe Tyr 
385 390 395 -lOO 

Pile Lys Tyr Asp Gin Gly Gly Thr Glu Gly Gly Ser Ser Gly Ser Ser 
40 405 410 ^15 

Leu Phe Asn Gin Asn Lys His Val Val Gly Thr Leu Thr Gly Gly Ala 

420 425 430 

Gly Asn Cys Gly Gly Thr Glu Phe Tyr Gly Arg Leu Asn Ser His Trp 
435 440 445 

45 Asn Glu Tyr Ala Ser Asp Gly Asn Thr Ser Arg Met Asp He Tyr Leu 
4 50 4 55 4 60 

Asp Pro Gin Asn Asn Gly Gin Thr Thr He Leu Asn Gly Thr Tyr Arg 
465 470 475 480 

Asp Gly Tyr Lys Pro Leu Pro Ser Val Pro Arg Leu Leu Leu Gin Ser 
50 485 490 495 

Thr Gly Asp Gin Val Glu Leu Asn Trp Thr Ala Val Pro Ala Asp Gin 

500 505 510 

Tyr Pro Ser Ser Tyr Gin Val Glu Tyr His He Phe Arg Asn Gly Lys 
515 520 525 

55 Glu He Ala Thr Thr Lys Glu Leu Ser Tyr Ser Asp Ala He Asp Glu 
530 535 540 

Ser He He Gly Ser Gly He He Arg Tyr Glu Val Ser Ala Arg Phe 
545 550 555 560 

He Tyr Pro Ser Pro Leu Asp Gly Val Glu Ser Tyr Lys Asp Thr Asp 
60 565 570 575 

Lys Thr Ser Ala Asp Leu Ala He Gly Asp He Gin Thr Lys Leu Lys 

580 585 590 

Pro Asp Val Thr Pro Leu Pro Gly Gly Gly Val Ser Leu Ser Trp Lys 
595 600 605 

65 Val Pro Phe Leu Ser Gin Leu Val Ser Arg Phe Gly Glu Ser Pro Asn 
610 615 620 

Pro Val Phe Lys Thr Phe Glu Val Pro Tyr Val Ser Ala Ala Ala Ala 
625 630 635 640 

Gin Thr Pro Asn Pro Pro Val Gly Val Val He Ala Asp Lys Phe Met 
70 645 650 655 

Ala Gly Thr Tyr Pro Glu Lys Ala Ala He Ala Ala Val Tyr Val Met 

660 665 670 

Pro Ser Ala Pro Asp Ser Thr Phe His Leu Phe Leu Lys Ser Asn Thr 
675 680 685 

75 Asn Arg Arg Leu Gin Lys Val Thr Thr Pro Ser Asp Trp Gin Ala Gly 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929e70A1J_> 



wo 99/29870 PCT/AU98/01 023 

406/490 







690 










695 










700 












Thr 


Trp 


Leu 


Arg 


He 


Asn 


Leu 


Asp 


Lys 


Pro 


Phe 


Pro 


Val 


Asn 


Asn 


Asp 




705 










710 










715 










720 




His 


Met 


Leu 


Phe 


Ala 


Gly 


He 


Arg 


Met 


Pro 


Asn 


Lys 


Tyr 


Ly^ 


Leu 


Asn 


c 
O 










725 










730 










735 






Arg 


Ala 


He 


Ar"g 
7^0 


Tyr 


Val 


Arg 


Asn 


Pro 
7^5 


Asp 


Asn 


Leu 


Phe 


Ser 
750 


He 


Thr 




Gly 


Lys 


Lys 


He 


Ser 


Tyr 


Asn 


Asn 


Gly 


Val 


Ser 


Phe 


Glu 


Gly 


Tyr 


'21 y 


•1 n 






755 










760 










765 






lie 


Pro 
770 


Ser 


Leu 


Leu 


Gly 


Tyr 
77 5 


Met 


Ala 


He 


Lys 


Tyr 

78 0 


Leu 


Val 


Val 


Asn 




Thr 


Asp 


Ala 


Pro 


Lys 


He 


Asp 


Met 


Ser 


Leu 


Val 


Gin 


Glu 


Pro 


Tyr 


Ala 




7 85 




















7 95 










800 




Lys 


Gly 


Thr 


Asn 


Val 


Ala 


Pro 


Phe 


Pro 


Glu 


Leu 


Val 


Gly 


He 


Tyr 


Val 










8 05 










810 










815 






Tyr 


Lys 


Asn 


Gly 
820 


Thr 


Phe 


He 


Gly 


Thr 

825 


Gin 


Asp 


Pro 


Ser 


Val 
830 


Thr 


Thr 




Tyr 


Ser 


Val 


Ser 


Asp 


Gly 


Thr 


Glu 


Ser 


Asp 


Glu 


Tyr 


Glu 


He 


Lys 


Leu 


20 






83 5 










8-5 0 










84 5 








Val 


Tyr 

850 


Lys 


Gly 


Ser 


Gly 


He 
855 


Ser 


Asn 


Gly 


Val 


Ala 
860 


Gin 


He 


Glu 


Asn 




Asn 


Asn 


Ala 


Val 


Val 


Ala 


Tyr 


Pro 


Ser 


Val 


Val 


Thr 


Asp 


Arg 


Phe 


Ser 




865 










87 0 










875 










880 


25 


He 


Ly^ 


Asn 


A], a 


His 


Met 


Val 


His 


Ala 


Ala 


Ala 


Leu 


Tyr 


Ser 


Leu 


Asp 










885 










8 90 










895 




Gly 


Ly^ 


Gin 


Val 
900 


Arg 


Ser 


Trp 


Asn 


Asn 
905 


Leu 


Arg 


Asn 


Gly 


Val 
91 0 


Thr 


Phe 




Ser 


Val 


Gin 


Gly 


Leu 


Thr 


Ala 


Gly 


Thr 


Tyr 


Met 


Leu 


Val 


Met 


Gin 


Thr 


30 






915 










920 










925 








Ala 


ASTl 

930 


Gly 


Pro 


Val 


Ser 


Gin 

935 


Lys 


He 


Val 


Lys 


Gin 
9^0 











(2) in FORMAT I ON FOR SEQ ID NO: 4 6^ 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 670 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: protein 

(iii) }JYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATIO]] 1 , . . 67 0 

50 



55 



60 



65 



70 



75 





(xi 


SEQUENCE DESCRIPTION: SEQ 


ID \\0:464 










Met 


Lys 




Leu 


He 


Arg 


Leu 


Phe 


Leu 


Ser 


Leu 


Met 


Leu 


Leu 


Ser 


Leu 


1 








5 










10 










15 




Trp 


Thr 


Gly 


Cys 


Thr 


His 


Glu 


Glu 


Leu 


Ser 


He 


Cys 


Asp 


Gly 


Glu 


Asn 








2 0 










25 










30 






Thr 


Leu 


Val 


Leu 


Arg 


Val 


Glu 


Thr 


Gly 


Lys 


Ala 


Pro 


Asn 


Ala 


Arg 


Ala 






35 










40 










45 






Thr 


Glu 


Pro 


Gly 


Gin 


Gly 


He 


Tyr 


Asn 


Glu 


Asn 


Lys 


Val 


Gly 


Ser 


He 




50 










55 










60 








Ser 


Val 


Leu 


Phe 


Tyr 


Leu 


Glu 


Gly 


Gin 


Leu 


Arg 


Trp 


Gin 


Val 


Lys 


Ser 


65 










70 










75 








80 


Thr 


Asp 


Tyr 


Gin 


He 


His 


Glu 


Gly 


Ala 


Tyr 


He 


He 


Pro 


Val 


Lys 


Glu 










85 










90 










95 




Gin 


Met 


Arg 


Pro 


Leu 


Phe 


Asn 


Gly 


Asn 


Asn 


Asf^ 


Phe 


Ser 


He 


Tyr 


Val 








100 










105 










110 




Val 


Ala 


Asn 


Leu 


Asp 


Phe 


Asn 


Ala 


Pro 


Ala 


Thr 


Glu 


Ala 


Ala 


Leu 


Ser 






115 










120 










125 








Gin 


Fhe 


Val 


Val 


Glu 


Lys 


Ser 


He 


Glu 


Val 


Ser 


Ser 


Thr 


Thr 


Ala 


Pro 




130 










135 










140 










Ala 


Asp 


Phe 


Val 


Met 


Leu 


Ala 


His 


Gly 


Asn 


Lys 


Gl n 


He 


Asn 


Met 


Ala 


145 










150 










155 










160 


Thr 


Thr 


Glu 


Gly 


Lys 


Leu 


Leu 


Gly 


Asp 


Tyr 


Lys 


Leu 


Lys 


Arg 


Val 


Ala 










165 










170 








175 




Ala 


Lys 


He 


Arg 


Met 


He 


Lys 


Pro 


Thr 


He 


Asn 


Val 


Gin 


Gly 


1'yi^ 


Glu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929S70A1 1 > 



wo 99/29870 



PCT/AU98/01023 



407/ 490 

180 185 190 

Val Yal Gly Asn He Gin Ala Lys Phe Arg Asn Ser Val Thr Lys Gly 

195 200 205 

Phe Leu Thr Tlir Glu Ala Gin Glu lie Pro Als Ala Ala Ser Tyr Lys 
5 210 215 220 

Thr Ser Glu Tyr Leu Asp He Ala Glu Ser Ala Pro Ala Asn Ser He 
225 230 235 2-^0 

Hi;? Phe Tyr Ser Tyr Tyr Asn Lys Trp Thr Leu Ser Thr Pro Glu Lys 
2^5 250 255 

10 Arg Pro Glu Phe Phe He Met Val Lys Phe Lys Lys Thr Gly Gin Pro 
260 265 270 

Asp Asn Thr Ala Lys Pro Tyr Tyr Tyr Arg Val Pro Leu Glu Ser Gin 

275 280 285 

Asp Asn Gin Val Lys Ser Asn Val Leu Tyr Asn Leu Asn Val Lys He 
15 290 295 300 

Glu He Leu Gly Ser Leu Gin Glu Pro Glu Ala Val Ser Val Asn Gly 
305 310 315 320 

Thr Leu Ala He Glu Glu Trp He Leu His Gin Asp Ala Phe Asn Leu 
325 330 335 

20 Pro Ala Thr Asn Tyr Leu He Val Glu Gin His Glu He Phe Het Asn 
340 345 350 

Asn Val Asn Thr Tyr Ser Val Lys Tyr Gin Thr Ser Gin Lys Pro He 

355 360 365 

Ser He Ser He Gin Ser Val Thr Phe Ser Tyr Val Ser Ser Asp Gly 
25 370 375 300 

Thr Gin Mis Asn Asp Leu Val Ala Ser Ser Ser Asp Gin Tyr Pro Thr 
385 390 395 4 00 

He Thr Ser Asp Asn Thr Ser He He He Thr Ser Lys He Pro Val 
405 410 415 

30 Asn Asn Val Pro Lys Lys He Val Phe Glu Val Thr Asn Gly Val Ala 
420 425 430 

Gly Leu Lys Glu Thr Val Thr Val Leu Gin Tyr Pro Ala Gin Phe He 

435 440 445 

Val Asn Thr Leu Gly Thr Ala Ser Ala Trp Arg Pro Asp Gly Ser Leu 
35 450 455 460 

Ala Pro Gly Leu Asn Asn Lys Ala He Tyr His Val Val Val Leu Val 
465 470 475 480 

Pro Pro Glu Asn Leu Phe Glu Asp Gly Thr Gin Thr He He Gly Tyr 
485 490 495 

40 Pro Pro Thr Glu Thr He Ser Phe His Lys Lys Glu Asn Asn Thr Tyr 
500 505 510 

Pro He Val Trp Ser Asp Thr Asn Thr Thr Lys Gin Asp Leu Glu Thr 

515 520 525 

Ser Arg Met He Ser Pro Ser Phe Glu Leu Ala Ser Gin Leu Gly Ala 
45 530 535 540 

Thr Leu Pro Met Pro Tyr Leu Glu Tyr Trp Pro Gly Thr Ser Tyr Leu 
545 550 555 560 

Leu Asp Tyr Ser Gly Asn Tyr Asn Asn Lys Arg Tyr Ala Leu Phe Asn 
565 570 575 

50 Cys Ala Phe Tyr Trp Glu Lys Arg Lys Val Asn Asn Glu Glu He Lys 
580 585 590 

Phe Asp Asp Trp Arg Leu Pro Thr Glu Ala Glu He Lys Leu He Asp 

595 600 605 

Lys Leu Gin His Asn Glu Gin Ser Ala Val Gin Ala He Met Thr Gly 
55 610 615 620 

Asn Tyr Tyr Trp Asp Ser Tyr Ser Ala Asn Gly Ser Tyr Lys Met Gin 
625 630 635 640 

Gly Gly Gly Gly Gin Gly Asn Ser Ser Lys Ala Tyr Val Arg Cys Val 
645 650 655 

60 Arg Asp Val Lys Lys Pro He Arg Asp Lys Lys Ser Gly Lys 
660 665 670 

(2) IMFORMATIOtJ FOR SEQ ID HO: 4 65 

65 (i) SEQUEIICE CHARACTERISTICS: 

(A) LEIIGTH: 12 82 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

70 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIIIAL SOURCE: 
75 (A) ORGAIIISM: Porphyromonas gingivalis 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <W0 9929870A1_I_> 



wo 99/29870 



PCT/AU98/01023 



408/ 490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(ix) FEATURE: 

(A) IJAIIE/KEY: mi sc_ feature 

(B) LOCATION 1 . . .1282 

SEQUENCE DESCRI PTIOII : SEQ ID MO: '165 





Arg 




He 




^er 


Phe 


L'='U 


M e t 


j.j d x_ 




oe X 


L*^u 


n 1 S 


Le u 


Gl j 




















1 n 










1 5 




Gin 


S'^' r 


Gin 


Th r 


Trp 


His 




MSp 


Pro 


Asp 


Ser 


V d X 


n 1 ^ 
MX d 


Al a 


Leu 








2 0 










2 5 










30 






P ro 


Ser 


He 


<jX J 


He 


Gl 






S'^-r 




Thr 


rg 


11'=' 


i nr 


rne 


'j1 u 






35 










A n 










H o 








Val 


Val 


trn e 


Pro 




PK 

r ne 


Tyr 


Ser 


V d X 




Lys 


Arg 


Gl U 


Gly 


Asn 


Gin 




R n 










c: c; 










60 








V ct i- 


Ph 


\3i n 


Arg 


lie 


Ser 


VI e t 


Pro 


ijly 


Cys 


Gly 


Ser 


Phe 


Gly 


Asn 


Leu 


65 










7 Q 










/ 3 










8 0 


n 1 \- 


Glu 


Al a 


Glu 


L«=u 


Pro 


Va 1 


Leu 


Lys 


Lys 


t'let 


T 1 1=1 

X X e 


Mid 


Val 


Pro 


Glu 










O O 




















95 




t nfe 


Ser 


Thr 


Ml a 


Asn 


V ca 1 


MX a 


\/ - 1 

Vd X 


Lys 


lie 


Lys 


Gl u 


Thx' 


Glu 


Thr 


Phe 


















1 05 










110 






Asp 


AsTi 


Tyr 


Asn 


lie 


Tyr 


Pro 


Asn 


Pro 


Thr 


Tyr 


Va 1 


Val 


Glu 


Glu 


Leu 
















12 0 










125 








Pro 


G 1 n 


Gly 


Gly 


Thr 


T y r 


Leu 


Va 1 


Glu 


Ala 


Phe 


Ala 


He 


Asn 


Asn 


Asp 




1 3 'J 










135 










14 0 








Tyr 


Tyr 


Ser 


Gin 


As 11 


Va 1 


Ser 


Leu 


Pro 


Ser 


Thx' 


His 


Tyr 


Val 


Tyr 


Ser 


1 -1 5 










1 5 >j 










155 








1 GO 


Gl n 






Tyr 


fne 


Arg 


Ser 


LjX n 


Arg 


Phe 


lie 


Glu 


Va 1 


Thr 


Leu 


Tyr 










X DO 










17 0 










17 5 




ro 




rg 


Tyr 


Asn 


Pro 


V <j X 


Arg 


oxn 


Glu 


He 


Leu 


Phe 


Al a 


Lys 


Lys 








1 ft n 

i D u 










185 










190 






X X e 


IjI u 


V a X 


Thr 


X X e 


Thr 


Phe 


Asp 


Asn 


Pro 


Gl n 


Pro 


Pro 


Leu 


Gin 


Lys 






195 










200 










2 05 








JKsv\ 


Th r 


1 y 


T 1 en 

X X e 


f ne 


As n 


T ^-c- 


Va 1 


n 1 ^ 
MX a 


Ser 


Ser 


Al a 


Phe 


He 


Asn 


Tyr 




210 










Z X ^ 










220 








'"lu 


Ala 


nsp 


Lily 




o e r 


MX a 


Tic 

X X e 


oX u 


li iz- ri 

Mt ri 


Asp 


I let 


Va 1 


Phe 


Ser 


Arg 


225 










9 ri 

Z J L' 










2 35 










24 0 


J' 


Thr 


Thr 


Th 1- 

i nr 




Tic 

X X e 


oe r 


^jx y 


Asn 


/al 


Al a 


Ser 


Asn 


Leu 


Pro 


Gin 










24 5 










z ou 










255 








Asp 


1 yr 


Leu 


Va 1 


T 1 13 

X X e 


Tyr 


Asp 


Asp 


Met 


Phe 


As n 


Va 1 


Asn 


Gin 








260 










9 

Z OQ 










27 0 






n 


Pro 


n IS 


Asp 




Tic 

lie 


Lys 


Arg 


Leu 


Cys 


Glu 


His 


Arg 


Al a 


Phe 


Tyr 






275 










9 p o 

Z O 










285 












IT ne 


Asp 
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Lys 


His 


Leu 


Ser Glu 


Gly 


Tyr 






1235 








1240 








1245 






Tyr 


Ser 


He 


Gin 


Fhe 


Asn 


Thr 


Ser 


Asn 


Phe 


Asp 


Pro 


Gly Phe 


Tyr 


Leu 




1250 








1255 








12&0 
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1200 
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He Lys 

(2) IIirORMATlOH FOR SEQ ID IIO:466 



(i) SEQUEl-JCE CHARACTERISTICS: 

(A) LENGTH: 127 4 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

30 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromona s gingival is 

35 (i>:) FEATURE: 

(A) KAME/KEY: mis cofeature 

(B) LOCATION 1 . . . 1274 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 66 
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275 280 285 

Ser lie Lys Asp Val Leu Asn Ser Phe Pro Ser Asn Ala Thr Ser Tyr 

290 295 300 

lie Asn Glu Thr Lys Leu Lys Asn Phe lie Arg Ser Val Tyr Asn Gin 
5 306 310 315 320 

Ser Asn Ala Lys Arg Thr Leu Asp Gly Lys Leu Gly Tyr Val Leu Leu 

325 330 335 

He Gly Lys Pro Leu Ser Lys Tyr Leu Ala Asp Thr Asp Asn Thr Lys 
340 345 350 

10 Val Pro Thr Ser Phe He His Asn Val Ser Leu He Pro Ser His Pro 
355 360 365 

Thr Phe Gly Ser He Cys Ala Ser Asp Tyr Phe Phe Ser Cys Val Ser 

370 375 380 

Pro Leu Asp Thr Val Gly Asp Leu Phe He Gly Arg Phe Ser Val Thr 
15 385 390 395 400 

Asn Ala His Glu Leu Mis Asn Leu He Glu Lys Thr He Asn Lys Glu 

4 05 410 415 

He Ser Tyr Asn Pro He Ala His Lys Asn He Leu Tyr Ala Glu Gly 
420 425 430 

20 Lys Gly Cys Asp Ala Pro He Leu Arg Leu Phe Leu Lys Glu He Ala 
435 440 445 

Ser Gly Tyr Thr Val Asn Ser He Leu Lys Ser Asn Gin Val Ser Ala 

450 455 460 

He Asp Ser He Phe Asp Cys Leu Asn Asn Gly Ser His His Phe Tyr 
25 465 470 475 480 

Phe Asn Thr His Gly I let Pro Thr Val Trp Gly He Gly Gin Gly Leu 

4 85 4 90 4 95 

Asp Val Asn Thr Leu Thr Ala Arg Leu Asn Asn Thr Ser Ser Gin Gly 
500 505 510 

30 Leu Cys Thr Ser Leu Ser Cys Ser Ser Ala Val Ala Asp Ser Thr He 
515 520 525 

Arg Ser Leu Gly Glu Val Leu Thr Thr Tyr Ala Pro Asn Lys Gly Phe 

530 535 540 

Ser Ala Phe Leu Gly Gly Ser Arg Ala Thr Gin Tyr Ala Val Tyr Leu 
35 545 550 555 560 

Glu Gly Pro Cys Pro Pro Ser Glu Phe Tyr Glu Tyr Leu Pro Tyr Ser 

565 570 575 

Leu Tyr His Asn Leu Ser Thr Val Val Gly Glu Met Leu Leu Ser Ser 
580 585 590 

40 He He Asn Thr Asn Ser Val Asp Thr Tyr Ser Lys Phe Asn Phe Asn 
595 600 605 

Leu Leu Gly Asp Pro Ala Leu Asn He Met Ala His Gly Met Glu Val 

610 615 620 

Ser Asn Cys He Thr Leu Pro Asn Asn Thr He He Ser Ser Pro He 
45 625 630 635 640 

Thr He Lys Asn Gly Gly Cys Leu Lys He Pro Glu Lys Gly Val Leu 

645 650 655 

His Phe Thr Asn Asn Gly Ser He Gin Val Met Ser Gly Gly Thr Leu 
660 665 670 

50 Glu He Gly Asn Gin Ala Lys He Ser Gly Glu Thr Gly Ala Asn Pro 
675 680 685 

Thr Phe He Thr Val Tyr Gly Asp Gly Leu Ala He Asn Lys Gin Val 

690 695 700 

Glu He Asp Asn He Asp Arg Leu Asn Leu Phe Ser Thr His Ser Val 
55 705 710 715 720 

Met Pro Lys Phe His Phe Asp Ser Val Lys Phe Asn Ser Ala Pro Leu 

725 730 735 

Tyr Thr Thr Asn Cys He Val Glu He Ser Asn Cys Glu Phe Thr Asn 
74 0 745 7 50 

60 Arg Ser Asp He He Ser Lys Asn Cys Asp Leu Ser Val Glu Asn Ser 
755 760 765 

I let Phe Ser Ser Ser Gly He Thr Val Phe Lys Pro Met Ala Thr Ser 

770 775 780 

Ser He Thr Gly Leu Ser Thr Lys Ala Lys He Thr Asp Asn Thr Phe 
65 785 790 795 800 

Phe Ala Thr Gly Asn Phe Ala Tyr His He Thr Asn Thr Pro Gly Leu 

805 810 815 

Thr Ala Tlir Ser Asn Ala Ala He Lys Leu Asp Asn He Pro Glu Tyr 
820 825 830 

70 Tyr He Ser Gly Asn Lys He Val Asn Cys Asp Glu Ala Leu Val Leu 
835 840 845 

Asn Asn Ser Gly Asn Arg Thr Asn Arg Leu His Asn He Thr Arg Asn 

850 855 860 

Val He Lys Asi^. Cys Arg He Gly Ser Thr Leu Tyr Asn Ser Tyr Gly 
75 865 87 0 87 5 88 0 
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Ser 
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Ser 
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Phe 


His 


Tyr 


Asn 


Ser 


Leu 


Gl n 
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Gly 


Arg 


Ser 


Gly 


Asn 


Ser 
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Ala 


Glu 


Ala 


Val 






995 










1000 








1005 






Glu 


Phe 


Gin 


Thr 


Ala 


Leu 


Asp 


Cys 


He 


Gly 


Asn 


Ser 


Asp 


Tyr 


Leu 


Ser 




1010 








1015 








1020 








Ala 


Ly5 


Val 


Ala 


Leu 


Lys 


Met 


Met 


Val 


Glu 


Thr 


Tyr 


Pro 


Glu 


Ser 


Asp 


1025 








1030 








1035 








1040 


Phe 


Ala 


He 


Ala 


Ala 


Leu 


Lys 


Glu 


Leu 


Phe 


Arg 


He 


Glu 


Lys 


1-1 et 


Ser 










1045 








1050 








1055 


Gly 


Asn 


Asp 


Tyr 


Glu 


Gly 


Leu 


Lys 


Asp 


Tyr 


Phe 


Arg 


Ser 


Asn 


Pro 


Thr 








1060 








1065 








1070 




lie 


He 


Ser 


Ser 


Gin 


Asn 


Leu 


Fhe 


Pro 


Thr 


Ala 


Asp 


Phe 


Leu 


Ser 


Ala 






107 5 








1080 








1085 






Arq 


Cy- 


Asp 


He 


Val 


Cys 


Glu 


Asn 


Tyr 


GI n 


Ser 


Ala 


He 


Asp 


Trp 


Tyr 




1 090 








1095 








1100 








G I u 


Asn 


Arg 


Leu 


Asn 


Ser 


Glu 


He 


Ser 


Tyr 


Gin 


Asp 


Ser 


Val 


Phe 


Ala 


1105 








1110 








1115 








1120 


Val 


He 


Asp 


Leu 


Gly 


Asp 


He 


Tyr 


Trp 


Asn 


Met 


Gin 


Leu 


Asp 


Ser 


Leu 










1125 








1130 








1135 


Arn 


Gly 


Thr 


Gly 


He 


Asp 


Leu 


Asn 


He 


Leu 


Ser 


cys 


Glu 


Gin 


Arg 


Lys 








1140 








1145 








1150 




Ser 


Leu 


Glu 


Ser 


His 


Gin 


Asn 


Val 


Lys 


Asn 


Tyr 


Leu 


Leu 


Ser 


Thr 


Leu 






1155 








1160 








1165 






Pro 


Glu 


Ser 


Thr 


Gly 


Thr 


Leu 


Leu 


Pro 


Pro 


Leu 


Glu 


cys 


Asn 


Lys 


Ser 




1170 








1175 








1180 








Ser 


Leu 


Asp 


Lys 


Ser 


Lys 


He 


He 


Ser 


He 


Ser 


Pro 


Asn 


Pro 


Ala 


Lys 


118 










1190 








1195 








1200 


Ala 


Val 


Val 


Thr 


He 


He 


Tyr 


Tyr 


Thr 


Asp 


Asn 


Pro 


Ser 


cys 


Ser 


Val 










1205 








1210 








1215 


lie 


Lys 


He 


Tyr 


Gly 


He 


Asn 


Gly 


Ala 


Ser 


Ala 


Asp 


He 


Thr 


Gly 


Leu 








1220 








1225 








1 


230 




Pro 


Lys 


His 


Leu 


Ser 


Glu 


Gly 


Tyr 


Tyr 


Ser 


He 


Gin 


Phe 


Asn 


Thr 


Ser 






123 










1240 








1245 






Asn 


Phe 


Asp 


Pro 


Gly 


Phe 


Tyr 


Leu 


Val 


Thr 


Leu 


Asn 


Val 


Asp 


Gin 


Lys 




125 










1255 








1 


2 60 








He 


He 


Asp 


Thr 


Glu 


Lys 


Leu 


Arg 


He 


Lys 















10 



15 



20 



25 



30 



35 



40 



45 



50 1265 1270 

(2) inrORMATlOIJ FOR SEQ ID NO: 4 67 

(i) SEQUEMCE CHARACTERISTICS: 
55 (A) LEMGTH: 925 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: proteii-) 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIDAL SOURCE: 

(A) ORGATflSM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAUE/KEY: misc_£eature 

(B) LOCATION 1 . . . 925 

70 (xi) SEQUEHCE DESCRI PTIOli : SEQ ID HO: 4 67 

Met Ala He Met Met Lys Ser He Val Phe Arg Ala Phe Leu Thr He 

15 10 15 

Leu Leu Ser Trp Ala Ala He Thr Asn Pro Thr Ala Gin Glu He Ser 

75 20 2 5 30 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO_9929870A1J_> 



wo 99/29870 



PCT/AU98/01023 



413/ 490 



GJy Met Asn Ala Ser Cys Leu Ala Ala Pro Ala Gin Pro Asp Thr lie 

35 JO 0 5 

Leu Tyr Glu Ser Phe Glu Asn Gly Pro Val Pro Asn Giy Trp Leu Glu 
50 55 60 

5 He Asp Ala Asp Ala Asp Gly Ala Thr Trp Gly Ser Pro Ser Gly Ser 
65 70 -75 80 

Phe Ser Val Pro Tyr Gly His Asn Gly Leu Cys Thr Tyr Ser His lie 

85 9 0 95 

Arg Ser Gly He Ser Thr Ala Gly Asn Tyr Leu He Thr Pro Asn He 
10 100 105 HO 

Glu Gly Ala Lys Arg Val Lys Tyr Trp Val Cys Asn Gin Tyr Ser Thr 

115 120 125 

Asn Pro Glu His Tyr Ala Val Met Val Ser Thr Thr Gly Thr Ala He 
130 135 l-iO 

15 Glu Asp Phe Val Leu Leu Phe Asp Asp Ser He Thr Gly Lys Pro Thr 
1^5 150 155 160 

Pro Leu Val Trp Arg Arg Arg He Val Asp Leu Pro Glu Gly Thr Lys 

165 110 I'^S 

Tyr He Ala Trp Arg His Tyr Lys Val Thr Asp Ser His Thr Glu Phe 
20 180 185 190 

Leu Lys Leu Asp Asp Val Thr Val Tyr Arg Ser He Glu Gly Pro Glu 

195 2 00 2 05 

Pro Ala Thr Asp Phe Thr Val He Asn He Gly Gin Asn Val Gly Arg 
210 215 220 

25 Leu Thr Trc Asn Tyr Fro Glu Asp Tyr Gin Pro Glu Gly Lys Gly Asn 
225 230 235 2'10 

Glu Glu Leu Gin Leu Ser Giy Tyr Asn He Tyr Ala Asn Gly Thr Leu 

245 250 255 

Leu Ala Gin He Lys Asp Val Ser He Leu Glu Tyr Val Asp Ser Thr 
30 260 265 270 

Tyr Ser Leu Arg Asp Asn Pro Leu Gin Val Glu Tyr Cys Val Thr Ala 

275 280 285 

Val Tyr Asp Glu Ser He Glu Ser Ser Thr Val Cys Gly Thr Leu His 
290 295 300 

35 Tyr Ala Thr Asp Ala He Leu Tyr Glu Asn Phe Glu Asn Gly Pro Val 
305 310 315 320 

Pro Asn Gly Trp Leu Val He Asp Ala Asp Gly Asp Gly Phe Ser Trp 

325 330 335 

Gly His Tyr Leu Asn Ala Tyr Asp Ala Phe Pro Gly His Asn Gly Gly 
40 340 345 350 

His Cys Ser Leu Ser Ala Ser Tyr Val Pro Gly He Gly Pro Val Thr 

355 360 365 

Pro Asp Asn Tyr Leu He Thr Pro Lys Val Glu Gly Ala Lys Arg Val 
370 375 300 

45 Lys Tyr Trp Val Ser Thr Gin Asp Ala Asn Trp Ala Ala Glu His Tyr 
385 390 395 400 

Ala Val Met Ala Ser Thr Thr Gly Thr Ala Val Gly Asp Phe Val He 

4 05 410 415 

Leu Phe Glu Glu Thr Met Thr Ala Lys Pro Thr Gl-y Ala Trp Tyr Glu 
50 420 425 430 

Arg Thr He Asn Leu Pro Glu Gly Thr Lys Tyr He Ala Trp Arg His 

435 440 445 

Tyr Asn Cys Thr Asp He Tyr Phe Leu Lys Leu Asp Asp He Thr Val 
450 455 460 

55 Phe Gly Thr Pro Ala Ser Glu Pro Glu Pro Val Thr Asp Phe Val Val 
4eS 470 475 480 

Ser Leu He Glu Asn Asn Lys Gly Arg Leu Lys Trp Asn Tyr Pro Asn 

485 490 495 

Gly Tyr Glu Pro Asp Lys Thr Asp Asp Lys Asp Pro Leu Gin Leu Ala 
60 500 505 510 

Gly Tyr Asn He Tyr Ala Asn Gly Ser Leu Leu Val His He Gin Asp 

515 520 525 

Pro Thr Val Leu Glu Tyr He Asp Glu Thr Tyr Ser Ser Arg Asp Asp 
530 535 54 0 

65 Gin Val Glu Val Glu Tyr Cys Val Thr Ala Val Tyr Asn Asp Asn He 
545 550 555 560 

Glu Ser Gin Ser Val Cys Asp Lys Leu He Tyr Asp Ser Gin Ser Asp 

565 570 575 

He He Leu Tyr Glu Gly Phe Glu Ala Gly Ser He Pro Glu Gly Trp 
70 580 585 590 

Leu Leu He Asp Ala Asp Gly Asp Asn Val Asn Trp Asp Tyr Tyr Pro 

595 600 605 

Trp Thr llet Tyr Gly His Asp Ser Glu Lys Cys He Ala Ser Pro Ser 
610 615 620 

75 Tyr Leu Pro Met He Gly Val Leu Thr Pro Acp Asn Tyr Leu Val Thr 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_ 



wo 99/29870 PCT/AU98/01023 

414/ 490 



10 



15 



20 



25 



30 



35 



40 



45 



65 



70 



75 



625 










630 










635 










64 0 


Pro 


Arg 


Leu 


Glu 


Gly 


Ala 


Lys 


Leu 


Val 


Ly- 


Tyr 


Trp 


Val 


Ser 


Ala 


Gin 










64 5 










es 0 










655 




Asp 


Ala 


Val 


Tyr 


Ser 


Al a 


Glu 


Hi s 


Tyr 


Al a 


Val 


I iet 


Val 


Ser 


Thr 


Thr 








660 










665 










67 0 






Gly 


Thr 


Ala 


Val 


Glu 


Asp 


Phe 


Val 


Leu 


Leu 


Phe 


Gl u 


Glu 


Thr 


1 let 


Thr 






675 










680 










685 








Ala 


Lys 


Ala 


Asn 


Gly 


Ala 


Trp 


Tyr 


Glu 


Arg 


Thr 


He 


Thr 


Leu 


Pro 


Ala 




690 










695 










7 00 










Gly 


Thr 


Lys 


Tyr 


He 


Al a 


Trp 


Arg 


His 


Tyr 


Asp 


Cys 


Thr 


Asp 


Met 


Phe 


7 05 










710 










7 15 










720 


Phe 


Leu 


Leu 


Leu 


Asp 


Asp 


He 


Thr 


Val 


Tyr 


Arg 


Ser 


Thr 


Glu 


Thr 


Val 










7 2 5 










730 










735 




Pro 


Gl u 


Pro 


Val 


Thr 


Asp 


Phe 


Val 


Val 


Ser 


Leu 


He 


Glu 


Asn 


Asn 


Lys 








74 0 










745 










750 






Gly 


Arg 


Leu 


Lys 


Trp 


Asn 


Tyr 


Pro 


Asn 


Gly 


Tyr 


Glu 


Pro 


Asp 


Lys 


Thr 






7 55 










7 60 










7 65 








Asp 


Asp 


Lys 


Lys 


Pro 


Leu 


Gin 


Leu 


Thr 


Gly 


Tyr 


Asn 


He 


Tyr 


Ala 


Asn 




77 0 










7 7 5 










7 80 










Gly 


Ser 


Leu 


Leu 


Val 


His 


He 


Gin 


Asp 


Pro 


Thr 


Val 


Leu 


Glu 


Tyr 


He 


7 85 










7 90 










7 95 










800 


Asp 


Glu 


Thr 


Tyr 


Ser 


Ser 


Arg 


Asp 


Gly Gin 


Val 


Gl u 


f'let 


Glu 


Tyr 


Cys 










805 










810 










815 




Va 1 


Thr 


Al u 


Va 1 


Tyr 


Asn 


Asp 


Asn 


He 


Glu 


Ser 


Gin 


Ser 


Val 


Cys 


Asp 








82 0 










825 










830 






Ly^ 


Leu 


A^n 


Tyr 


Thr 


He 


Thr 


Ser 


Leu 


Asp 


Asn 


He 


Gin 


Ser 


Asp 


Thr 






835 










840 










8 45 








Ser 


Leu 


Lys 


He 


Tyr 


Pro 


Asn 


Pro 


Ala 


Ser 


Tyr 


Val 


Val 


Arg 


He 


Glu 




850 










855 










860 










Gly 


Leu 


Ser 


Arg 


Ser 


Lys 


Ser 


Thr 


He 


Glu 


Leu 


Tyr 


Asn 


Ala 


Leu 


Gly 


865 










870 










875 










880 


He 


Cys 


He 


Leu 


Arg 


Glu 


Glu 


Thr 


His 


Ser 


Glu 


Lys 


Thr 


Glu 


He 


Asp 










885 










8 90 










8 95 




Val 


Ser 


Arg 


Leu 


Asn 


Asp 


Gly 


Val 


Tyr 


Leu 


He 


Lys 


Val 


Val 


Gly 


Gly 








900 










905 










910 






Asn 


Lys 


Thr 


Thr 


Thr 


Glu 


Lys 


Val 


Glu 


He 


Lys 


Arg 


Pro 












915 










920 










925 









(2) INFORIIATION FOR SEQ ID NO: 4 68 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 922 amino acids 
(E) TYPE; amino acid 
(D) TOPOLOGV: linear 

(ii) MOLECULE TYPE: protein 

{Hi} HYPOTHETICAL: YES 

50 (vi) ORIGIHAL SOURCE: 

(A) ORGAIIISM: Porphyromonas gingiva lis 

(ix) FEATURE: 

(A) NAIIE/KEY: misc_feature 
55 (B) LOCATION 1...922 

(xi) SEQUEKCE DESCRIPTION: SEQ ID 110:4 68 



60 



Met 


Met 


Lys 


Ser 


He 


Val 


Phe 


Arg 


Ala 


Phe 


Leu 


Thr 


He 


Leu 


Leu 


Ser 


1 








5 










10 










15 




Trp 


Ala 


Ala 


He 
20 


Thr 


Asn 


Pro 


Thr 


Ala 
25 


Gin 


Glu 


He 


Ser 


Gly 
30 


Met 


Asn 


Ala 


Ser 


'-'J *- 


Leu 


Ala 


Ala 


Pro 


Ala 


Gin 


Pro 


Asp 


Thr 


He 


Leu 


Tyr 


Glu 






35 










40 








45 






Ser 


Phe 
50 . 


Glu 


Asn 


Gly 


Pro 


Val 

55 


Pro 


Asn 


Gly 


Trp 


Leu 
60 


Glu 


He 


Asp 


Ala 


Asp 


Ala 


Asp 


Gly Ala 


Thr 


Trp 


Gly 


Ser 


Pro 


Ser 


Gly 


Ser 


Phe 


Ser 


Val 


65 










70 










7 5 










80 


Pro 


Tyr 


Gly 


His 


Asn 
85 


Gly 


Leu 


Cys 


Thr 


Tyr 
90 


Ser 


His 


He 


Arg 


Ser 
95 


Gly 


He 


Ser 


Thr 


Ala 
100 


Gly 


Asn 


Tyr 


Leu 


He 
105 


Thr 


Pro 


Asn 


He 


Gl u 
110 


Gly 


Ala 


Lys 


Arg 


Val 
115 


Lys 


Tyr 


Trp 


Val 


Cys 
120 


Asn 


Gin 


Tyr 


Ser 


Thr 
125 


Asn 


Pro 


Glu 


His 


Tyr 


Ala 


Val 


Met 


Val 


Ser 


Thr 


Thr 


Gly 


Thr 


Ala 


He 


Glu 


Asp 


Phe 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



415/ 490 



130 135 I'lO 

Val L-u Leu The Asp A^p Ser He Thr Gl y hys Pro Thr Fro Leu Val 
1^5 150 155 160 

Trp Arg Arg Arg He Val Asp Leu Pro Glu Gly Thr Lys Tyr He Ala 
5 165 170 

Trp Arg His Tyr Lys Val Thr Asp Ser His Thr Glu Phe Leu Lys Leu 

180 185 190 

Asp Asp Val Thr Val Tyr Arg Ser He Glu Gly Pro Glu Pro Ala Thr 
195 200 205 

10 Asp Phe Thr Val He Asn He Gly Gin Asn Val Gly Arg Leu Thr Trp 
210 215 220 

Asn Tyr Pro Glu Asp Tyr Gin Pro Glu Gly Lys Gly Asn Glu Glu Leu 
225 230 235 240 

Gin Leu Ser Gly Tyr Asn He Tyr Ala Asn Gly Thr Leu Leu Ala Gin 
15 245 250 255 

He Lys Asp Val Ser He Leu Glu Tyr Val Asp Ser Thr Tyr Ser Leu 

260 265 270 

Arc Asp Asn Pro Leu Gin Val Glu Tyr Cys Val Thr Ala Val Tyr Asp 
275 280 285 

20 Glu Ser He Glu Ser Ser Thr Val Cys Gly Thr Leu His Tyr Ala Thr 
290 295 300 

A5^p Ala He Leu Tyr Glu Asn Phe Glu Asn Gly Pro Val Pro Asn Gly 
305 310 315 320 

Trp Leu Val He Asp Ala Asp Gly Asp Gly Phe Ser Trp Gly His Tyr 
25 325 330 335 

Leu Asn Ala Tyr Asp Ala Phe Pro Gly His Asn Gly Gly His Cys Ser 

34 0 3.i5 350 

Leu Ser Ala Ser Tyr Val Pro Gly He Gly Pro Val Thr Pro Asp Asn 
355 360 365 

30 Tyr Leu He Thr Pro Lys Val Glu Gly Ala Lys Arg Val Lys Tyr Trp 
370 375 380 

Val Ser Thr Gin Asp Ala Asn Trp Ala Ala Glu His Tyr Ala Val Met 
385 390 395 400 

Ala Ser Thr Thr Gly Thr Ala Val Gly Asp Phe Val He Leu Phe Glu 
35 405 410 415 

Glu Thr Met Thr Ala Lys Pro Thr Gly Ala Trp Tyr Glu Arg Thr He 

420 425 430 

Asn Leu Pro Glu Gly Thr Lys Tyr He Ala Trp Arg His Tyr Asn Cys 
435 440 445 

40 Thr Asp He Tyr Phe Leu Lys Leu Asp Asp He Thr Val Phe Gly Thr 
450 455 460 

Pro Ala Ser Glu Fro Glu Pro Val Thr Asp Phe Val Val Ser Leu He 
465 470 475 480 

Glu Asn Asn Lys Gly Arg Leu Lys Trp Asn Tyr Pro Asn Gly Tyr Glu 
45 485 490 495 

Pro Asp Lys Thr Asp Asp Lys Asp Pro Leu Gin Leu Ala Gly Tyr Asn 

500 505 SI'-' 

He Tyr Ala Asn Gly Ser Leu Leu Val His He Gin Asp Pro Thr Val 
515 520 525 

50 Leu Glu Tyr He Asp Glu Thr Tyr Ser Ser Arg Asp Asp Gin Val Glu 
530 535 540 

Val Glu Tyr Cys Val Thr Ala Val Tyr Asn 7\sp A^n He Glu Ser Gin 
545 550 555 560 

Ser Val Cys Asp Lys Leu He Tyr Asp Ser Gin Ser Asp He He Leu 
55 565 570 575 

Tyr Glu Gly Phe Glu Ala Gly Ser He Pro Glu Gly Trp Leu Leu He 

580 505 5 90 

Asp Ala Asp Gly Asp Asn Val Asn Trp Asp Tyr Tyr Pro Trp Thr Met 
595 600 605 

60 Tyr Gly His Asp Ser Glu Lys Cys He Ala Ser Pro Ser Tyr Leu Pro 
610 615 620 

Met He Gly Val Leu Thr Pro Asp Asn Tyr Leu Val Thr Pro Arg Leu 
625 630 635 64 0 

Glu Gly Ala Lys Leu Val Lys Tyr Trp Val Ser Ala Gin Asp Ala Val 
65 645 650 655 

Tyr Ser Ala Glu His Tyr Ala Val Met Val Ser Thr Thr Gly Thr Ala 

660 665 670 

Val Glu Asp Phe Val Leu Leu Phe Glu Glu Thr Met Thr Ala Lys Ala 
675 680 685 

70 Asn Gly Ala Trp Tyr Glu Arg Thr He Thr Leu Pro Ala Gly Thr Lys 
690 695 700 

Tyr He Ala Trp Arg His Tyr Asp Cys Thr Asp Met Phe Phe Leu Leu 
•705 710 715 "720 

Leu Asp Asp He Thr Val Tyr Arg Ser Thr Glu Thr Val Pro Glu Pro 
75 725 730 "735 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO_9929870A1 J_> 



wo 99/29870 



PCT/AU98/01023 



416/490 





Va 1 


Thr 


Asp 


Phe 
7 4 0 


Va 1 


Val 


Ser 


Leu 


He 
745 


Glu 


Asn 


As n 


Lys 


Gly 
750 


Arg 


Leu 




Ly5 


Trp 


A3n 
755 


Tyr 


Pro 


Asn 


Gly 


Tyr 
760 


Gl u 


Pro 


Asp 


Lys 


Thr 

765 


Asp 


Asp 


Lys 


0 


Ijys 


Pro 
770 


Leu 


Gl n 


Leu 


Thr 


Gly 
775 


Tyr 


Asn 


He 


Tyr 


Ala 
780 


Asn 


Gly 


Ser 


Leu 




Leu 


Val 


His 


He 


Gin 


Asp 


Pro 


Thr 


Val 


Leu 


Glu 


Tyr 


He 


Asp 


Glu 


Thr 




7 85 










7 90 










7 95 










800 


1 n 

IV 


Tyr 


Ser 


Ser 


Arg 


Asp 


Giy 


Gin 


Val 


Glu 


Met 


Glu 


Tyr 


Cys 


Val 


Thr 


Ala 










805 










810 










815 






Val 


Tyr 


Acn 


Asp 
820 


Asn 


He 


Glu 


Ser 


Gin 
825 


Ser 


Val 


Cys 


Asp 


Lys 
830 


Leu 


Asn 




Tyr 


Thr 


He 


Thr 


Ser 


Leu 


Asp 


Asn 


He 


Gin 


Ser 


Asp 


Thr 


Ser 


Leu 


Lys 


15 






035 










8^0 










845 








He 


Tyr 
850 


Pro 


Asn 


Pro 


Ala 


Ser 
855 


Tyr 


Val 


Val 


Arg 


He 

860 


Glu 


Gly 


Leu 


Ser 




Arg 


Ser 


Lys 


Ser 


Thr 


He 


Glu 


Leu 


Tyr 


Asn 


Ala 


Leu 


Gly 


He 


Cys 


He 




865 










870 










875 










880 


20 


Leu 


Arg 


Glu 


Glu 


Thr 


His 


Ser 


Glu 


Lys 


Thr 


Glu 


He 


Asp 


Val 


Ser 


Arg 










885 










890 










8 95 






Leu 


Asn 


Asp 


'31 y 

900 


Val 


Tyr 


Leu 


lie 


Lys 
905 


Val 


Val 


Gly 


Gly 


Asn 
910 


Lys 


Thr 




Thr 


Thr 


Glu 
915 


Lys 


Val 


Glu 


He 


Lys 
920 


Arg 


Pro 















25 



45 



50 



55 



60 



65 



70 



75 



(2) IIIFORriATIOIl FOR SEQ ID MO: 4 69 



(i) SEQUEMCE CHARACTERISTICS: 

(A) LEMGTH: 921 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

35 (iii) HYPOTHETICAL: YES 

ivi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival i£ 

40 (ix) FEATURE: 

(A) IJAJ^E/KEY: misc_feature 

(B) LOCATION 1 . . , 921 



(::i) SEQUENCE DESCRI PTIOIM SEQ ID NO: 469 



Met 


Lys 


Ser 


He 


Val 


Phe 


Arg 


Ala 


Phe 


Leu 


Thr 


He 


Leu 


Leu 


Ser 


Trp 


1 








5 










10 










15 




Ala 


Ala 


He 


Thr 


Asn 


Pro 


Thr 


Ala 


Gin 


Glu 


He 


Ser 


Gly 


Met 


Asn 


Ala 








20 










25 










30 






Ser 


s 


Leu 


Ala 


Ala 


Pro 


Ala 


Gin 


Pro 


Asp 


Thr 


He 


Leu 


Tyr 


Glu 


Ser 






35 










40 










45 








Phe 


Glu 


Asn 


Gly 


Pro 


Val 


Pro 


Asn 


Gly 


Trp 


Leu 


Glu 


He 


Asp 


Ala 


Asp 




£0 










55 










60 










Ala 


Asp 


Gly 


Ala 


Thr 


Trp 


Gly 


Ser 


Pro 


Ser 


Gly 


Ser 


Phe 


Ser 


Val 


Pro 


65 










70 










75 










80 


Tyr 


Gly 


His 


Asn 


Gly 


Leu 


Cys 


Thr 


Tyr 


Ser 


His 


He 


Arg 


Ser 


Gly 


He 










85 










90 










95 




Ser 


Th r 


Ala 


Gly 


Asn 


Tyr 


Leu 


He 


Thr 


Pro 


Asn 


He 


Glu 


Gly Ala 


Ly^ 








100 










105 










110 






Arg 


Val 


Lys 


Tyr 


Trp 


Val 


Cys 


Asn 


Gin 


Tyr 


Ser 


Thr 


Asn 


Pro 


Glu 


His 






115 










120 










125 








Tyr 


Ala 


Val 


Met 


Val 


Ser 


Thr 


Thr 


Gly 


Thr 


Ala 


He 


Glu 


Asp 


Phe 


Val 




130 










135 










14 0 










Leu 


Leu 


Phe 


Asp 


Asp 


Ser 


He 


Thr 


Gly 


Lys 


Pro 


Thr 


Pro 


Leu 


Val 


Trp 


145 










150 










155 










160 


Arg 


Arg 


Arg 


He 


Val 


Asp 


Leu 


Pro 


Glu 


Gly 


Thr 


Lys 


Tyr 


He 


Ala 


Trp 










165 










170 










175 




Arg 


His 


Tyr 


Lys 


Val 


Thr 


Asp 


Ser 


His 


Thr 


Glu 


Phe 


Lea 


Lys 


Leu 


Asp 








180 










185 










190 






Asp 


Val 


Thr 


Val 


Tyr 


Arg 


Ser 


He 


Glu 


Gly 


Pro 


Glu 


Pro 


Ala 


Thr 


Asp 






195 










200 










205 








Phe 


Thr 


Val 


He 


Asn 


He 


Gly 


Gin 


Asn 


Val 


Gly 


Arg 


Leu 


Thr 


Trp 


Asn 




210 










215 










220 










Tyr 


Pro 


Glu 


Asp 


Tyr 


Gin 


Pro 


Glu 


Gly 


Lys 


Gly 


Asn 


Glu 


Glu 


Leu 


Gin 


225 










230 










235 










240 



BNSDOCJD: <WD 9929870A1J_> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



PCT/AU98/01023 



417/ 490 

Leu Ser Gly Tyr Asn II- Tyr Ala Asn G]y Thr Leu Leu Ala Gin lie 

245 250 255 

Lys Val Ser lie Leu Glu Tyr Val Asp Ser Thr Tyr Ser Leu Arg 

260 265 270 

5 Asp Asn Pro Leu Gin Val Glu Tyr Cys Val Thr Ala Val Tyr Asp Glu 
275 280 285 

Ser lie Glu Ser Ser Thr Val Cys Gly Thr Leu His Tyr Ala Thr Asp 

2 90 295 300 

Ala Tie Leu Tyr Glu Asn Phe Glu Asn Gly Pro Val Pro Asn Gly Trp 
10 305 310 315 320 

Leu Val lie Asp Ala Asp Gly Asp Gly Phe Ser Trp Gly His Tyr Leu 

325 330 335 

Asn Ala Tyr Asp Ala Phe Pro Gly His Asn Gly Gly His Cys Ser Leu 
3A0 34 5 350 

15 Ser Ala Ser Tyr Val Pre Gly He Gly Pro Val Thr Pro Asp Asn Tyr 
355 360 365 

Leu He Thr Pro Lys Val Glu Gly Ala Lys Arg Val Lys Tyr Trp Val 

370 375 380 

Ser Thr Gin Asp Ala Asn Trp Ala Ala Glu His Tyr Ala Val Met Ala 
20 385 390 395 400 

Ser Thr Thr Gly Thr Ala Val Gly Asp Phe Val He Leu Phe Giu Glu 

405 ^10 415 

Thr Met Thr Ala Lys Pro Thr Gly Ala Trp Tyr Glu Arg Thr He Asn 
420 425 430 

25 Leu Pro Glu Gly Thr Lys Tyr He Ala Trp Aug His Tyr Asn Cys Thr 
435 4 40 4 45 

Asp He Tyr Phe Leu Lys Leu Asp Asp lie Thr Val Phe Gly Thr Pro 

450 45b 460 

Ala Ser Glu Fro Glu Pro Val Thr Asp Phe Val Val Ser Leu He Glu 
30 465 470 475 480 

Asn Asn Lys Gly Arg Leu Lys Trp Asn Tyr Pro Asn Gly Tyr Glu Pro 

485 490 495 

Asp Lys Thr Asp Asp Lys Asp Pro Leu Gin Leu Ala Gly Tyr Asn He 
500 505 510 

35 Tyr Ala Asn Gly Ser Leu Leu Val His He Gin Asp Pro Thr Val Leu 
515 520 525 

Glu Tyr He Asp Glu Thr Tyr Ser Ser Arg Asp Asp Gin Val Giu Val 

530 535 540 

Glu Tyr Cys Val Thr Ala Val Tyr Asn Asp Asn He Glu Ser Gin Ser 
40 545 550 555 560 

Val Cys Asp Lys Leu He Tyr Asp Ser Gin Ser Asp He He Leu Tyr 

565 570 575 

Glu Gly Phe Glu Ala Gly Ser He Pro Glu Gly Trp Leu Leu He Asp 
580 585 590 

45 Ala Asp Gly Asp Asn Val Asn Trp Asp Tyr Tyr Pro Trp Thr 14et Tyr 
595 600 605 

Gly His Asp Ser Glu Lys Cys He Ala Ser Pro Ser Tyr Leu Pro I'let 

610 615 620 

He Gly Val Leu Thr Pre Asp Asn Tyr Leu Val Thr Pro Arg Leu Glu 
50 625 630 635 640 

Gly Ala Lys Leu Val Lys Tyr Trp Val Ser Ala Gin Asp Ala Val Tyr 

645 650 655 

Ser Ala Glu His Tyr Ala Val Met Val Ser Thr Thr Gly Thr Ala Val 
660 665 670 

55 Glu Asp Phe Val Leu Leu Phe Glu Glu Thr Met Thr Ala Lys Ala Asn 
675 680 685 

Gly Ala Trp Tyr Glu Arg Thr He Thr Leu Pro Ala Gly Thr Lys Tyr 

690 695 700 

He Ala Trp Arg His Tyr Asp Cys Thr Asp Met Phe Phe Leu Leu Leu 
60 705 710 715 720 

Asp Asp He Thr Val Tyr Arg Ser Thr Glu Thr Val Pro Glu Pro Val 

7 25 7 30 735 

Thr Asp Phe Val Val Ser Leu He Glu Asn Asn Lys Gly Arg Leu Lys 
740 745 7 50 

65 Trp Asn Tyr Pro Asn Gly Tyr Glu Pro Asp Lys Thr Asp Asp Lys Lys 
755 760 765 

Pro Leu Gin Leu Thr Gly Tyr Asn He Tyr Ala Asn Gly Ser Leu Leu 

770 775 78 0 

Val His He Gin Asp Fro Thr Val Leu Glu Tyr He Asp Glu Thr Tyr 
70 785 790 795 800 

Ser Ser Arg Asp Gly Gin Val Glu Met Giu Tyr Cys Val Thr Ala Val 

805 910 815 

Tyr Asn Asp Asn He Glu Ser Gin Ser Val Cys Asp Lys Leu Asn Tyr 
820 825 830 

75 Thr He Thr Ser Leu Asp Asn He Gin Ser Asp Thr Ser Leu Lys He 
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835 840 845 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Tyr 


Pro 
850 


Asn 


Pro 


Ala 


Ser 


Tyr 
855 


Val 


Val 


Arg 


He 


Glu 

8 60 


Gly 


Leu 


Ser 


Arg 


Ser 


hys 


Ser 


Thr 


He 


Glu 


Leu 


Tyr 


Asn 


Ala 


Leu 


Gly 


He 


Cys 


He 


Leu 


C65 










870 










875 










880 


Arg 


Glu 


Gl u 


Thr 


His 
885 


Ser 


Gl u 


Lys 


Thr 


Glu 

890 


He 


Asp 


Val 


Ser 


Arg 
895 


Leu 


Asn 


Asp 


Gly 


Val 
900 


Tyr 


Leu 


He 


Lys 


Val 
905 


Val 


Gly 


Gly 


Asn 


Ly- 

91 0 


Thr 


Thr 


Thr 


Glu 


Lys 
915 


Val 


Glu 


He 


Lys 


Arg 

920 


Pro 

















10 



(2) IMFORIIATION FOR SEQ ID 110:4 70 

15 (i) SEQUEHCE CHARACTERISTICS: 

(A) LEJJGTH: 593 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (ii) riOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

( vi ) ORIGIl lAL SOURCE : 
25 (A) ORGAIIISII: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAllE/KEY: misc_feature 

(B) LOCATIOM 1, . . 593 



(xi) SEQUEHCE DESCRIPTION; SEQ ID UO:470 



Met 


Asn 


Ser 


He 


Met 


Lys 


Tyr 


Gin 


Leu 


Tyr 


Thr 


Ala 


Val 


He 


Met 


Ala 


1 








5 










10 










15 




Leu 


Ser 


Val 


Ser 


Ser 


Val 


Cys 


Gly 


Gin 


Thr 


Pro 


Arg 


Asn 


Thr 


Glu 


Thr 








20 










25 










30 






Lys 


Arg 


Pro 


Asp 


Thr 


Leu 


Arg 


Arg 


Gl u 


Leu 


Thr 


He 


Val 


Asn 


Asp 


Gin 






35 










40 










4 5 








Thr 


Val 


Glu 


Met 


Glu 


His 


Ala 


Asp 


Pro 


Leu 


Pro 


Ala 


Ala 


Tyr 


Lys 


Ala 




50 










55 










60 








He 


Glu 


Pro 


Arg 


Leu 


Lys 


Pro 


Phe 


Arg 


Pro 


Glu 


Tyr 


Asn 


Lys 


Arg 


Thr 


65 










70 










75 










80 


Phe 


Gly 


Phe 


Val 


Pro 


Glu 


Val 


Ser 


Ser 


Ser 


Gly 


Arg 


Asn 


Asn 


Leu 


Pro 










85 










90 










95 




A^n 


He 


Leu 


Pro 


Thr 


Glu 


Gly 


His 


Met 


Lys 


His 


Arg 


Gly 


Tyr 


Leu 


Asn 








100 










105 










110 






He 


Gly 


He 


Gly 


His 


Thr 


Leu 


Asn 


Gin 


Arg 


Met 


Asp 


Ala 


Gly 


Tyr 


Arg 






115 










120 










125 








Leu 


He 


Asp 


Ala 


Glu 


Gin 


Glu 


Arg 


Leu 


As n 


Leu 


Phe 


Leu 


Ser 


Tyr 


Arg 




130 










135 










14 0 










-31 y 


net 


Lys 


Ser 


Ala 


Phe 


Asn 


Thr 


Gly 


Asp 


Phe 


Asp 


Gly 


Asp 


Arg 


Lys 


145 










150 










155 










160 


Asp 


Arg 


Arg 


Met 


Met 


Ala 


Gly 


Val 


Asp 


Tyr 


Glu 


Gin 


Arg 


Arg 


Pro 


Ser 










165 










17 0 










175 




Phe 


Val 


Leu 


Ala 


Thr 


Gly 


Leu 


Tyr 


Tyr 


Ser 


Asn 


His 


Tyr 


Phe 


Asn 


Asn 








180 










185 










190 






Tyr 


Gly 


Arg 


Gly 


Ala 


Thr 


Thr 


Asn 


Val 


Gly 


Ser 


He 


Pro 


Gin 


Leu 


Ser 






195 










200 










205 








Thr 


Pro 


Val 


Thr 


Pro 


Gin 


Met 


Asp 


Asn 


Gly 


Thr 


His 


Asn 


Val 


Arg 


Val 




210 










215 










220 










Tyr 


Leu 


Gly 


Ala 


Lys 


Asn 


Asp 


Val 


He 


Asp 


Ala 


Arg 


He 


Asp 


Tyr 


Arg 


225 










230 










235 










24 0 


Phe 


Phe 


Arg 


Ser 


He 


Pro 


Tyr 


Leu 


Gly 


Thr 


Asp 


Pro 


Met 


Ly^ 


Ala 


Leu 










245 










250 










255 




Thr 


Glu 


His 


Thr 


Pro 


Gl u 


Leu 


Asn 


Val 


Thr 


I let 


Ser 


Asn 


Glu 


Leu 


Ser 








260 










265 
















Asp 


A-p 


He 


Lys 


Leu 


Gly 


Val 


Glu 


Val 


Arg 


Thr 


Gly 




Leu 


Fhe 


Fh- 






275 










280 










285 








Ala 


Lys 


Asn 


Ser 


Glu 


Met 


He 


Gin 


Thr 


Gly 


Val 


Leu 


Ser 


Giu 


Thr 


Asp 




2 90 










2 95 










300 










Arg 


A^n 


Leu 


Tyr 


Tyr 


Val 


Glu 


Gly 


Ala 


Pro 


Thr 


He 


Gly 


Phe 


Val 


'3iy 


305 










310 










315 










320 


Asp 


Ser 


Asp 


Asn 


Met 


Gin 


Trp 


Asn 


He 


Gin 


Ala 


Gly 


Val 


Gly 


He 


Ser 










325 










330 










335 




Ser 


His 


Phe 


Gly 


Ala 


Lys 


Gly 


Arg 


Leu 


Pile 


Phe 


Trp 


Pro 


Lys 


Leu 


Asp 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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419/ 490 



45 



55 



60 



65 



70 



75 









34 0 










345 










350 






Ala 


Ser 


Leu 


Ser 


He 


Phe 


Pro 


Ser 


Trp 


Arg 


Val 


Tyr 


Ala 


Lys 


Ala 


Phe 






355 










360 










365 








Gly 


'31 y 


Val 


He 


Arg 


Asn 


Gly 


Leu 


Al a 


Asp 


Val 


net 


Gin 


Glu 


Glu 


net 




370 










375 










380 










Pro 


Tyr 


Leu 


Met 


Fro 


Asn 


Thr 


He 


Val 


Leu 


Pro 


Ser 


Arg 


Asn 


Ala 


Leu 


385 










390 










395 










4 00 


Thr 


Ala 


Gin 


Leu 


Gly 


Val 


Lys 


Gly 


Asn 


He 


Ala 


Asp 


Val 


Val 


Arg 


1 let 










405 










4 10 










415 




Glu 


Val 


Tyr 


Gly 


As'p 


Phe 


Ser 


Lys 


Leu 


Thr 


Gly 


Val 


Fro 


Phe 


Tyr 


Thr 








420 










425 










430 






Pro 


Thr 


Leu 


Pro 


Leu 


Tyr 


Asn 


Pro 


Ser 


Asp 


Leu 


Tyr 


Gin 


Tyr 


Asn 


Val 






435 










440 










4 45 








Ser 


Phe 


Leu 


Pro 


He 


Tyr 


Ala 


Asp 


Gly 


Ser 


Arg 


Trp 


Arg 


Ala 


Gly 


Gly 




450 










455 










460 










Ly^ 


Leu 


Glu 


Tyr 


Ser 


Tyr 


Arg 


Asp 


Met 


Leu 


Arg 


Phe 


Leu 


Val 


Asp 


Ala 


465 










470 










475 










4 80 


Ser 


Tyr 


Gly 


Lys 


Trp 


Asn 


Leu 


Asp 


Gly 


Gly 


Leu 


Val 


Ala 


Ser 


Met 


Gin 










4 85 










4 90 










4 95 




Pro 


Asp 


Leu 


He 


Leu 


Lys 


Ala 


Glu 


Val 


Gly 


Val 


His 


Pro 


He 


Ala 


Pro 






500 










505 










510 






Leu 


Asp 


Val 


Arg 


Leu 


Arg 


Tyr 


Thr 


Gin 


Leu 


Asn 


Gly 


Arg 


Tyr 


Arg 


Tyr 




515 










520 










525 








Ser 


t^'he 


Gly 


Ser 


Ala 


Gly 


Ser 


Glu 


Ala 


Leu 


Gly 


He 


Gly 


Asn 


Val 


His 




530 










535 










54 0 










Leu 


Leu 


Ser 


Ala 


Asp 


Val 


Ser 


Tyr 


Lys 


Leu 


Lys 


Lys 


Asn 


Leu 


Ser 


Leu 


545 










550 










555 










560 


Tyr 


Leu 


Ly^ 


He 


Asp 


Asn 


net 


Leu 


Ala 


Glu 


Thr 


Thr 


Glu 


Leu 


He 


Gly 








565 










57 0 










57 5 




T-yr 


Tyr 


Pro 


net 


Gin 


Pro 


Phe 


His 


Cys 


Phe 


Ala 


Gly 


Phe 


Ser 


Trp 


Tlir 








58 0 










585 










590 






Phe 

































10 



15 



20 



25 



30 



35 (2) IIIFORnATIOM FOR SEQ ID HO: 4 71 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LEMGTH: 58 9 amino acids 

(B) TYPE: amino acid 
40 ( D) TOPOLOGY: linear 

{ii) nOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGAIIISn: Porphyromonas gingival is 



{i::} FEATURE: 
50 (A) MAIIE/KEY : misc_featur€ 

(B) LOCATIOM 1 ... 589 



(xi) SEQUENCE DESCRIPTIOII: SEQ ID MO: 471 



net 


Lys 


Tyr 


Gin 


Leu 


Tyr 


Thr 


Ala 


Val 


He 


list 


Ala 


Leu 


Ser 


Val 


Ser 


1 






5 










10 










15 




Ser 


Val 


Cys 


Gly 


Gin 


Thr 


Pro 


Arg 


Asn 


Thr 


Glu 


Thr 


Lys 


Arg 


Fro 


Asp 






2 0 










25 










30 






Thr 


Leu 


Arg 


Arg 


Glu 


Leu 


Thr 


He 


Val 


Asn 


Asp 


Gin 


Thr 


Val 


Glu 


net 






35 










40 










45 








Glu 


His 


Ala 


Asp 


Pro 


Leu 


Pro 


Ala 


Ala 


Tyr 


Lys 


Ala 


He 


Glu 


Pro 


Arg 




50 










55 










60 










Leu 


Lys 


Pro 


Phe 


Arg 


Pro 


Glu 


Tyr 


Asn 


Lys 


Arg 


Thr 


Phe 


Gly 


Phe 


Val 


65 










70 










75 










80 


Pro 


Glu 


Val 


Ser 


Ser 


Ser 


Gly 


Arg 


Asn 


Asn 


Leu 


Pro 


As n 


He 


Leu 


Pro 










85 










90 










95 




Thr 


Glu 


Gly 


His 


net 


Lys 


His 


Arg 


Gly 


Tyr 


Leu 


Asn 


He 


Gly 


He 


Gly 








100 










105 










110 






His 


Thr 


Leu 


Asn 


Gin 


Arg 


net 


Asp 


Ala 


Gly 


Tyr 


Arg 


Leu 


He 


Asp 


Ala 






115 










120 










125 








Glu 


Gin 


Glu 


Arg 


Leu 


Asn 


Leu 


Phe 


Leu 


Ser 


Tyr 


Arg 


Gly 


i-let 


Lys 


Ser 




130 










135 










140 










Ala 


Phe 


Asn 


Thr 


Gly 


Asp 


Phe 


Asp 


Gly 


Asp 


Arg 


Lys 


Asp 


Arg 


Arg 


Met 


145 










150 










155 










160 


Met 


Ala 


Gly 


Val 


Asp 


Tyr 


Glu 


Gin 


Arg 


Arg 


Pro 


Ser 


Phe 


Val 


Leu 


Ala 
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65 











165 










170 










175 




Thr 


Gly 


Leu 


Tyr 


Tyr 


Ser 


Asn 


His 


Tyr 


Phe 


Asn 


Asn 


Tyr 


Gly 


Arg 


Gly 








180 










185 










1 90 






Ala 


Thr 


Thr 


Asn 


Val 


Gly 


Ser 


He 


Pro 


Gin 


Leu 


Ser 


Thr 


Pro 


Val 


Thr 






1 95 










200 










205 








Pro 


Gin 


I let 


Asp 


Asn 


Gly 


Thr 


His 


Asn 


Val 


Arg 


Val 


Tyr 


Leu 


Gly 


Ala 




210 










215 










220 










Lys 


Asn 


Asp 


Val 


He 


Asp 


Ala 


Arg 


He 


Asp 


Tyr 


Arg 


Phe 


Phe 


Arg 


Ser 


225 










230 










2 35 










240 


He 


Pro 


Tyr 


Leu 


'3iy 


Thr 


Asp 


Pro 


Met 


Lys 


Ala 


Leu 


Thr 


Glu 


His 


Thr 










245 










2 50 










255 




Pro 


Glu 


Leu 


Asn 


Val 


Thr 


Met 


Ser 


Asn 


Glu 


Leu 


Ser 


Asp 


Asp 


He 


Lys 








260 










265 










270 






Leu 


Gly 


Val 


Glu 


Val 


Arg 


Thr 


Gly 


Gly 


Leu 


Phe 


Phe 


Ala 


Lys 


Asn 


Ser 






275 










280 










28 5 








Glu 


1 let 


He 


Gin 


Thr 


Gly 


Val 


Leu 


Ser 


Glu 


Thr 


Asp 


Arg 


Asn 


Leu 


Tyr 




2 90 










295 










300 










Tyr 


Val 


Glu 


Gly 


Ala 


Pro 


Thr 


He 


Gly 


Plie 


Val 


Gly 


Asp 


Ser 


Asp 


Asn 


305 










310 










315 










320 


I let 


Gin 


Trp 


As n 


He 


Gin 


Ala 


Gly 


Val 


Gly 


He 


Ser 


Ser 


His 


Phe 


Gly 










325 










330 










335 




Ala 


Lys 


Gly 


Arg 


Leu 


Phe 


Phe 


Trp 


Pro 


Lys 


Leu 


Asp 


Ala 


Ser 


Leu 


Ser 








3^0 










345 










350 






He 


Phe 


Fro 


Ser 


Trp 


Arg 


Val 


Tyr 


Ala 


Lys 


Ala 


Phe 


Gly 


Gly 


Val 


He 






355 










360 










365 








Arg 


Asn 


Gly 


Leu 


Ala 


Asp 


Val 


Met 


Gin 


Glu 


Glu 


Met 


Pro 


Tyr 


Leu 


Met 




370 










37 5 










3S0 










Pro 


Apn 


Thr 


He 


Val 


Leu 


Pro 


Ser 


Arg 


Asn 


Ala 


Leu 


Thr 


Ala 


Gin 


Leu 


385 










390 










395 










4 00 


Gly 


Val 


Ly^ 


Gly 


Asn 


He 


Ala 


Asp 


Val 


Val 


Arg 


Met 


Glu 


Val 


Tyr 


Gly 










4 05 










410 










415 




A^p 


Phe 


Ser 


Lys 


Leu 


Thr 


Gly 


Val 


Fro 


Phe 


Tyr 


Thr 


Pro 


Thr 


Leu 


Pro 








420 










425 










430 






Leu 


Tyr 


Asn 


Pro 


Ser 


Asp 


Leu 


Tyr 


Gin 


Tyr 


Asn 


Val 


Ser 


Phe 


Leu 


Pro 






435 










440 










445 








He 


Tyr 


Ala 


Asp 


Gly 


Ser 


Arg 


Trp 


Arg 


Ala 


Gly 


Gly 


Lys 


Leu 


Glu 


Tyr 




450 










455 










460 










Ser 


Tyr 


Arg 


Asp 


Met 


Leu 


Arg 


Phe 


Leu 


Val 


Asp 


Ala 


Ser 


Tyr 


Gly 


Lys 


4 65 










470 










475 










480 


Trp 


As n 


jjeu 


Asp 


uiy 


Gl y 


Leu 


Val 


Ala 


Ser 


Met 


Gl n 


Pro 


Asp 


Leu 


lie 










4B5 










4 90 










4 95 




Leu 


Lys 


Ala 


Glu 


Val 


Gly 


Val 


His 


Pro 


He 


Ala 


Pro 


Leu 


Asp 


Val 


Arg 








500 










505 










510 






Leu 


Arg 


Tyr 


Thr 


Gin 


Leu 


Asn 


Gly 


Arg 


Tyr 


Arg 


Tyr 


Ser 


Phe 


Gly 


Ser 






515 










520 










525 








Ala 


Gly 


Ser 


Glu 


Ala 


Leu 


Gly 


He 


Gly 


Asn 


Val 


His 


Leu 


Leu 


Ser 


Ala 




530 










535 










54 0 










Asp 


Val 


Ser 


Tyr 


Lys 


Leu 


Lys 


Lys 


Asn 


Leu 


Ser 


Leu 


Tyr 


Leu 


Lys 


He 


5^5 










550 










555 










5 60 


Asp 


Asn 


Met 


Leu 


Ala 


Glu 


Thr 


Thr 


Glu 


Leu 


He 


Gly 


Tyr 


Tyr 


Pro 


Met 










565 










570 










575 




Gin 


Pro 


Phe 


His 


Cys 


Phe 


Ala 


Gly 


Phe 


Ser 


Trp 


Thr 


Phe 









10 



15 



20 



25 



30 



35 



40 



45 



50 



580 585 
55 (2) IM FORMATION FOR SEQ ID 110:47 2 

(i) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 34 6 amino acids 

(B) TYPE: amino acid 
60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGIIIAL SOURCE: 

(A) ORGAMISM: Porphyromonas gingivalii 



(ix) FEATURE: 
70 (A) IIAME/KEY: misc^feature 

(B) LOCATIOll 1 ... 346 

(xi) SEQUEIICE DESCRI PTIOtJ : SEQ ID MO: ^72 

75 net Met Glu Lys Cys He Phe Ala His Tyr Pro His Asn Leu Val Phe 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 



PCT/AU98/01023 



421/490 



15 10 15 

M-t II*-- Arg Lys His Phe Gly He He Leu Gly Phe Leu Ser Leu Val 

2 0 25 3'-"^ 

Fhe Ser Ala Gly Ala Gin Gin Giu Lys Gin Val Phe His Phe Leu Asn 
5 35 40 45 

Leu Pro Ala Thr Ala Gin Ala Leu Ala Ala Gly Gly Lys Ala He Thr 

50 55 60 

He Val A5-p hsp Asn Pro Gly Leu Ala Phe Giu Asn Pro Ala Leu Leu 
65 70 75 80 

10 Gly Tyr Giu Ser Gly Gly Arg Ala Phe Leu Ser Tyr Leu Tyr Tyr Met 

85 90 ?5 

Ser Gly Ser His Met Gly Asn Ala Cys Tyr Ala Ser Ser Val Gly Giu 

100 105 HO 

Arg Gly I'let Trp Gly Val Gly Met Arg Phe Leu Asn Tyr Gly Ser Met 
15 115 120 125 

Gin Gly Tyr Asp Gin Asn Ala lie Ala Thr Gly Ser Phe Ser Ala Ser 

130 135 140 

A^p He Ala Val Gin Gly Phe Tyr Ser His Giu Leu Ser Asn His Phe 
145 ISO 155 160 

20 Arg Gly Gly Val Ser Leu Lys Ala Leu Tyr Ser Ser He Giu Thr Tyr 

165 170 175 

Ser Ser Phe Gly Leu Gly Val Asp Val Gly He Ser Tyr Tyr Asp Asp 

180 185 1?0 

Asp Lys Gly Tyr Ser Ala Ser Ala Leu Phe Lys Asn Val Gly Ala Gin 
25 195 200 205 

Leu Lys Gly Tyr Asn Giu Giu Arq Giu Pro Leu Asp Trp Asp Phe Gin 

210 215 220 

Leu Gly Phe Ser Arg Ser Phe He Asn Ala Pro Phe Arg Leu His He 
22b 230 235 2A0 

30 Thr Leu Phe Asn Leu Asn Pro His Tyr Phe Lys Arg Leu Val Pro Arg 

245 250 255 

Asp Leu Ser Lys Met Gin Lys Phe Leu Arg His Phe Ser He Gly Ala 

260 265 270 

Giu Phe Thr Pro Ser Giu Arg Phe Trp Val Gly Leu Gly Tyr Thr Pro 
35 275 280 285 

Gin He Ala Gin Asp Phe Giu Val Giu Gly Gly Asn Lys Trp Gly Gly 

2 90 2 95 300 

Leu Ser Ala Gly Val Gly Phe Thr Ser Gly Val Val Arg Val Gly Val 
305 310 315 320 

40 Ser Ala Ala Thr Tyr His Pro Ala Ala Leu Ser Phe Met Cys Ser Val 

325 330 335 

Gly He Arg Leu Asp Asp Lys Ser He Phe 
340 345 

45 (2) IlIPORIlATIOil FOR SEQ ID MO: 473 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LEMGTH: 3^5 amino acids 

(B) TYPE: amino acid 
50 ( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIllAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



55 



(iM) FEATURE: 
60 (A) IIAIIE/KEY: misc_featur^ 

(B) LOCATIOII 1 ... 345 



65 



70 



75 





(xi ) 


SEQUEIICE DESCRIPTIOIi: SEQ ID MO: '173 










Met 


G] u 


Lys 


Cys 


He 


Phe 


Ala 


iiis 


Tyr 


Pro 


His 


Asn 


Leu 


*./a 1 


Plie 


Met 


1 






5 










10 










15 




He 


Arg 


Lys 


His 


Phe 


Gly 


He 


He 


Leu 


Gly 


Phe 


Leu 


Ser 


Leu 


Val 


Phe 






2 0 










25 










30 






Ser 


Ala 


Gly 


Ala 


Gin 


Gin 


Giu 


Lys 


Gin 


Val 


Phe 


His 


Phe 


Leu 


Asn 


Leu 






35 










40 










4 5 








Pro 


Ala 


Thr 


Ala 


Gin 


Ala 


Leu 


Ala 


Ala 


Gly 


Gly 


Lys 


Ala 


He 


Thr 


He 




50 










55 










60 










Va] 


Asp 


Asp 


Asn 


Pro 


Gly 


Leu 


Ala 


Plie 


Giu 


Asn 


Pro 


Ala 


Leu 


Leu 


Gly 


65 








70 










75 










SO 


Tyr 


Giu 


Ser 


Gly 


Gly 


Arg 


Ala 


Phe 


Leu 


Ser 


Tyr 


Leu 


Tyr 


Tyr 


Met 


Ser 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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422/490 














85 










90 










95 






Gly 


Ser 


His Met Gly 


Asn 


Ala 


Cys 


Tyr 


Ala 


Ser 


Ser 


Val 


Gly 


Glu 


Arg 








100 








105 










110 








Gly 


Met 


Trp Gly Val 


Gly 


Met 


Arg 


Phe 


Leu 


Asn 


Tyr 


Gly 


Ser 


Met 


Gin 


5 






115 






120 










125 










Gl y 


Tyr 


A^p Gin Asn 


Ala 


He 


Ala 


Thr 


Gly 


Ser 


Phe 


Ser 


Ala 


Ser 


Asp 






130 






135 










l-^ 0 












lie 


Al a 


Val Gin Gly 


Phe 


Tyr 


Ser 


His 


Glu 


Leu 


Ser 


Asn 


His 


Phe 


Arg 


10 


1^5 






150 










155 










160 


Gly 


Gly 


Val Ser Leu 


Lys 


Ala 


Leu 


Tyr 


Ser 


Ser 


He 


Glu 


Thr 


Tyr 


Ser 








165 










170 










175 






Ser 


Phe 


Gly Leu Gly 


Val 


Asp 


Val 


Gly 


He 


Ser 


Tyr 


Tyr 


Aep 


Asp 


Asp 








180 








185 










190 






15 


L^-r 


Gly 


Tyr Ser Ala 


Ser 


Ala 


Leu 


Phe 


Lys 


Asn 


Val 


Gly 


Ala 


Gin 


Leu 






195 






200 








205 










Lys 


Gly 


Tyr Asn Glu 


Glu 


Arg 


Glu 


Pro 


Leu 


Asp 


Trp 


Asp 


Phe 


Gin 


Leu 






210 






215 










220 












Gl V 


Phe 


Ser Arg Ser 


Phe 


He 


Asn 


Ala 


Pro 


Phe 


Arg 


Leu 


His 


He 


Thr 


20 


225 






230 










235 










240 


Leu 


Phe 


Asn Leu A^n 


Pro 


His 




Phe 




Arg 


Leu 


Val 


Pro 


Arg 


Asp 








2^5 










250 










255 






Leu 


Ser 


Lj^s Met Gin 


Lys 


Phe 


Leu 


Arg 


His 


E'he 


Ser 


He 


Glv 


Al a 


Glu 








260 








265 










270 






25 


Phe 


Thr 


Pro Ser Glu 


Arg 


Phe 


Trp 


Val 


Gly 


Leu 


Gly 


Tyr 


Thr 


Pro 


Gin 






275 






280 










285 










He 


Ala 


Gin Aj^p Phe 


Gl u 


Val 


Glu 


Gly 


Gly 


Asn 


Lys 


Trp 


Gly 


Gly 


Leu 






2 90 






2 95 










3 0 0 












Ser 


Ala 


Gly Val Gly 


Phe 


Thr 


Ser 


Gly 


Val 


V a 1 


Arg 


Val 


Glv 


Val 


Ser 


30 


305 






310 










315 










320 


Ala 


Ala 


Thr Tyr His 


Pro 


Ala 


Ala 


Leu 


Ser 


Phe 


Met 


Cys 


Ser 


Val 










325 










330 










335 






lie 


Arg 


Leu A^p Asp 


Lys 


Ser 


He 


Phe 






















3^0 








3.i5 
















35 


(2) 


I M FORI lAT I on FOR 


SEQ 


ID MO: ^7^ 




















(i) 


SEQUEtlCE CHARACTERISTICS: 






















(A) LEMGTH: 330 amino 


acids 














40 






(B) TYPE: 


amino acid 






















(D) TOPOLOGY: 


1 inear 






















(ii) 


MOLECULE TYPE: 


protein 




















{ill) 


HYPOTHETICAL: YES 




















45 


































(vi) 


ORIGINAL SOURCE: 


























(A) ORGAIIISM: 


Porphyromonas gingivalis 








50 




(ix) 


FEATURE : 




























(A) HAI-IE/KEY: 


misc feature 




















(E) LOCATIOII 1 . . . 330 
























SEQUEriCE DESCRIPTIOII: SEQ ID MO: ^7^1 










55 


net 


He 


Arg Lys His 


Phe 


Gly 


He 


He 


Leu 


Gly 


Phe 


Leu 


S-^^r 


Leu 


Va 1 




1 




5 










10 










1 5 






Phe 


Ser 


Ala Gly Ala 


Gin 


Gin 


Glu 


Lys 


Gin 


Val 


Phe 


His 


Phe 


Leu 


As n 








20 








25 










30 






60 


Leu 


Pro 


Ala Thr Ala 


Gin 


Ala 


Leu 


Ala 


Ala 


Gly 


Gly 


Lys 


Ala 


He 


Thr 






35 






40 








45 










He 


Val 


Asp Asp Asn 


Pro 


Gly 


Leu 


Ala 


Phe 


Glu 


Asn 


Pro 


Ala 


Leu 


Leu 






50 






55 










60 












Gly 


Tyr 


Glu Ser Gly 


Gly 


Arg 


Ala 


Phe 


Leu 


Ser 


Tyr 


Leu 


Tyr 


Tyr 


Met 


65 


65 






70 










75 










8 0 


Ser 


Gly 


Ser iii^ Met 


Gly 


Asn 


Ala 


Cys 


Tyr 


Ala 


Ser 


Ser 


Val 


Gly 


Glu 








85 










90 










95 






Arg 


Gly 


Met Trp Gly 


Val 


Gly 


Met 


Arg 


Phe 


Leu 


Asn 


Tyr 


Gly 


Ser 


Met 








1 00 








105 










110 






70 


Gin 


Gly 


Tyr Asp Gin 


Asn 


Ala 


He 


Ala 


Thr 


Gly 


Ser 


Phe 


Ser 


Ala 


Ser 






115 






120 








125 










A£-p 


He 


Ala Val Gin 


Gly 


Phe 


Tyr 


Ser 


His 


Glu 


Leu 


Ser 


Asn 


His 


Phe 






130 






135 










140 












Arg 


Gly 


Gly Val Ser 


Leu 


Lys 


Ala 


Leu 


Tyr 


Ser 


Ser 


He 


Glu 


Thr 


Tyr 


75 


1^5 






150 










155 










160 


Ser 


Ser 


Phe Gly Leu 


Gly 


Val 


Asp 


Val 


Gly 


He 


Ser 


Tyr 


Tyr 


Asp 


Asp 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSCXDCID: <WO_9929870AUL> 



wo 99/29870 

423/490 



PCT/AU98/01023 



40 











165 










170 










175 






Lys 


Gly 


Tyr 


Ser 


Ala 


Ser 


Ala 


Leu 


Phe 


Lys 


Asn 


Va 1 


Gly 


A J. a 


L).L n 






180 










185 










190 






Leu 


Lys 


Gly 


Tyr 


Asn 


Glu 


Glu 


Arg 


Glu 


Pro 


Leu 


Asp 


Trp 


Asp 


Phe 


Gin 






195 










200 










205 






He 


Leu 


Gly 
210 


Phe 


Ser 


Arg 


Ser 


Fhe 
215 


He 


Asn 


Ala 


Pro 


Phe 

220 


Arg 


Leu 


His 


Tin- 


Leu 


Phe 


As 11 


Leu 


Asn 


Pro 


His 


Tyr 


Phe 


Lys 


Arg 


Leu 


V a 1 


Pro 


Arg 


2 5i 










230 










2 35 










2 4 0 


A^p 


Leu 


Ser 


Lys 


! let 


Gin 


L:^-s 


Phe 


Leu 


Arc 


His 


Phe 


Ser 


T 1 o 

lie 


'j± y 


Ala 








245 








2 50 










255 




Glu 


Phe 


Thr 


Fro 
260 


Ser 


Glu 


Arg 


Phe 


Trp 
265 


Val 


Gly 


Leu 


Gly 


Tyr 
270 


Thr 


Pro 


Gin 


He 


Ala 

275 


Gin 


Asp 


Phe 


Glu 


Val 
280 


Glu 


Gly 


Gly 


Asn 


Lys 
285 


Trp 


Gly 


Gly 


Leu 


Ser 


Ala 


Gly 


Val 


Gly 


Phe 


Thr 


Ser 


Gly 


Val 


Val 


Arg 


Val 


Gly 


Val 




290 






295 










300 








Val 


Ser 


Ala 


Ala 


Thr 


Tyr 


His 


Pro 


Ala 


Ala 


Leu 


Ser 


Phe 


Met 


Cys 


Ser 


305 








310 










315 










320 


•31 y 


lie 


Arg 


Leu 


Asp 


Asp 


Lys 


Ser 


He 


Phe 




















325 










3 30 















10 



15 



20 



(2) lUFORllATIOM FOR SEQ ID 110:475 

25 (i) SEQUEIICE CHARACTERISTICS: 

(A) LEllGTH: 324 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGItJAL SOURCE: 
35 (A) ORGANISM: Po rphyrotnonas gingivali^ 

(ix) FEATURE: 

(A) MAME/PCEY: misc_feature 

(B) LOCATIOM 1 ... 324 



(Mi) SEQUEIICE DESCRIPTIOil : SEQ ID 110:475 





Met 


Cys 


Leu 


Glu 


Pro 


He 


He 


Ala 


Pro 


He 


Ser 


Ser 


Glu 


Leu 


Leu 


Glu 




1 






5 










10 










15 


Gly 


45 


Gin 


Glu 


Leu 


Thr 


Ala 


Asp 


Arg 


Phe 


Leu 


Arg 


Met 


Thr 


Asn 


Lys 


Ala 










20 






25 










30 








Asn 


Glu 


He 


Tyr 


Val 


Phe 


Thr 


Ala 


Glu 


Glu 


Ala 


Pro 


His 


cys 


Met 


Lys 








35 








40 










45 










Glu 


Val 


Gly 


Arg 


Leu 


Arg 


Glu 


Glu 


Ala 


Phe 


Arg 


11 i s 


Tyr 


Gly 


Gly 


Gly 


50 




50 






55 










60 












Tlir 


Gly 


Lys 


Ala 


He 


Asp 


He 


Asp 


Glu 


Phe 


Asp 


Thr 


Met 


Pro 


Gly 


Ser 




65 






70 










75 










00 




Tyr 


Lys 


Gin 


Leu 


He 


Val 


Trp 


Asp 


Pro 


Gin 


Asn 


Lys 


Ala 


He 


Leu 


Gly 










85 










90 










95 




55 


Gly 


Tyr 


Arg 


Phe 


He 


Tyr 


Gly 


Arg 


Asp 


Val 


Ala 


Phe 


Asp 


Thr 


Asp 


Gly 




100 










105 










110 




Phe 




Lys 


Pro 


Leu 


Leu 


Ala 


Thr 


Ala 


Glu 


Met 


Phe 


Arg 


Phe 


Ser 


Asp 


Ala 








115 










120 










125 










Leu 


His 


Asp 


Tyr 


Leu 


Pro 


Tyr 


Thr 


Val 


Glu 


Leu 


Gly 


Arg 


Ser 


Phe 


Val 


60 




130 






135 










1 4 0 












Ser 


Leu 


Gin 


Tyr 


Gin 


Ser 


Thr 


Arg 


Met 


Gly 


Thr 


Lys 


Ala 


He 


Phe 


Val 




145 








150 










155 










160 




Leu 


Asp 


Asn 


Leu 


Trp 


Asp 


Gly 


He 


Gly 


Ala 


Leu 


Thr 


Val 


Val 


Asn 


Pro 










165 










170 










175 




65 


Glu 


Ala 


Leu 


Tyr 
180 


Phe 


Tyr 


Gly 


Lys 


Vol 

185 


Thr 


Met 


Tyr 


Lys 


Asp 
190 


Tyr 


Asp 




Arg 


Arg 


Ala 


Arg 


Asn 


Leu 


He 


Leu 


Tyr 


Fhe 


Leu 


Arg 


Lys 


His 


Phe 


Ser 




195 










200 










205 










Asp 


Pro 


Glu 


Gly 


Leu 


Val 


Lys 


Pro 


He 


His 


Pro 


Leu 


Pro 


He 


Glu 


He 


70 


210 










21 5 










220 












Ser 


Ala 


Glu 


Asp 


Glu 


Ala 


Leu 


Phe 


Ser 


Ser 


Ser 


Asp 


Phe 


Asp 


Thr 


Asn 




225 










230 










235 










240 




Tyr 


Lys 


Thr 


Leu 


Asn 
245 


He 


Glu 


Val 


Arg 


Lys 
2 50 


Leu 


Gly 


He 


Asn 


He 

255 


Pro 


75 


Pro 


Leu 


Val 


Ser 


Ala 


Tyr 


He 


Ala 


Leu 


Ser 


Pro 


Glu 


Met 


Arg 


Val 


Phe 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO_9929e70A1 J„> 



10 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



WO 99/29870 PCT/AU98/01023 

424/ 490 









260 










265 










270 






Gly 


Thr 


Ala 


1 


A^n 


Glu 


Ser 


Phe 


Gly 


Glu 


Val 


Glu 


Glu 


Thr 


Gly 


He 






275 










280 










285 








Phe 


He 


Ala 


Val 


Gly 


Lys 


He 


Leu 


Glu 


Glu 


Lys 


Lys 


Gin 


Arg 


His 


He 




290 










2 95 










300 










Glu 


Ser 


Phe 


He 


Leu 


Ser 


Arg 


Asn 


Glu 


Lys 


Lys 


Gly 


Leu 


A^p 


Ser 


Ser 


305 










310 










315 










320 




Gly 


Arg 


Ser 



























(2) I!JFORMATIOII FOR SEQ ID 110:47 6 



(i) SEQUEMCE CHARACTERISTICS: 

(A) LEMGTH: oAl amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: protein 

20 (iii) HYPOTHETICAL: YES 

(vi) ORIGIUAL SOURCE: 

(A) ORGAMISil: Porphyromonas gingivali^ 

25 (ix) FEATURE: 

(A) MAI IE/ KEY: mis cofeature 

(B) LOCATIOH 1 , . . 54 7 



(xi) SEQUEMCE DESCRIPTION: SEQ ID 110:416 



I IC' U 


Lys 


Thr 


X J. e 


V a 1 


Arg 


Tyr 


Ser 


Arg 


Leu 


Pro 


V ct 1 


Al a 


Leu 


Phe 


Phe 




















1 0 










1 5 




Cys 


Leu 


Leu 


Gly 


Ala 


Val 


His 


Leu 


Ser 


Val 


Glu 


Ala 


Gin 


Met 


Leu 


Asn 








20 










25 










30 






Thr 


Pro 


Phe 


Glu 


Leu 


Ser 


Asp 


Gin 


He 


Val 


Leu 


Ser 


Pro 


Thr 


Glu 


Arg 






35 










40 










45 






Gin 


Tyr 


Arg 


Glu 


He 


Cys 


Val 


Gin 


Thr 


Lys 


Glu 


Lys 


Arg 


Gly 


Ala 


Asp 




50 










55 










60 










Leu 


Phe 


Pro 


Leu 


Ser 


Asp 


Lys 


Leu 


Arg 


Asp 


Ser 


Ala 


Tyr 


Val 


Arg 


Phe 


65 










70 










75 










80 


Gly 


Ser 


Ala 


Tyr 


Gly 


Asp 


He 


Ala 


Gly 


Asp 


Tyr 


Leu 


Pro 


Tyr 


Asn 


Gly 










85 










90 










95 




Acn 


Asn 


Tyr 


Ser 


Ser 


Leu 


Ser 


Leu 


Glu 


Ser 


Gly 


Gly 


Arg 


He 


Ser 


Val 








100 










105 










110 






Arg 


Asn 


Tyr 


Gly 


Thr 


Leu 


Gin 


Gly 


Ser 


Ala 


Ser 


Tyr 


Ser 


Arg 


Gly 


Met 






115 










120 










125 








Hi:? 


Lys 


Arg 


He 


Gly 


Trp 


Asn 


Ala 


Leu 


Arg 


Asn 


Ala 


Glu 


Ala 


Tyr 


Tyr 




130 










135 










140 










Pro 


Tyr 


Leu 


Val 


Ser 


Asp 


Ser 


Thr 


Gly 


Gly 


Asp 


Tyr 


His 


Phe 


Glu 


A^P 


1-15 










150 










155 










160 


Tyr 


Arg 


Leu 


Ala 


Gly 


Tyr 


Tyr 


Ser 


Phe 


Arg 


Ala 


Gly 


Arg 


Leu 


Pro 


Leu 










165 










170 










175 




Gly 


He 


Gly 


Phe 


Ser 


Tyr 


Arg 


Gly 


Glu 


Val 


Ala 


Tyr 


Arg 


Leu 


Thr 


Asp 








ISO 










185 










190 






Pro 


Arg 


Thr 


Thr 


Asn 


Thr 


Thr 


Gly 


Ala 


Leu 


Glu 


Leu 


Ser 


Cys 


Ala 


Thr 






195 










200 










205 








Ser 


Leu 


Thr 


Leu 


Pro 


Arg 


Glu 


Asn 


Arg 


Leu 


Ser 


Leu 


Ser 


Ala 


Ala 


Tyr 




210 










215 










220 










Leu 


Tyr 


His 


Arg 


Gin 


His 


Leu 


Thr 


Gin 


Tyr 


Acn 


Trp 


Arg 


Pro 


Gly 


Gin 


225 










2 30 










235 










240 


Gin 


Asp 


Lys 


Ph- 


Phe 


Val 


Ser 


Tyr 


Gly 


Phe 


Gly 


Gin 


Val 


A-p 


Val 


Ser 










245 










250 










255 




Asn 


Ser 


Pro 


il e 


Trp 


Phe 


Gly 


He 


Ser 


Arg 


Met 


As n 


Tyr 


Val 


Asn 


Gly 








260 










265 










270 






Trp 


Lys 


Leu 


Ser 


Ser 


Arg 


Leu 


Asp 


Thr 


Arg 


Arg 


Gly 


Asp 


Ala 


He 


Gly 






275 










280 










285 








Leu 


Asp 


Tyr 


Ser 


Gly 


Tyr 


Phe 


Leu 


Asp 


Thr 


Glu 


Glu 


Arg 


Ser 


Ser 


He 




2 90 










2 95 










300 








Asn 


Leu 


Phe 


Ala 


Leu 


Leu 


Tyr 


Asn 


Arg 


Leu 


Ara 


Leu 


Tyr 


Gly 


Ser 


Trp 


305 










310 










315 










320 


His 


Leu 


Ser 


A5?p 


Phe 


Asp 


Phe 


Ser 


Phe 


Ser 


Ala 


Asp 


Tyr 


Ala 


Leu 


Arg 










325 










330 










335 




Gin 


Gly 


He 


Glu 


Arg 


He 


Tyr 


Glu 


Asp 


Tyr 


Lys 


Pro 


Asp 


Asp 


Asn 


Tyr 








34 0 










3^5 










350 






His 


lie 


Tyr 


Asp 


Leu 


Arg 


He 


Leu 


Ala 


He 


Arg 


Arg 


Trp 


Tyr 


net 


Leu 
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425/490 



35 



40 



45 



50 



55 



60 



65 



70 



75 







355 










360 










365 








Asn 


Glu 


Phe 


Ser 


Ala 


Gin 


Ala 


Gin 


Ala 


Ser 


Tyr 


Arg 


He 


Arg 


Thr 


Asp 




3-70 










31b 










380 










Arg 


Gly 


Cys 


Ala 


Leu 


Arg 


Val 


Ser 


Ala 


Gly 


Ser 


Asp 


Phe 


Tyr 


Gly 


Tyr 


385 










390 










395 










4 00 


A^p 


Glu 


Thr 


Tyr 


Arg 


Lys 


His 


Gly 


His 


His 


Thr 


net 


Ser 


Gly 


Met 


Leu 










405 










410 










415 




Arg 


Pro 


Phe 


Ala 


'3iy 


He 


Ai5 


Tyr 


Asp 


His 


Ala 


Gly 


Ser 


Lys 


Leu 


Asp 








420 










425 










430 






Ph- 


Gly 


Leu 


Per 


Leu 


Ser 


Ala 


Ala 


Tyr 


Arg 


net 


Val 


Leu 


Thr 


His 


Ser 




^35 










440 










445 








Tyr 




lie 


Arg 


Thr 


He 


Gl n 


Lys 


Glu 


Gin 


Leu 


Asp 


Tyr 


Gln 


Leu 


Ala 




4 5 0 










4 55 










4 60 










']'■ " r 


L*^ u 






Al a 






Asn 




Glu 


Gl" 


Val 


Glu 


Val 


Arg 


Ser 


'16 5 










47 0 










475 










4 80 


Ser 


Leu 


Tyr 


Val 


Ser 


He 


Pro 


net 


Gin 


Asn 


Thr 


His 


Arg 


Leu 


! let 


Thr 










^85 










A 90 










495 




Glu 


Leu 


Arg 


Leu 


Tyr 


Gly 


Asp 


Leu 


net 


Lys 


Arg 


Lys 


Asp 


Gly 


He 


Ala 






500 










505 










510 






Tyr 


Gly 


Lys 


Thr 


Pro 


Gly 


Val 


He 


Ser 


His 


lie 


Leu 


Ser 


Asp 


Pro 


Gin 






515 










52 0 










525 








Ala 


Glu 


Arg 


Tlir 


Ser 


Gly 


His 


Thr 


He 


Gly 


Ala 


He 


Cys 


Asn 


He 


Ser 




530 










535 










54 0 










Tyr 


Leu 


Phe 




























545 
































(2) 


111 FORMAT I Oil 


FOR 


SKQ 


ID 110:477 

















10 



15 



20 



25 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LEMGTH: 750 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



{ii) MOLECULE TYPE: protein 

( i i i ) HY POTHETICAL : YES 

(vi) ORIGIIIAL SOURCE: 

(A) ORGAN I Si 1: Porphyromonas gingiva lis 

(ix) FEATURE: 

(A) HAME/KEY: mi £C_ feature 
{ B) LOCATIOII 1 ... 750 





) 


SEQUENCE DESCRIPTIOII : SEQ ID 110:477 










Met 


Lyc 


Lys 


Leu 


His 


Met 


He 


Ala 


Ala 


Leu 


Ala 


Val 


Leu 


Pro 


Phe 


Cys 


1 
















10 










15 




Leu 


Thr 


Ala 


Gin 


Ala 


Pro 


Val 


Ser 


Asn 


Ser 


Glu 


He 


Asp 


Ser 


Leu 


Ser 








2 0 










25 










30 






Asn 


Val 


Gin 


Leu 


Gin 


Thr 


Val 


Gin 


Val 


Val 


Ala 


Thr 


Arg 


Ala 


Thr 


Ala 






35 










40 










45 








Lys 


Thr 


Pro 


Val 


Ala 


Tyr 


Thr 


Asn 


Val 


Arg 


Ly^ 


Ala 


Glu 


Leu 


Ser 


Lys 




50 










55 










60 










Ser 


Asn 


T y r 


Gly 


Arg 


Asp 


He 


Pro 


Tyr 


Leu 


Leu 


Met 


Leu 


Thr 


Pro 


Ser 


65 










70 










75 










80 


Val 


Val 


Ala 


Tlir 


Ser 


Asp 


Ala 


Gly 


Thr 


Gly 


He 


Gly 


Tyr 


Ser 


Gly 


Phe 










85 










90 










95 




Arg 


Val 


Arg 


Gly 


Thr 


Asp 


Ala 


Asn 


Arg 


He 


Asn 


He 


Thr 


Thr 


Asn 


Gly 








100 










105 










110 






Val 


Pro 


Leu 


Asn 


Asp 


Ser 


Glu 


Ser 


Gin 


Ser 


Val 


Phe 


Trp 


Val 


Asn 


Met 






115 










120 










125 








Pro 


Asp 


Phe 


Ala 


Ser 


Ser 


He 


Glu 


Asp 


Leu 


Gin 


Val 


Gin 


Arg 


Gly 


Val 




130 










135 










140 










Gly 


Thr 


Ser 


Thr 


Asn 


Gly 


Ala 


Gly 


Ala 


Phe 


Gly 


Ala 


Ser 


Val 


Asn 


Met 


145 










150 










155 










160 


Arg 


Thr 


Asp 


Asn 


Leu 


Gly 


Leu 


Ala 


Pro 


Tyr 


Gly 


Arg 


Val 


Asp 


Leu 


Ser 










165 










17 0 










175 




Gly 


Gly 


Ser 


Phe 


Gly 


Thr 


Phe 


Arg 


Arg 


Ser 


Val 


Lys 


Leu 


Gly 


Ser 


Gly 








180 










105 










1 90 






Arq 


He 


Gly 


Arg 


His 


Trp 


Ala 


Val 


Asp 


Al a 


Arg 


Leu 


Ser 


Lys 


He 


Gly 






195 










2 00 










205 








Ser 


Asp 


Gly 


Tyr 


Val 


Asp 


Arg 


Gly 


Ser 


Val 


Asp 


Leu 


Lys 


Ser 


Tyr 


Phe 




210 










215 










220 










Ala 


Gin 


Val 


Gly 


Tyr 


Phe 


Gly 


Ser 


Asn 


Thr 


Ala 


Leu 


Arg 


Phe 


He 


Thr 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCIDi <WO 9929870A1_I_> 



wo 99/29870 



PCT/AU98/01023 



426/ 490 
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20 



25 
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35 



40 



45 
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55 



60 



65 



75 



22-5 










230 










235 










24 0 


Phe 


Gly 


Gly 


Lys 


Glu 


Val 


Thr 


Gly 


lie 


Ala 


Tirp 


Asn 


Gly 


Leu 


Ser 


Lys 










245 










250 










255 




Glu 


Asp 


Gl Li 


Ala 


Lys 


Tyr 


Gly 


Arg 


Arg 


Tyr 


Asn 


Ser 


Ala 


Gly 


Leu 


Met 








260 










265 










27 0 






Tyr 


Val 


Asp 


Al a 


Gin 


Gly 


Val 


Pro 


Hi £ 


Tyr 


Tyr 


His 


Asn 


Thr 


Asp 


Asn 






27 5 










280 










2 85 








Tyr 


Gi u 


Gin 


Arg 


His 


Tyr 


His 


Al a 


lie 


Met 


Thr 


His 


Ser 


Phe 


Ser 


Pro 




2 90 










2 95 










300 










Ser 


Vrtl 


Tie 


Leu 


Asn 


Leu 


Thr 


Al a 


Hi s 


Tyr 


Thr 


Ala 


Gly 


Tyr 


Gly 


Tyr 


3 05 










310 










315 










320 


Thr 


Asp 


Glu 


Tyr 


Arg 


Thr 


Gly 


Arg 




Leu 


Lys 


Glu 


Tyr 


Ala 


Leu 


Gin 










325 










330 










335 




Pro 


Tyr 


Va J. 


Glu 


Asn 


Ser 


Val 


Thr 


Val 


Lys 


Lys 


Thr 


Asp 


Leu 


He 


Arg 








34 0 










345 










350 






Gin 


Lys 


Tyr 


Leu 


Asp 


Asn 


Asp 


Phe 


Gly 


Gly 


Leu 


He 


Gly 


Ser 


Leu 


Asn 






3 55 










360 










365 








Trp 


His 


Thr 


Gly 


Ala 


Trp 


Asp 


Leu 


Gin 


Phe 


Gly 


Ala 


Ser 


Gly 


Asn 


He 




37 0 










375 










380 










Tyr 


Lys 


G 1 y 


Asp 


His 


Phe 


Gly 


Arg 


lie 


Thr 


Tyr 


He 


Lys 


Lys 


Tyr 


Asn 


3 65 










390 










395 










4 00 


Gin 


Pro 


Leu 


Al a 


Pro 


Asp 


Phe 


Glu 


Tyr 


Tyr 


Arg 


Asn 


Arq 


Ala 


Asp 


Lys 










4 05 










4 10 










415 




Arg 


Glu 


Gly 


Al a 


Ala 


Phe 


Ala 


Lys 


Ala 


Asn 


Trp 


Gin 


He 


Thr 


Pre 


Glu 








420 










4 25 










A 30 






Leu 


As n 


Met 


Tyr 


Ala 


Asp 


Leu 


Gin 


Tyr 


Arg 


Thr 


He 


Gly 


Tyr 


Thr 


He 






4 35 










44 0 










44 5 








Asn 


Gly 


lie 


Thr 


Asp 


Glu 


Tyr 


Asp 


Glu 


Val 


Gin 


Gly 


Ser 


Met 


Gin 


His 




4 50 










455 










4 60 










lie 


Asp 


Leu 


Asp 


Lys 


Thr 


Phe 


Arg 


Phe 


Leu 


Asn 


Pro 


Lys 


Ala 


Gly 


Leu 


4 65 










47 0 










475 










4 80 


Thr 


Tyr 


Ser 


Phe 


Asp 


Asp 


Ala 


His 


Thr 


Ala 


Tyr 


Ala 


Ser 


Val 


Ala 


Val 










485 










4 90 










4 95 




Ala 


His 


Arg 


Glu 


Pro 


Asn 


Arg 


Thr 


Asn 


Tyr 


Thr 


Glu 


Ala 


Gly 


He 


Gly 








500 










505 










510 






Gin 


Tyr 


Pro 


Thr 


Pro 


Glu 


Arq 


Leu 


lie 


Asp 


Tyr 


Glu 


Leu 


Gly 


Tyr 


Arg 






515 










520 










525 








Tyr 


Ala 


Ser 


Pro 


Leu 


Leu 


Ser 


Ala 


Gly 


Val 


Gly 


Leu 


Tyr 


Tyr 


Met 


Gin 




530 










535 










54 0 










Tyr 


Lys 


Asp 


Gin 


Leu 


Val 


Leu 


Asp 


Gly 


Arg 


Leu 


Ser 


Asp 


Val 


Gly 


Gin 












550 








555 










560 


1 let 


Leu 


Thr 


Ser 


Asn 


Val 


Pro 


Asp 


Ser 


Tyr 


Arg 


Met 


Gly 


Leu 


Glu 


Leu 










565 










57 0 










575 




Thr 


Leu 


Gly 


Trp 


Gin 


lie 


Leu 


Pro 


Arg 


Leu 


Leu 


Arg 


Trp 


Asp 


Ala 


Ser 








580 










585 










590 






Phe 


Thr 


Met 


Ser 


Arg 


Asn 


Lys 


lie 


Asp 


Arg 


Tyr 


Val 


Gl n 


Tyr 


Thr 


Ser 






5 95 










600 










605 








Val 


Tyr 


Asp 


Ala 


Asp 


Tyr 


Asn 


Trp 


Leu 


Gl u 


Leu 


Lys 


Gl u 


Gl u 


Thr 


Leu 




6 ] C' 










615 










620 










Glu 


Ser 


Thr 


Asp 


lie 


Ala 


Tyr 


Ser 


Pro 


Asn 


Val 


He 


Ala 


Gly 


Ser 


1 let 


62 5 










630 










635 










64 0 


Leu 


Thr 


Leu 


Ser 


His 


Ala 


Gly 


Phe 


Glu 


Met 


Ala 


Trp 


Thr 


Ser 


Arg 


Phe 










54 5 










650 










655 




Val 


Ser 


Lys 


Gin 


Tyr 


Leu 


Asp 


Asn 


Thr 


Gin 


Arg 


Ser 


Asp 


Arg 


Met 


Leu 








660 










665 










670 






Ser 


Ser 


Tyr 


Trp 


Val 


Asn 


Asp 


Leu 


Arg 


Leu 


Gly 


Tyr 


Val 


Leu 


Pro 


Val 






67 5 










680 










685 








His 


Phe 


Val 


Lys 


Arg 


Val 


Ala 


Leu 


Gly 


Val 


Gin 


Leu 


Asn 


Asn 


Leu 


Phe 




690 










695 










7 00 










Asm 


Leu 


Met 


Tyr 


Ala 


Ser 


Asn 


Ala 


Tyr 


He 


Tyr 


Asp 


Ala 


Gly 


Tyr 


Val 


705 










710 










715 










720 


Gin 


Ala 


Ser 


Gly 


Glu 


Leu 


Ser 


Ala 


Tyr 


Ala 


Asp 


Leu 


Arg 


Tyr 


Tyr 


Fro 










725 










7 30 










735 




Gin 


Ala 


Gly 


Phe 


Asn 


Ala 


Leu 


Gly 


Ser 


Leu 


Thr 


He 


Asp 


Phe 












740 










745 










7 50 







(2) 111 FORMAT 1 01 J FOR SEQ ID 110:478 



(i) SEQUEIJCE CHARACTERISTICS: 
70 (A) LEIfGTH: 4 94 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 
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20 
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35 



40 



45 



50 



55 



60 



65 



70 



427/490 



(iii) HYPOTHETICAL: YES 

(vi) OR I Git J AL SOURCE: 

(A) ORGAUXSil: Porphyromonas gingival is 

5 

(iM) FEATURE: 

(A) HAI^IE/KEY: mi sc_f eat ur e 

(B) LOCATIOH 1, . .494 

10 (y.i) SEQUENCE DESCRIPTION: SEQ ID NO: 478 



75 (2) in FORI I AT I CM FOR SEQ ID MO:47Q 



Met 


L' "S 


Aig 


Arg 


Phe 


Leu 


Ser 


Leu 


Leu 


Leu 


Leu 


Tyr 


He 


Leu 




S*^ r 


I 








5 










1 0 










1 5 




He 


Sec 


Leu 


Se r 


Ala 


Gin 




Phe 


Pro 


\ let 


Val 


Gin 


Gl \.- 
ui ]i 


He 


Glu 


Leu 








20 




















30 








Thr 




Se r 


Leu 


Phe 


Ser 


L°u 


Pro 


T -.re- 




Pro 


T rr> 


Arg 


Ala 


He 






35 










4 0 










4 5 








Gl V 


L-s 


Thr 


He 


Glv 


Vs 1 


As n 


Leu 


Ala 


Val 


T rn 


G 1 V 


Phe 


A«n 


hi is 


Phe 




50 










55 










60 










He 


(let. 


AoH 


Gl u 




Phe 


Ala 




He 


Ser 


T m 
J. J. 


Gin 


Thr 


He 




Ser 


65 










7 0 










7 5 










80 




Phe 


Gin 


Thr 


Gl 


P^'je 


Gl y 


i rp 


sp 


A«^n 


Asp 


Lys 


Phe 


Val 


Thr 


Asn 










8 5 










90 










95 




Leu 


Phe 


Al a 


His 


Pro 




His 


Gl 


S =^ r 






Phe 


Asn 


Al a 


Al a 










1 oo 










105 










2 2 n 






S^^r 


As n 


Li J. y 


Leu 




Phe 


rg 


His 




Al a 


ro 


Phe 


Al a 


Phe 










] 1 5 










120 










12 5 








S'^ r 


L'^u 


i t 


i L p 




L'^u 


L'='U 


( 1 o 1- 

t le L 


("111 


Asn 


i. u 


Pro 


Pro 


C c »- 
o e 1 


Tie 

lie 


As n 




130 










135 










14 0 










M-P 


Leu 


" J s 


Al a 


Thr 


Thr 


1 1 


^ y 


Gl y 


Tic. 

lie 




L'^u 


G 1 ' * 




i-ie X, 


tal 


14 5 










1 50 










1 










1 6 




Arg 


Leu 


Ser 


Asp 


ijeu 


Leu 


Tic 

lie 


Asp 


As 


Arg 


Thr 


Thr 


Giy 


Trp 


Glu 










165 










1 T fi 
1 ' U 










1 1 o 






Me t 




Arg 


Glu 


Va 1 


Al a 


lie 


Ml a 


Leu 


Tic 

lie 


Asn 


ro 


M«='t 


Arg 


Phe 








180 










185 










1 90 






Leu 


A^n 
sn 


Ara 


Leu 


Thr 


Al a 


Gly 


Glu 


Val 


Thr 


S'^ r 


Val 


Gl y 


Ser 




er 






1 95 










2 00 










205 








Gl V 


Gin 


He 


Phe 


Gin 


Ser 


Val 


Pro 


He 


Asn 


He 


Val 


Val 


Ac-n 
^-H 


Al a 


Gl 




210 










215 










220 










Phe 


Arg 


Phe 


Leu 


Ala 


Mi JJ 


Lys 




His 


Ala 


Arg 


Thr 


Gl'^'" 


Al a 


Thr 


Al 5 


->25 










230 










235 










24 0 


Leu 


Thr 


Leu 


Asn 


Leu 


Arg 


Phe 


MSp 


Tyr 


Uly 


A ~r. 


ro 


Phe 




S*^ r 


Glu 










245 










2 50 










2 55 




Thr 


Phe 


Ser 


Pro 


Tyr 


p 


Phe 


Phe 


Gin 


Ph'=- 




Al a 


Gl \-- 
Ul y 


' =u 


S'^' r 


Ph ^ 








260 










265 










9 7 0 






S'^ r 


'''I 


5 = r 


Gin 


Pro 


L=^U 


L'=^u 


.3 _r 


Gin 


lie 


Asn 


Tell 

ijeu 


H*= 


Gl"'* 


T 1 (=. 

lie 


L^ u 






275 










2 80 










285 








S*^ I" 






Gin 


Leu 


L'^U 




His 


(''111 

(j1 u 


Arg 


m 

Thr 


V a 1 


Leu 


V « 1 




•jI j 




2 90 










2 95 








3 on 










Leu 


Phe 


Gin 


His 


Phe 


Asp 


Tvr 


Tvr 


As n 


Ser 


Glu 




Arg 


He 


Ser 


ijj 


305 










310 










315 










320 


Asn 


Ser 


Glu 


Glu 


Val 


Leu 


Val 


Thr 


Pro 


Tyr 


Arg 


He 


Ser 


Gin 


Val 


Ala 










325 










330 










335 




Ala 


Leu 


Gly 


Gly Gly 


Leu 


He 


Phe 


Gin 


His 


His 


Gly 


Lys 


Phe 


Arg 


Arg 








340 










345 










350 






Arg 


Pro 


Leu 


Glu 


Leu 


Tyr 


Ala 


Glu 


Thr 


Tyr 


Leu 


Asn 


Val 


Val 


Pro 


Met 






355 










360 










365 








Giy 


Ala 


Ser 


Leu 


Ser 


Asp 


His 


Tyr 


Asn 


Val 


Asp 


Asn 


Arg 


Asp 


Tyr 


Asn 




370 










375 










380 










Leu 


Gly 


Ser 


Gly 


Leu 


Ser 


Gly 


Lys 


Leu 


Tyr 


Leu 


Gly 


Ala 


Thr 


Tyr 


Asn 


305 










390 










395 










400 


Aspi 


Leu 


Trp 


Ser 


Trp 


Leu 


Leu 


Gly 


Val 


Glu 


Ser 


Tyr 


Arg 


Leu 


Tyr 


Th r 










4 05 










410 










415 




Trp 


He 


Gly 


Tyr 


Glu 


Glu 


Pro 


His 


Gin 


Lys 


Asn 


Thr 


Asp 


Val 


Ser 


Ser 








420 










425 










4 30 






Phe 


Met 


V«l 


Gin 


Gly 


Asp 


Glu 


Ser 


Lys 


Ala 


Arg 


Leu 


Leu 


Val 


Thr 


Ser 






435 










440 










445 








Seu 


Glu 


Phe 


Ala 


Phe 


His 


Pro 


Gly 


Pro 


Trp 


His 


Val 


Ala 


He 


Val 


Ala 




450 










455 










460 










Arg 


Arg 


Phe 


He 


Arg 


Lys 


Thr 


Ala 


Tyr 


Gin 


Phe 


Tyr 


Pro 


Asn 


Val 


Ser 












470 










475 










4 60 


Phe 


Asp 


Thr 


Gly 


Asp 


He 


Gin 


Leu 


Arg 


Val 


Gly 


Phe 


His 


Pi»e 














485 








4 90 















SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCI D: < WO_9929870A 1 _L> 



10 



25 



30 



35 



40 



50 



55 



60 



65 



70 



75 



WO 99/29870 PCT/AU98/01023 

428/490 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 amino acid^ 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii} HYPOTHETICAL: YES 

<v-i) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival if 



(ix) FEATURE: 
15 (A) NAME/KEY: misc_feature 

(B) LOCATION 1 . . .294 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 9 

20 Met Ly^ Arg Leu He Val Phe Leu Ala Met Gly Gly Leu Leu Phe Thr 
is 10 15 

Leu Ala Asn Ala Gin Glu Ala Asn Thr Ala Ser Asp Thr Px-o Lys Lys 

^0 25 30 

A.-p Trp Thr He Lys Gly Val Thr Gly Leu Asn Ala Ser Gin Thr Ser 

35 40 45 

Leu Thr Asn Trp Ala Ala Gly Gly Glu Asn Thr Val Ala Gly Asn Leu 

50 55 60 

Tyr Leu Asn He Asp Ala Asn Tyr Leu Lys Asp Lys Trp Ser Trp Asp 
^5 70 75 80 

Asn Gly Leu Arg Thr Asp Phe Gly Leu Thr Tyr Thr Thr Ala Asn Lys 

8 5 90 95 

Trp Asn Lys Ser Val Asp Lys He Glu Leu Phe Thr Lys Ala Gly Tyr 

100 105 110 

Glu He Gly Lys His Trp Tyr Gly Ser Ala Leu Phe Thr Phe Leu Ser 

lis 120 125 

Gin Tyr Ala Lys Gly Tyr Glu Lys Pro Ser Asp His Leu Thr Gly Val 

i30 135 
Lys His He Ser Asn Phe Phe Ala Pro Ala Tyr Leu Thr Leu GJ y He 
150 155 160 

Gly Ala Asp Tyr Lys Pro Asn Glu Lys Phe Ser Leu Tyr Leu Ser Pro 

165 170 175 

Thr Thr Gly Lys Leu Thr Val Val Ala Asp Asp Tyr Leu Ser Ser Leu 

180 185 190 

Gly Ala Phe Gly Val Lys Val Gly Glu Lys Thr Met Phe Glu Leu Gly 

1-5 200 205 

Ala Leu Val Val Gly Ser Ala Asn He Asn Leu Met Glu Asn Val Asn 

210 215 220 

Leu He Thr Lys Ala Ser Phe Phe Ser Ala Tyr Thr His Asp Phe Gly 
230 235 240 

Asn He Asp He Asn Trp Glu Ala Met Leu Ala Net Lys He Asn Lys 

245 250 255 

Phe Leu Thr Ala Thr He Ala Thr Asn Leu He Tyr Asp Asp Asp Val 

260 265 270 

Lys He Asn Asp Gly Pro Lys He Gin Phe Lys Glu Val Val Gly Val 

275 280 285 

Gly Val Ala Tyr Thr Phe 
2 90 



(2) INF-ORMATION FOR SEQ ID NO: ^80 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEtJGTH: 204 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL: YES 

( V i ) OR I G I J lAL SOU RCE : 

(A) ORGAIIISM: Po rphyromonas gingival is 

(i:0 FEATURE: 

(A) MAIIE/?:EY: misc_feature 

(B) LOCATION 1. . .204 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



429/ 490 



30 



50 



55 



60 



65 



70 



75 



(xi) SBQUEMCE DESCRI PTIOl ! : SEQ ID MO:^80 





net 


Lys 




net 


He 


Leu 


Ala 


Ala 


Thr 


Met 


Leu 


Leu 


Ala 


Thr 


He 


Gly 


5 


1 






5 










10 










15 






Phe 


Ala 


Asn 


Ala 
20 


Gin 


Ser 


Arg 


Pro 


Ala 
25 


Leu 


Arg 


Leu 


Asp 


Ala 
30 


Asn 


Phe 




Val 


Gly 


Ser 


Asn 


Leu 


Met 


Gin 


Lys 


Val 


Ala 


Asn 


Thr 


Ser 


Val 


Asn 


Asn 






35 










40 










45 








10 


hys 


Met 


He 


Val 


Gly 


Leu 


Arg 


Val 


Gly 


Ala 


Ala 


Ala 


Glu 


Phe 


Ala 


Leu 




50 










55 






















Ser 


Asn 


Asp 


Gly 


Phe 


Tyr 


Leu 


Ala 


Pro 


Gly 


Leu 


Ala 


Tyr 


Thr 


Met 


Arg 




65 








70 










75 










80 




Gly 


Ala 


Lys 


Met 


Glu 


Ser 


Leu 


Ser 


Glu 


Thr 


Thr 


Thr 


Arg 


Leu 


His 


Tyr 


15 








85 










90 
















Leu 


Gin 


He 


Pro 
100 


vra 


Asn 


Ala 


Gly 


Met 
105 


Arg 


Phe 


Ser 


Phe 


Al a 
110 


Asp 


Asn 




net 


Ala 


He 
115 


Ser 


Leu 


Glu 


Al a 


Gly 
120 


Pro 


Tyr 


Phe 


Ala 


Tyr 
125 


Gly 


Val 


Ala 


20 


Gly 


Thr 


He 


Lys 


Thr 


Lys 


Val 


Ala 


Gly 


Val 


Thr 


Ala 


Ser 


Va 1 


Asp 


Ala 




130 










135 










14 0 












Phe 


Gly 


Asp 


Asn 


Gly 


Tyr 


Asn 


Arg 


Phe 


Asp 


Leu 


Gly 


Leu 


Gly 


Leu 


Ser 












150 










155 










160 




Ala 


Ala 


Leu 


Ser 


Tyr 


Asp 


Arg 


Tyr 


Tyr 


Val 


Gin 


He 


Gly 


Tyr 


Glu 


His 


25 










165 










170 










175 






Gly 


Leu 


Leu 


A£-n 


Met 


Leu 


Lys 


Asp 


Ala 


Pro 


Asp 


Lys 


Thr 


Ser 


Leu 


Arg 








180 










185 










1 90 








Asn 


His 


Asp 
195 


Phe 


Phe 


Val 


Gly 


Leu 
200 


Gly 


Val 


Arg 


Phe 











{2) in FORMATION FOR SEQ ID 110:481 



(i) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 24 3 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

40 {iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAIIISM: Porphyromonas gingival is 

45 (i::) FEATURE: 

(A) NAME/ KEY: nusc_feature 

(B) LOCATIOI! 1 . . .24 3 



{::i) SEQUENCE DESCRIPTION: SEQ ID NO: 4 01 



Met 


Lys 


Arg 


He 


Phe 


Thr 


Val 


Ala 


Leu 


Val 


Leu 


Leu 


Ala 


Ser 


Val 


Thr 


1 






5 










10 










15 




Met 


Ala 


He 


Gly 


Gin 


Ser 


Arg 


Pro 


Ala 


Leu 


Arg 


Val 


Asp 


Ala 


Asn 


Phe 








20 










25 










30 






Val 


Gly 


Ser 


Asn 


Gin 


Ser 


Met 


Lys 


Arg 


Asp 


Gly 


Tyr 


Val 


Trp 


Asp 


Thr 




35 










40 










45 








Lys 


Met 


Asn 


Val 


Gly 


Leu 


Arg 


Val 


Gly 


Ala 


Ala 


Ala 


Glu 


Phe 


I let 


He 




50 










55 










60 










Gly 


Ser 


Arg 


Gly 


Phe 


Tyr 


Leu 


Ala 


Pro 


Gly 


Leu 


Asn 


Tyr 


Thr 


1 let 


Lys 


65 








70 










75 










8 0 


Gly 


Ser 


hy^ 


Thr 


Glu 


Trp 


Asp 


He 


Pro 


Glu 


Met 


Val 


Pro 


Gly 


Thr 


Tyr 








85 










90 










95 




He 


Thr 


Met 


Val 


Ser 


Thr 


Arg 


Leu 


His 


Tyr 


Leu 


Gin 


Leu 


Pro 


He 


Asn 








100 










105 










110 






Ala 


Gly 


Met 


Arg 


Phe 


Asp 


Leu 


Met 


Asn 


Asp 


Met 


Ala 


Val 


Ser 


He 


Glu 






115 










120 










125 








Ala 


Gly 


Pro 


Phe 


Leu 


Ala 


Tyr 


Gly 


He 


Tyr 


Gly 


Thr 


Tyr 


Arg 


Gin 


Lys 




130 










135 










140 










Leu 


Glu 


Gly 


Trp 


Lys 


Pro 


Asn 


Asn 


Tyr 


Ser 


Thr 


Glu 


Phe 


Phe 


Gly 


Pro 


145 










150 










155 










160 


Thr 


Leu 


Gly 


Gly 


Pro 


Thr 


Asn 


He 


Arg 


Trp 


Asp 


He 


Gly 


Ala 


Asn 


He 










165 










170 










175 




He 


Ala 


Ala 


Phe 


His 


Tyr 


Lys 


Arg 


Tyr 


Tyr 


He 


Gin 


He 


Gly 


Tyr 


Glu 








1 80 










185 










190 






His 


Gly 


Phe 


Val 


Asp 


He 


Val 


Ser 


Gly 


Gly 


Gly 


Ser 


Asp 


He 


Pro 


Arg 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_I_> 



10 



15 



55 



WO 99/29870 PCT/AU98/01023 

430/ 490 

19b 200 205 

Leu A^n Asp Asn Arg Gin Ser Ser Ser Thr Thr Ala Leu Arg Glu Lys 

210 215 22 0 

Gly A^n Am Glu Tyr Ala Tyr Asn Arg Asp Phe Phe Val Gly He Gly 
225 230 235 2-^0 

Tyr Arg Pho 

(2) IJiFORMATIOU FOR SEQ ID 110:482 

( i ) SEQUEIICE CHARACTERISTICS : 

(A) LENGTH: 2 07 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGV: linear 

(ii) MOLECULE TYPE: protein 

( i i i ) MY POT H ET 1 C AL : YES 



20 (vi) ORIGIIIAL SOURCE: 

(A) ORGAIIISll: Porphyromor.as gingival! ; 

ii::) FEATURE: 

(A) IIAJIE/KEY: mi5C_feature 
25 (P.) l.OCATIOI! l..,207 

irS) SEQUEliCE DESCRI PTIOil : SEO ID 110:482 



30 


Met 


Ly- 


Arg 


Met 


Leu 


Leu 


Leu 


Leu 


Val 


Val 


Leu 


Leu 


Tyr 


Gly 


He 


Ala 


1 




5 










10 








15 






Gly 


Arg 


Leu 


Ala 
20 


Ala 


Gin 


Asp 


Val 


He 
25 


Arg 


Pro 


Trp 


Ser 


Leu 
30 


Gin 


Val 




Gly 


Ala 


Gly 


Tyr 


Ser 


Asp 


Thr 


Glu 


Asn 


He 


Pro 


Gly 


Gly 


Phe 


Thr 


Tyr 


35 






35 










40 


















Gly 


Phe 
50 


Tyr 


Leu 


Gly 


L-ys 


Arg 
55 


Met 


Gly 


Ser 


Phe 


Leu 

60 


Glu 


Val 


Gly 


Leu 




Ser 


Met 


Tyr 


Asn 


Ser 


Thr 


Arg 


Gin 


Thr 


Ala 


Asn 


Asn 


Ala 


Asp 


Ser 


Phe 




65 










7 0 










75 










80 


40 


Ala 


Ser 


Asn 


Glu 


Gly 


Asp 


Gly 


Ser 


Phe 


Gin 


Val 


Asn 


Met 


Ser 


Ser 


Pro 










85 






90 










95 






Asn 


Glu 


Ly^ 


Trp 

100 


Ser 


Phe 


Phe 


Asp 


Ala 
105 


Gly 


Ser 


Ala 


Asn 


Cys 
110 


Tyr 


Met 




He 


Val 


Val 


Gly 


Val 


Asn 


Pro 


Leu 


His 


Leu 


Phe 


Trp 


Gin 


Asn 


Ser 


Arg 


45 






115 










120 










125 








His 


Asn 
130 


Leu 


Phe 


Leu 


Ala 


Val 
135 


Gin 


Ala 


Gly 


Leu 


Ser 
14 0 


Asn 


Lys 


His 


Asn 




He 


Hi^ 


Phe 


Ho 


Tyr 


Gly 


Asp 


Lys 


Gly 


Ala 


Lvs 


Val 


Ser 


He 


Tyr 


Thr 




1^5 










150 










155 










160 


50 


Asn 


Ser 


Asn 


Thr 


Tyr 


He 


Gly 


Tyr 


Gly 


Ala 


Arg 


Val 


Ala 


Tyr 


Glu 


Tyr 










165 










17 0 










175 






Gin 


He 


His 


hys 
180 


Asn 


Val 


Gly 


Ala 


Gly 
185 


Ala 


Al a 


Val 


1 let 


Tyr 
190 


Asp 


His 




G 1 y 


Asn 


Lys 
195 


Met 


Leu 


Thr 


Ala 


Met 
200 


Ala 


Thr 


Leu 


Ser 


Thr 
205 


His 


Phe 





(2) INFORMATION FOR SEQ ID NO: 4 83 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 951 amino acids 
60 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

65 (Hi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Porphyrontonas gingival is 

70 (ix) FEATURE: 

(A) NAME/ KEY: misc^feature 

(B) LOCATION 1 ... 951 



75 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: -185 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID; <WO 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



431/490 

Ik-t Arg Val Ser Asp Leu Cys Ser Arg Leu Ser Trp Leu Leu Pro Val 
15 10 15 

He Leu Val Gly Leu Leu Cys Ala Thr Leu Val Ala Ala Glu Arg Pro 
20 25 30 

5 Met Ala Gly A3 a Val Gly Leu His His Arg Arg His Ala Ala Leu Ser 
35 40 45 

A«^p S^r Thr Ala Lys Asp Thr Val Pro Leu Ala Lys Pro He Pro Asp 

50 55 60 

S«r Ala Fhe Arg Asp Ser Leu Pro Ala Asp Ser Thr Giy Ser Met Arg 
10 65 70 75 80 

Gin Asp Ser Val Tyr Asp Asp Glu Phe Glu Leu Glu Asp He Val Glu 

85 90 95 

Tyr Glu Ala Ala Asp Ser He Val Leu Leu Gly Gin Asn Arg Ala Tyr 
3 00 105 110 

15 Leu Phe Gly Lys Ser Tyr Val Ser Tyr Gin Lys Ser Arg Leu Glu Ala 
115 120 125 

Asn Phe Met Tyr Leu Asn Thr Asp Ser Ser Thr Val Tyr Thr Arg Tyr 

130 135 140 

Val Leu Asp Thr Ala Gly Tyr Pro Met Ala Phe Pro Val Phe Lys Asp 
20 145 ISO 165 160 

Gly Glu Gin Ser Phe Glu Ala Lys Asn Phe Thr Tyr Asn Phe Arg Thr 

165 170 175 

Glu Lys Gly He He Ser Gly Val He Thr Gin Gin Gly Glu Gly Tyr 
180 185 190 

25 Leu Thr Ala Gly Lys Thr Lys Lys Met Pro Asp Asn He Met Phe Met 
195 200 205 

Gin Gly Gly Arg Tyr Thr Thr Cys Asp Asn His Asp His Pro His Phe 

210 215 220 

Tyr He Asn Leu Ser Lys Ala Lys Val His Pro Glu Lys Asp He VaJ 
30 225 230 235 240 

Thr Gly Pro Val Asn Leu Val He Ala Asp Met Pro Leu Pro He Gly 

245 250 255 

Leu Pro Phe Gly Tyr Phe Pro Phe Ser Asn Lys Tyr Ser Ser Gly He 
260 265 270 

35 Leu Met Pro Thr Tyr Gly Glu Asp Asn Arg Tyr Gly Phe Tyr Leu Arg 
275 280 285 

Asn Gly Gly Tyr Tyr Phe Ala Phe Ser Asp Tyr He Asp Leu Ala Leu 

290 295 300 

Arg Gly Glu He Phe Ser Lys Gly Ser Trp Gly He Ser Ala Gin Ser 
40 305 310 315 320 

Lys Tyr Lys Lys Arg Tyr Lys Tyr Asn Gly Ser Phe Glu Ala Asn Tyr 

325 330 335 

Leu Val Ser Lys Ser Gly Asp Lys Tyr Val Pro Gly Asp Tyr Ser Lys 
340 345 350 

45 Thr Thr Ser Leu Asn He Arg Trp Thr His Ser Gin Asp Pro Lys Ala 
355 360 365 

Asn Fro Leu Gin Thr Leu Ser Ala Asn Val Asn Fhe Ala Thr Gly Ser 

370 375 380 

T-'-r Phe Gin Asn Ser Leu Asn Thr Thr Tyr Asp Val Asn Ala Arg Thr 
50 385 390 395 400 

Ala Thr Thr Ara Ser Ser Ala Val Ser Tyr Ser Arg Lys Phe Pro Gly 

405 410 415 

Thr Pro Phe Ser He Thr Gly Ser Met Asp He Ser Gin Asn Met Arg 
420 425 ^30 

55 Asp Thr Thr Val Ser Leu Thr Leu Pro Asn Leu Ser He Asn Met Ser 
435 440 445 

Thr Arg Tyr Pro Phe Lys Arg Lys Thr Arg Val Gly Pro Glu Arg Trp 

450 455 460 

Tyr Glu Lys Leu Ser Val Gly Tyr Ser Gly Gin Leu Arg Asn Ser He 
60 465 470 475 480 

Leu Thr Lys Glu Lys Asp Leu Leu Gin Ser Asn Leu Val Arg Asp Trp 

485 490 495 

Lys Asn Gly Met Arg His Ser Val Pro He Ser Leu Thr Val Pro Leu 
500 505 510 

65 Leu Asp Tyr He Asn Leu Thr Met Gly Val Asn Tyr Asn Glu Trp Trp 
515 520 525 

Tyr Thr Lys Gly He Arg Lys Ser Trp Asn Glu Asp Lys Lys Thr Phe 

530 535 540 

Leu Pro Ser Asp Thr Thr Tyr Lys Phe Arg Arg Leu Tyr Asp Tyr Ser 
70 545 550 555 560 

Leu Ser Ala Gly Leu Ser Thr Thr Leu Tyr Gly Met Phe Lys Pro Trp 

565 570 575 

Lys Pro Phe Ser Phe Gly Gly Asn Leu He Met He Arg His Arg Phe 
580 585 590 

75 Thr Pro Thr Val Ser Phe Ser Tyr Met Pro Asp Phe Thr Lys Arg Arg 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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432/ 490 

S?5 600 605 

Tyr Gly Phe Trp Glu Leu Leu Glu His Thr A^p Gin Asn Gly Lys L-u 

^10 615 620 

His Thr Lc-u Leu Tyr Ser Pro Tyr Phe Glu Gin He Phe Gly Ala Prr. 
5 625 630 635 640 

Ser Met Gly Asn Ala Gly Ser Val Asn Phe Ser Phe Asp Asn Asn Leu 

645 650 655 

Glu Ala Lys He Lys Ser Lys Ser Asp Ser Thr Gly He Lys Lys He 

665 670 
lU Ser Leu He Asp Gin Phe Thr Trp Ser Thr Ser Tyr Asn Met Phe Ala 
675 680 685 

Asp Ser He Arg Trp Ser Asn He Ser Ala Ser Leu Ala Leu Arg L»u 

690 695 700 

Ser Lys Ser Phe Thr Leu Arg Leu Ser Gly Leu Phe Asp Pro Tyr Leu 
15 705 710 715 720 

Thr Lys Tyr Tyr Glu Gly Glu Asp Gly Lys He He Pro Tyr Lys Ser 

725 730 735 

Asn Asp Leu Arg He Phe Asn Gly Lys Gly Leu Ala Ara Leu He Ser 

ry^i "^"^^ " 750 

ZU Thr Gly Thr Ser Phe Ser Tyr Thr Leu Asn Lvs Glu Ser Leu Ser Gly 
755 760 765 

Leu He Ala Leu Phe Ser Gly Lys Lys Glu Ara Arg Asp Glu Lys L"s 

■77':' 775 ^ 780 

Asn Thr Gly Ala Thr Pro His Glu Gly Asp Asp Ala Ala Asp He L-u 
25 785 790 -Toc ^ 

Glu Gly Giy Arg Pro Gin Asn Glu Ser Gly Gly Ser Leu Leu Glu Arg 

805 810 815 

Asn Arg Gin Gly Gly Ala Val Asp Gin Asp Gly Tyr Phe Ala Tyr Ser 
on 925 830 

He Fro Trp Ser Leu Ser Phe Asp Tyr Ser Trp Asn He Ala Thr Asp 

835 840 845 

Tyr Asn Arg Tyr Asn Val Asn Lys Met Glu His Tyr Tyr Arg Val Thr 

850 855 860 

Gin Asn Leu Ser Phe Arg Gly Asn He Gin Pro Thr Pro Asn Trp S'^^r 
OD 865 870 P75 ggo 

Phe Gly Phe Asn Ala Asn Tyr Asn Phe Asp Leu Lys Lys He Thr Ser 

885 890 895 

Leu Thr Cys Asn Val Thr Ara Asp Met His Cys Trp Ala He Ser Ala 

900 905 910 

Ser Phe He Pro He Gly Ala Tyr Lys Ser Tyr Asn Phe Val He Ser 

915 920 925 

Val Lys Ser Ser Leu Leu Gin Asp Leu Lys Tyr Gin Gin Ser Asn Ara 

930 935 
Pro He Thr Asn Thr Trp Tyr 
45 945 950 

(2) IHFORnATIOH FOR SEQ ID tJO:484 

(i) SEQUEIJCE CHARACTERISTICS: 

(A) LEIIGTH: 1226 amino acids 

(B) TYPE: amino acid 
{ D) TOPOLOGY: linear 

(ii) HOLECLfLE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAIIISII: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAI IE/KEY: misc_feature 

(B) LOCATIOn 1. . . 1226 

(xi) SEQUENCE DESCRIPTIOH: SEQ ID 110:484 



40 



50 



55 



60 



65 



70 



75 



Met 


liet 


Lys 


Arg 


Tyr 


Thr 


He 


He 


Leu 


Ala 


Val 


Phe 


Leu 


Leu 


Phe 




1 








5 










10 










15 




Thr 


Val 


Phe 


Thr 
20 


Phe 


Gin 


li- 


Lys 


Ala 
25 


Arg 


Pre 


Tyr 


Glu 


Arg 

30 


Phe 


Ala 


Asp 


Val 


Glu 


Lys 


Pro 


Trp 


ne 


Gin 


Lys 


His 


Ser 


Met 


Asp 


Ser 


Lys 


Leu 


Val 




35 










40 










45 






Pro 


Ala 


Asn 


Lys 


Gly 


Asn 


Leu 


He 


Gin 


Ala 


Glu 


He 


Val 


Tyr 


Gin 


Ser 


50 










55 










60 










Val 


Ser 


Glu 


His 


Ser 


Asp 


Leu 


Val 


He 


Ser 


Pro 


Val 


Asn 


Glu 


He 
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65 70 75 80 

Arg Pro Ala Asn Arg Ph- Pro Ser His Arg hys Ser Phe Phe Ala Glu 

85 90 ^'5 

Asn Leu Arg Ala Ser Pro Pro Val Val Pro Val Ala Val A:rp Ly^ Tyr 
5 100 105 110 

Ala Val Pro Val Ala Asn Pro Met Asp Pro Glu Asn Pro Asn Ala Trp 

115 120 125 

Asp Val Thr Leu Lys lie Thr Thr Lys Ala Val Thr Val Pro Val Asp 
130 135 1^0 

10 Val Val Met Val He Asp Gin Ser Ser Ser Ilet Gly Gly Gin Asn He 
145 150 156 160 

Ala Ara Leu Lys Ser Ala He Ala Ser Gly Gin Arg Phe Val Lys Lys 

165 170 175 

Met Leu Pro Lys Gly Thr Ala Thr Glu Gly Val Arg He Ala Leu Val 
15 180 185 190 

Ser Tyr Asp His Glu Pro His Arg Leu Ser Asp Phe Thr Lys Asp Thr 

195 200 205 

Ala Phe Leu Cys Gin Lys He Arg Ala Leu Thr Pro He Trp Gly Thr 
210 215 220 

20 His Thr Gin Gly Gly Leu Lys (let Ala Arg Asn He llet Ala Thr Ser 
2 25 23 0 2 35 2-4 0 

Thr Ala Val Asp Lys Hi^- He He Leu Met Ser Asp Gly Leu Ala Thr 

245 250 255 

Glu Gin Tyr Pro Val Lys Asn Val Thr Thr Ala Asp Phe He Gly Lys 
25 260 2G5 270 

Thr Gly Asn Ala Asn Asp Pro He Asp Leu Val He Gin Gly Ala He 

275 280 285 

Asn The Pro Thr Asn Tyr Val Ser Asn Asn Pro Ser Thr Pro Leu Thr 
290 295 300 

30 Pro Asn Tyr Pro Thr His Ser Ser Lys Val Gly Arg Arg Asn Leu Pro 
305 310 315 320 

Glu S-r Lys Phe Asp Tyr Ser Asn Leu Ser Ala Arg He Thr Phe Asp 

325 330 335 

Gly Val Ala Gly Ala Leu Val Tyr Glu Pro Arg Phe Pro His Pro Tyr 
35 340 345 350 

Tyr Tyr Tyr Phe Pro Cys Asn Ala Ala He Asn Glu Ala Gin Phe Ala 

355 360 365 

Lys Asn Ser Gly Tyr Thr He His Thr He Gly Tyr Asp Leu Gly Asp 
370 375 380 

40 Phe Ala Leu Ala Asn Asn Ser Leu Lys Leu Thr Ala Thr Asp Glu Asn 
305 390 395 400 

His Phe Phe Thr Ala Thr Pro Ala Asn Leu Ala Ala Ala Phe Asp Asn 

405 410 415 

He Ala Gin Thr He Asn He Gly He Gin Arg Gly Glu Val Thr Asp 
45 420 425 430 

Pile Val Ala Pro Gly Phe He Val Lys Asn Leu Thr Gin Ser Gly Asp 

4 35 44 0 44 5 

Val Thr His Leu Leu Asn Val Ser Asn Gly Thr Val His Tyr Asp Val 
450 455 460 

50 Ser Thr Lys Lys Leu Thr Trp Thr Thr Gly Thr He Leu Ser Ser Ser 
465 470 475 480 

Glu Ala Thr He Thr Tyr Arg He Tyr Ala Asp Leu Asp Tyr He Gin 

485 490 495 

Asn Asn Asp He Pro Val Asn Thr Thr Ser Ala He Gly Pro Asp Leu 
55 500 505 510 

Gly Gly Phe Asp Thr Asn Thr Glu Ala Lys Leu Thr Tyr Thr Asn Ser 

515 520 525 

Asn Gly Glu Pro Asn Gin Gin Leu He Phe Pro Arg Pro Thr Val Lys 
530 535 540 

60 Leu Gly Tyr Gly Val He Lys Arg His Tyr Val Leu Val Asn Lys Asp 
545 550 555 560 

Gly Gin Pro He Gin Ala Asn Gly Thr Val Val Ser Ser Leu Ser Glu 

565 570 575 

Ala His Val Leu Gin Ser Gin Asp Phe Phe Leu Pro Ser Gly Gly Gly 
65 580 585 590 

His He Val Pro Lys Trp He Lys Leu Asp Lys Thr Thr Glu Ala Leu 

596 600 e05 

Gin Tyr Tyr Ser Val Pro Pro Thr Asn Thr Val He Thr Thr Ala Asp 
610 615 620 

70 Gly Lys Arg Tyr Arg Phe Val Glu Val Pro Gly Ser Thr Pro Asn Pro 
625 630 635 640 

Gly Gin He Gly He Ser Trp Lys Lys Pro Ala Gly Asn Ala Tyr Phe 

645 650 655 

Ala Tyr Lys Leu Leu Asn Tyr Trp Met Gly Gly Thr Thr Asp Gin Gin 
75 660 665 67 0 
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Set Glu Trp Asp Val Thr Ser Asn Trp Thr Gly Ala Gin Val Pro Leu 

675 680 685 

Thr Gly Glu Asp Val Glu Phe Ala Thr Thr Glu Asn Phe Gly Ser Pro 
690 695 700 

5 Ala Val Ala Asp Leu His Val Pro Thr Thr Asn Pro Lys He He Gly 
705 710 715 720 

Asn Leu He Asn Asn Ser Asp Lys Asp Leu Val Val Thr Thr Ser Ser 

725 730 735 

Gin Leu Thr He Asn Gly Val Val Glu Asp Asn Asn Pro Asn Val Gly 
10 74 0 745 750 

Thr He Val Val Lys Ser Ser Lys Asp Asn Pro Thr Gly Thr Leu Leu 

755 760 765 

Phe Ala Asn Pro Gly Tyr Asn Gin Asn Val Gly Gly Thr Val Glu Phe 
770 775 790 

15 Tyr Asn Gin Gly Tyr Asp Cys Ala Asp Cys Gly Met Tyr Arg Arg Ser 
785 790 795 800 

Trp Gin Tyr Phe Gly He Pro Val Asn Glu Ser Gly Phe Pro He Asn 

805 810 815 

Asp Val Gly Gly Asn Glu Thr Val Asn Gin Trp Va3 Glu Pro Phe Asm 
20 82 0 825 830 

Gly Asp Lys Trp Arg Pro Ala Pro Tyr Ala Pro Asp Tlir Glu Leu Gin 

835 840 845 

Lys Phe Lys Gly Tyr Gin He Thr Asn Asp Val G] n Ala Gin Pro Thr 
850 855 860 

25 Gly Val Tyr Ser Phe Lys Gly Met He Cys Val Cys Asp Ala Phe Leu 
865 870 875 880 

Asn Leu Thr Arg Thr Ser Gly Val Asn Tyr Ser Gly Ala Asn Leu He 

885 890 895 

Gly Asn Ser Tyr Thr Gly Ala He Asp He Lys Gin Gly He Val Phe 
30 900 905 910 

Pro Pro Glu Val Glu Gin Thr Val Tyr Leu Phe Asn Thr Gly Thr Arg 

915 920 925 

Asp Gin Trp Arg Lys Leu Asn Gly Ser Thr Val Ser Gly Tyr Arg Ala 
930 935 94 0 

35 Gly Gin Tyr Leu Ser Val Pro Lys Asn Thr Ala Gly Gin Asp Asn Leu 
945 950 955 960 

Pro Asp Arg He Pro Ser Het His Ser Phe Leu Val Lys Met Gin Asn 

965 970 975 

Gly Ala Ser Cys Thr Leu Unk He Leu Tyr Asp Lys Leu Leu Lys Asn 
40 980 985 990 

Thr Thr Val Asn Asn Gly Asn Gly Thr Gin He Thr Trp Arg Ser Gly 

995 1000 1005 

Asn Ser Gly Ser Ala Asn Met Pro Ser Leu Val Met Asp Val Leu Gly 
1010 1015 1020 

45 Asn Glu Ser Ala Asp Arg Leu Trp He Phe Thr Asp Gly Gly Leu Ser 

1025 1030 1035 1040 

Phe Gly Phe Asp Asn Gly Trp Asp Gly Arg Lys Leu Thr Glu Lys Gly 

1045 1050 1055 

Leu Ser Gin Leu Tyr Ala Met Ser Asp He Gly Asn Asp Lys Phe Gin 
50 1060 1065 1070 

Val Ala Gly Val Pro Glu Leu Asn Asn Leu Leu IJe Gly Phe Asp Ala 

1075 1080 1085 

Asp Lys Asp Gly Gin Tyr Thr Leu Glu Fhe Ala Leu Ser Asp His Phe 
1090 1095 HOC 

55 Ala Lys Gly Ala Val Tyr Leu His Asp Leu Gin Ser Gly Ala Lys His 

1105 1110 1115 1120 

Arg He Thr Asn Ser Thr Ser Tyr Ser Phe Asp Ala Lys Arg Gly Asp 

1125 1130 1135 

Ser Gly Ala Arg Phe Arg Leu Ser Tyr Gly Cys Asp Glu Asn Val Asp 
60 1140 1145 1150 

Asp Ser His Val Val Ser Thr Asn Gly Arg Glu He He He Leu Asn 

1155 1160 1165 

Gin Asp Ala Leu Asp Cys Thr Val Thr Leu Phe Thr He Glu Gly Lys 
1170 1175 1180 

bo Leu Leu Arg Arg Leu Lys Val Leu Ala Gly His Arg Glu Val Met Lys 

HQS 1190 1195 1200 

Val Gin Thr Gly Gly Ala Tyr Tie Val His Leu Gin Asn Ala Phe Thr 

1205 1210 1215 

Asn Asp Val His Lys Val Leu Val Glu Tyr 
70 1220 1225 

{2) II I FORMATION FOR SEQ ID NO: 4 85 

(i) SEQUENCE CHARACTERISTICS: 
75 (A) LENGTH: 1225 amino acids 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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wo 99/29870 
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435/490 



10 



( B ) TYPE : ami no acid 
( D) TOPOLOGY : linear 

(ii) IIOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIMAL SOURCE: 

(A) ORGAIIISII: Porphyromonas gingivalif 

(ix) L'EATURE: 

(A) IJAJ'IE/KEY: misc_f ea t ur e 

(B) LOCATIOM 1 . . . 1225 



15 (xi) SEQUEMCE DESCRI PTIOi I : SEQ ID 110:485 

II- 1 Lys Arg Tyr Thr lie lie Leu Ala Val Phe Leu Leu Phe Cy:s Thr 
1 5 10 15 

Val Phe Thr F?he Gin He Lys Ala Arg Pro Tyr Glu Arg Plie Ala A^p 
20 20 25 30 

Val Glu Lys- Pro Trp He Gin Lys Hi^ Ser (let A?p Ser Lys Leu Val 

35 4 0 4S 

Pro Ala A-n Ly^ Gly Asn Leu He Gin Ala Glu He Val Tyr Gin Ser 
50 55 60 

25 Val Ser Glu His Ser Asp Leu Val He Ser Pro Val Asn Glu He Arg 
65 70 75 80 

Pro Ala Asn Arg Phe Pro Ser His Arg Lys Ser Phe Phe Ala Glu Asn 

05 90 95 

Leu Ara Ala Ser Pro Pro Val Val Pro Val Ala Val Asp Lys Tyr Ala 
30 ^ 100 105 110 

Val Pro Val Ala Asn Pro Met Asp Pro Glu Asn Pro Asn Ala Trp Asp 

115 120 125 

Val Thr Leu Lys He Thr Thr Lys Ala Val Thr Val Pro Val Asp Val 
130 135 140 

35 Val Met Val He Asp Gin Ser Ser Ser Met Gly Gly Gin Asn He Ala 
145 150 155 160 

Arg Leu Lys Ser Ala He Ala Ser Gly Gin Arg Phe Val Lys Lys Met 

165 170 175 

Leu Pro Lys Gly Thr Ala Thr Glu Gly Val Arg He Ala Leu Val Ser 
40 180 185 190 

Tyr Asp His Glu Pro His Arg Leu Ser Asp Phe Thr Lys Asp Thr Ala 

195 200 205 

Phe Leu Cys Gin Lys He Arg Ala Leu Thr Pro He Trp Gly Thr His 
210 215 220 

45 Thr Gin Gly Gly Leu Lys Met Ala Arg Asn He Met Ala Thr Ser Thr 
225 230 235 240 

Ala Val Asp Lys His He He Leu Met Ser Asp Gly Leu Ala Thr Glu 

245 250 255 

Gin Tyr Pro Val Lys Asn Val Thr Thr Ala Asp Phe He Gly Lys Thr 
50 260 265 270 

Gly Asn Ala Asn Asp Pro He Asp Leu Val He Gin Gly Ala He Asn 

275 280 285 

Phe Pro Thr Asn Tyr Val Ser Asn Asn Pro Ser Thr Pro Leu Thr Pro 
''Gi't 295 300 

55 Asn Tyr Pro Thr His Ser Ser Lys Val Gly Arg Arg Asn Leu Pro Glu 
305 310 315 320 

Ser Lys Phe Asp Tyr Ser Asn Leu Ser Ala Arg He Thr Phe Asp Gly 

325 330 335 

Val Ala Gly Ala Leu Val Tyr Glu Pro Arg Phe Pro His Pro Tyr Tyr 
60 340 345 350 

Tyr Tyr Phe Pro Cys Asn Ala Ala He Asn Glu Ala Gin Phe Ala Lys 

355 360 365 

Asn Ser Gly Tyr Thr He His Thr He Gly Tyr Asp Leu Gly Asp Phe 
37 0 37 5 380 

65 Ala Leu Ala Asn Asn Ser Leu Lys Leu Thr Ala Thr Asp Glu Asn His 
385 390 395 400 

Phe Phe Thr Ala Thr Pro Ala Asn Leu Ala Ala Ala Phe Asp Asn He 

4 05 410 415 

Ala Gin Thr He Asn He Gly He Gin Arg Gly Glu Val Thr Asp Phe 
70 420 425 430 

Val Ala Pro Gly Phe He Val Lys Asn Leu Thr Gin Ser Gly Asp Val 

4 35 4 40 4 45 

Thr His Leu Leu Asn Val Ser Asn Gly Thr Val His Tyr Asp Val Ser 
4 50 4 55 4 60 

75 Thr Lys Lys Leu Thr Trp Thr Thr Gly Thr He Leu Ser Ser Ser Glu 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



4 65 










4 70 










4 75 










4 80 


Ala 


Thr 


lie 


Thr 


Tyr 


Arg 


He 




Ala 


Asp 


Leu 


Asp 


Tyr 


He 


Gin 


Asn 










485 










4 90 










4 95 




Asn 


Asp 


lie 


Pro 


Val 


Asn 


Thr 


Thr 


Ser 


Ala 


He 


Gly 


Pro 


Asp 


Leu 


Gly 








500 










505 










510 






Gly 


Phe 


Asp 


Thr 


Asn 


Thr 


Glu 


Ala 


Lys 


Leu 


Thr 


Tyr 


Thr 


Asn 


Ser 


Asn 




515 










520 








525 








Gly 


Gl u 


Pro 


Asn 


Gin 


Gin 


Leu 


He 


Phe 


Pro 


Arg 


Pro 


Thr 


Val 


Lys 


Leu 




530 










535 










5 '10 










Gly 


Tyr 


Gly 


Val 


lie 


Lys 


Arg 


His 


Tyr 


Val 


Leu 


Val 


Asn 


Lys 


Asp 




54 5 










550 










555 










560 


Gin 


Pro 


lie 


Gin 


Ala 


Asn 


Gly 


Thr 


Val 


Val 


Ser 


Ser 


Leu 


Ser 


Glu 


Ala 










565 










570 










575 




His 


Val 


Leu 


Gin 


Ser 


Gin 


Asp 


Phe 


Phe 


Leu 


Pro 


Ser 


Gly 


Gly 


Gly 


His 








58 0 










585 










590 






lie 


Val 


Pro 


Lys 


Trp 


He 


Lys 


Leu 


Asp 


Lys 


Thr 


Thr 


Glu 


Ala 


Leu 


Gin 






595 










600 










605 








Tyr 


Tyr 


Ser 


Val 


Fro 


Pro 


Thr 


Asn 


Thr 


Val 


He 


Thr 


Thr 


Al a 


Asp 


Gly 




610 










615 










62 0 










Ly5 


Arg 


Tyr 


Arg 


Phe 


Val 


Glu 


Val 


Pro 


Gly 


Ser 


Thr 


Pro 


Asn 


Pro 


Gly 


625 










630 










635 










640 


Gin 


lie 


Gly 


lie 


Ser 


Trp 


Lys 


Lys 


Pro 


Ala 


Gly 


Asn 


Ala 


Tyr 


Phe 


Ala 










645 










650 










655 




Tyr 


Lys 


Leu 


Leu 


Asn 


Tyr 


Trp 


Met 


Gly 


Gly 


Thr 


Thr 


A-p 


Gin 


Gin 


Ser 








660 










665 










67 0 






Glu 


Trp 


Asp 


Val 


Thr 


Ser 


Asn 


Trp 


Thr 


Gl y 


Ala 


Gin 


Val 


Pro 


Leu 


Thr 






67 5 










680 










685 








Gly 


Glu 


Asp 


Val 


Glu 


Phe 


Ala 


Thr 


Thr 


Gl u 


Asn 


Phe 


Gly 


Ser 


Pro 


Ala 




690 










695 










700 










Vai 


Ala 


Asp 


Leu 


His 


Val 


Pro 


Thr 


Thr 


Asn 


Pro 


Lys 


He 


He 


Gl y 


Asn 


7 05 










710 










715 










7 2 0 


Leu 


lie 


Asn 


Asn 


Ser 


Asp 


Lys 


Asp 


Leu 


Val 


Val 


Thr 


Thr 


Ser 


Ser 


Gin 










725 










730 










7 35 




Leu 


Thr 


lie 


Asn 


Gly 


Val 


Val 


Glu 


Asp 


Asn 


Asn 


Pro 


Asn 


Vai 


Gly 


Thr 








74 0 










745 










750 






lie 


Val 


Val 


Lys 


Ser 


Ser 


Lys 


Asp 


Asn 


Pro 


Thr 


Gly 


Thr 


Leu 


Leu 


Phe 






755 










760 










765 








Ala 


Asn 


Pro 


Gly 


Tyr 


Asn 


Gin 


Asn 


Val 


Gly 


Gly 


Thr 


Val 


Glu 


Phe 


Tyr 




770 










775 










780 










Asn 


Gin 


Gly 


Tyr 


Asp 


Cys 


Ala 


Asp 


Cys 


Gly 


Met 


Tyr 


Arg 


Arg 


Ser 


Trp 


785 










790 










7 95 










800 


Gin 


Tyr 


Phe 


Gly 


lie 


Pro 


Val 


Asn 


Glu 


Ser 


Gly 


Phe 


Pro 


He 


Asn 


Asp 










805 










810 










815 




Val 


Gly 


Gly 


Asn 


Glu 


Thr 


Val 


Asn 


Gin 


Trp 


Val 


Glu 


Pro 


Phe 


Asn 


Gly 








820 










825 










830 






Asp 


Lys 


Trp 


Arg 


Pro 


Ala 


Pro 


Tyr 


Ala 


Pro 


Asp 


Thr 


Glu 


Leu 


Gin 


Lys 






83 5 










840 










84 5 








Phe 


Lys 


Gly 


Tyr 


Gin 


He 


Thr 


Asn 


Asp 


Val 


Gin 


Ala 


Gin 


Pro 


Thr 


Gly 




850 










855 










860 










Val 


Tyr 


Ser 


Phe 


Lys 


Gly 


1 let 


He 


Cys 


Val 


Cys 


Asp 


Ala 


Phe 


Leu 


Asn 


865 










870 










875 










830 


Leu 


Thr 


Arg 


Thr 


Ser 


Gly 


Val 


Asn 


Tyr 


Ser 


Gly 


Ala 


Asn 


Leu 


He 


Gly 










885 










8 90 










895 




Asn 


Ser 


Tyr 


Thr 


Gly 


Ala 


He 


Asp 


He 


Lys 


Gin 


G 1 y 


He 


Val 


Phe 


Pro 








900 










905 










910 






Pro 


Glu 


Val 


Glu 


Gin 


Thr 


Val 


Tyr 


Leu 


Phe 


Asn 


Thr 


Gly 


Thr 


Arg 


Asp 






9] 5 










920 










925 








Gin 


Trp 


Arg 


Lys 


Leu 


Asn 


Gly 


Ser 


Thr 


Val 


Ser 


Gly 


Tyr 


Arg 


Ala 


Gl j- 




930 










935 










94 0 










Gin 


Tyr 


Leu 


Ser 


Val 


Pro 


Lys 


Asn 


Thr 


Ala 


Gly 


Gin 


Asp 


Asn 


Leu 


Pro 


94 5 










950 










955 










960 


Asp 


Arg 


lie 


Pro 


Ser 


Met 


His 


Ser 


Phe 


Leu 


Val 


Lys 


Met 


Gin 


Asn 


Gly 










965 










97 0 










975 




Ala 


Ser 


Cys 


Thr 


Leu 


Unk 


He 


Leu 


Tyr 


Asp 


Lys 


Leu 


Leu 


Lys 


Asn 


Thr 








980 










985 










990 






Thr 


Val 


Asn 


Asn 


Gly 


Asn 


Gly 


Thr 


Gin 


He 


Thr 


Trp 


Arg 


Ser 


Gly 


Asn 






995 










1000 








1005 






S°r 


Gly 


Ser 


Ala 


Asn 


Met 


Pro 


Ser 


Leu 


Val 


Met 


Asp 


Val 


Leu 


Gly 


Asn 




1010 








1015 








1020 








Giu 


Ser 


Ala 


Asp 


Arg 


Leu 


Trp 


He 


Phe 


Thr 


Asp 


Gly 


Gly 


Leu 


Ser 


Phe 


102 


5 








1030 








1035 










Gly 


Phe 


Asp 


Asn 


Gly 


Trp 


Asp 


Gly 


Arg 


Lys 


Leu 


Thr 


Glu 


hys 


Gly 


Leu 










1045 








10 


50 








1055 


Ser 


Gin 


Leu 


Tyr 


Ala 


Met 


Ser 


Asp 


He 


Gly 


Asn 


Asp 


Lys 


Phe 


Gin 


Val 



104 0 



75 1060 1065 107 0 
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Ala Gly Val Pro Glu Leu Asn Asn Leu Leu lie Gly Phe Asp Ala Asp 

107S 1080 1086 

Lys A£-p Gly Gin Tyr Thr Leu Glu Phe Ala Leu Ser Asp His Phe Ala 
1090 1095 1100 

5 Lv^ Gly Ala Val Tyr Leu His Asp Leu Gin Ser Gly Ala Lys His Arg 

1105 1110 1115 1120 

He Thr Asn Ser Thr Ser Tyr Ser Phe Asp Ala Lys Arg Gly Asp Ser 

1125 1130 1135 

Gl'>- Ala Ara Phe Arg Leu Ser Tyr Gly Cys Asp Glu Asn Val Asp Asp 
10 ' " 1140 1145 1150 

Ser His Val Val Ser Thr Asn Gly Arg Glu He He He Leu Asn Gin 

1155 1160 1165 

Asp Ala Leu Asp Cys Thr Val Thr Leu Phe Thr He Glu Gly Lys Leu 
117 0 1175 118 0 

15 Leu Arg Arg Leu Lys Val Leu Ala Gly His Arg Glu Val Met Lys Val 

1185 1190 1195 1200 

Gin Thr Gly Gly Ala Tyr He Val His Leu Gin Asn Ala Phe Thr Asn 

1205 1210 1215 

Asp V=il His Lys Val Leu Val Glu Tyr 
20 1220 1225 

(2) II I FORMAT I Oil FOR SEQ ID MO: 4 86 

i\) SEQUCUCE CHARACTERISTICS: 
25 (A) LENGTH: 425 amino acids 

{[?) TYPE: amino acid 
(D) TOPOLOGY: linear 



30 



35 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAJJISM: Porphyromonas gingivali£ 

(i:;) FEATURE: 

(A) MAI IE/ KEY: misc_feature 

(B) LOCATION 1 ... 425 



40 




(xi) 


SEQUENC 


:E DESCRIPTION: SEQ ID NO: 4 86 












Met 


Glu 


Val 


Lys 


Lys 


Asn 


Thr 


Val 


Val 


Leu 


Arg 


Leu 


Leu 


He 


Trp 


Phe 




1 








5 










10 










15 






Val 


Ala 


He 


Leu 


Leu 


Phe 


His 


Ser 


Ser 


Arg 


Leu 


Trp 


Gly 


Gin 


Glu 


Gly 


45 








20 










25 










30 








Glu 


Gly 


Ser 


Ala 


Arg 


Tyr 


Arg 


Phe 


Lys 


Gly 


Phe 


Val 


Asp 


Thr 


Tyr 


His 








35 










4 0 










45 










Ala 


Val 


Arg 


Ser 


Ser 


Ser 


Pro 


Phe 


Asp 


Phe 


Met 


Ser 


Ser 


Arg 


Thr 


Arg 






50 








5 5 










60 










50 


Val 


Arg 


Gly 


Glu 


Leu 


Glu 


Arg 


Ser 


Phe 


Gly 


Asn 


Ser 


Lys 


Val 


Ala 


Val 




65 








7 0 










75 










8 0 




Ser 


Val 


Asn 


Ala 


Thr 


Tyr 


Asn 


Ala 


Leu 


Leu 


Lys 


Asp 


Glu 


Thr 


Gly 


Leu 












85 










90 










95 






Arg 


Leu 


Arg 


Glu 


Ala 


Phe 


Phe 


Glu 


His 


Gin 


Glu 


Glu 


His 


Trp 


Gly 


Leu 


55 




100 










105 










HO 








Arg 


Leu 


Gly 


Arg 


Gin 


He 


Val 


He 


Trp 


Gly 


Ala 


Ala 


Asp 


Gly 


Val 


Arg 






115 










120 










125 










He 


Thr 


Asp 


Leu 


li- 


Ser 


Pro 


Met 


Asp 


Met 


Thr 


Glu 


Phe 


Leu 


Ala 


Gin 






130 








135 










140 










60 


Asp 


Tyr 


Asp 


Asp 


ne 


Arq 


Met 


Pro 


Val 


Asn 


Ala 


Leu 


Arg 


Phe 


Ser 


Val 












150 










155 










160 




Phe 


Asn 


Glu 


Ser 


Met 


Lys 


Val 


Glu 


Val 


Val 


Val 


Leu 


Pro 


Val 


Phe 


Glu 












165 








170 










175 






Gly 


Tyr 


Arg 


Leu 


Pro 


Val 


Asp 


Pro 


Arg 


Asn 


Pro 


Trp 


Asn 


He 


Phe 


Ser 


65 




130 










185 










190 








Leu 


Ser 


Pro 


He 


Ala 


Gin 


Gly 


Met 


Asn 


He 


Val 


Trp 


Lys 


Glu 


Glu 


Ala 








195 










200 










205 










Gly 


Lys 


Pro 


Ala 


Phe 


Lys 


Val 


Ala 


Asn 


He 


Glu 


Tyr 


Gly 


Ala 


Arg 


Trp 






210 










215 










22 0 










70 


Ser 


Tt-.r 


Thr 


Leu 


Ser 


Gly 


He 


Asp 


Phe 


Ala 


Leu 


Ala 


Ala 


Leu 


His 


Thr 




2 25 










230 










235 










24 0 




Trp 


Asn 


Lys 


Met 


Pro 


Val 


He 


Glu 


Val 


Gin 


Gly 


He 


Val 


Pro 


Thr 


Glu 










245 










250 










255 






He 


He 


Val 


Ser 


Pro 


Arg 


Tyr 


Tyr 


Ara 


Met 


Gly 


Phe 


Val 


Gly 


Gly 


Asp 


75 








2 60 








265 










270 
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10 



15 



20 



25 



30 



35 



40 



45 



55 



60 



65 



Leu 


Sei- 


Val 
275 


Pro 


Val 


Gly 


Gin 


Phe 
280 


Val 


Phe 


Arg 


Gly 


Glu 

285 


Ala 


Ala 


Phe 


Asn 


ne 


Asp 


Lys 


His 


Phe 


Thr 


Tyr 


Lys 


Ser 


His 


Ala 


Glu 


Gin 


Glu 


Gly 




290 










295 










300 








Phe 


Gin 


Thr 


He 


Asn 


Trp 


Leu 


Ala 


Gly 


Ala 


Asp 


Trp 


Tyr 


Ala 


Pro 


Gly 


305 










310 










315 








320 


Glu 


Trp 


Met 


He 


Ser 
325 


Gl y 


Gin 


Phe 


Ser 


I'let 
330 


Gl u 


Ser 


He 


Phe 


Arg 
335 


Tyr 


Arg 


Asp 


Phe 


He 
340 


Ser 


Gin 


Arg 


Gin 


His 
3.J5 


Ser 


Thr 


Leu 


He 


Thr 
350 


Leu 


Asn 


Val 


Ser 


Lys 
355 


Lys 


Phe 


Fhe 


Gly 


Ser 
360 


Thr 


Leu 


Gin 


Leu 


Ser 
365 


Asp 


Phe 


Thr 


Tyr 


Tyr 
370 


Asp 


Leu 


Thr 


Gly 


Lys 

375 


Gly 


Trp 


Phe 


Ser 


Arg 
380 


Phe 


Ala 


Ala 


Asp 


Tyr 


Ala 


Leu 


Asn 


Asp 


Gl n 


li- 


His 


Leu 


Met 


Ala 


Gly 


Tyr 


Asp 


Trp 


Phe 


385 










390 










395 








400 


Ser 


Ser 


Lys 


Gly 


Ser 


Gly 


ne 


Phe 


Asp 


Arg 


Tyr 


Lys 


Asp 


Asn 


Ser 


Glu 










405 










^10 








415 




Leu 


Trp 


Pile 


Lys 
4 20 


Ala 


Arg 


Tyr 


Ser 


Phe 
425 

















(2) IMFORIIATIOII FOR SEQ ID NO: 4 87 

(i) SEQUEIICE CHARACTERISTICS: 

(A) LEJiGTH: 404 amino acids 
{B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAI-IE/KEY: mi sc_f ea ture 

(B) LOCATION 1. . .404 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487 



Met Ser Ser Cys Glu Val Ala Tyr Phe Ser Leu Lys Pro He Asp Leu 
1 5 10 15 

Gin Asn He Arg Glu Arg Asn His Ser Ser Asp He Ala Leu Ser Asn 

20 25 30 

Leu Leu Asp Asn Ser Asn Gin Leu Leu Ala Thr He Leu He Gly Asn 

35 40 45 

Asn Val He Asn Val Ala He Val He Leu Ser Asn Tyr Ala II- Glu 
50 55 g. 

50 Gin Thr Phe Val Phe Ser Ser Pro He He Gly Phe Leu He Gin Thr 
65 70 75 80 

He Leu Leu Thr Thr Val Leu Leu Leu Phe Gly Glu He Leu Pro Lys 

85 90 95 

Val Tyr Ala Arg Lys Asn Pro Leu Gin Tyr Ser Arg Phe Ser Ala Ala 

100 105 110 

Ala Met Ser Val He Tyr Lys He Leu Ser Pro Phe Ser Lys Leu Leu 

11^ 120 125 

Val Lys Ser Thr Gly He Val Thr Arg Gly He Ser Lys Lys L'-'s Tyr 

130 135 140 

Asp Met Ser Val Asp Glu Leu Ser Lys Ala Val Ala Leu Thr Thr Thr 
1^- 150 155 160 

Glu Gly Glu Pro Glu Glu Lys Glu Met He Asn Glu He He Lys Phe 

165 170 175 

Tyr Asn Lys Thr Ala Cys Glu He Met Val Pro Arg He Asp He Val 

180 185 190 

Asp Val Asp Leu Ser Trp Pro Phe Arg Lys Met Leu Asp Phe Val Val 

195 200 205 

Ser Ser Gly Tyr Ser Arg Leu Pro Val Ser Glu Gly Ser Glu Asp Asn 
7n 215 220 

/U He Lys Gly Val He Tyr He Lys Asp Leu He Pro His Met Asp Ly- 
225 230 235 240 

Gly Asp Glu Phe Asp Trp His Pro Leu He Arg Lys Ala Tyr Phe Val 

245 250 255 

Pro Glu Asn Lys Arg He Asp Asp Leu Leu Glu Glu Phe Ara Ala Asn 
2 60 2 65 27 0 



75 
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30 





Val 


His 


Val 


Ser 


ne 


Val 


Val 


Asp 


Glu 


Phe 


Gly 


Gly 


Thr 


Cys 


Gly 




275 










280 










285 








Leu 


He 


Thr 


Met 


Glu 


Asp 


He 


Leu 


Glu 


Gl u 


He 


Val 


Gly 


Glu 


He 


Thr 




290 










295 










300 








oX j 


Asp 


Glu 


Tyr 


Asp 


Glu 


Glu 


Glu 


Leu 


Pro 


Phe 


Lys 


Va 1 


Leu 


Gly 


Asp 


305 






31 0 










J i O 










320 


Ser 


Tyr 


Leu 


Phe 


Glu 


Gly 


Lys 


Thr 


Ser 


Leu 


Ser 


Asp 


Val 


Arg 


His 


Tyr 










325 










330 










335 




Leu 


Asp 


Leu 


Pro 


Glu 


Asn 


Ala 


Phe 


Gly 


Glu 


Leu 


Gly 


Asp 


Glu 


Val 


Asp 






340 










345 










350 






Thr 


Leu 


Ser 


Gly 


Leu 


Phe 


Leu 


Glu 


He 


Lys 


Gin 


Glu 


Leu 


Pro 


His 


Val 






355 










360 










365 








Gly 


A^p 


Thr 


Ala 


Val 


Tyr 


Glu 


Pro 


Phe 


Arg 


Phe 


Gin 


Val 


Thr 


Gl n 


net 




370 










375 










380 










Asp 


Lys 


Arq 


Arg 


li- 


lie 


Glu 


He 


Lys 


He 


Phe 


Pro 


Phe 


Glu 


Arg 


Thr 


385 




390 










395 










A 00 


Trp 


Glu 


Val 


Gl u 



























10 



15 



20 (2) I M FORMAT I OI J FOR SEQ ID UO:4 8B 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LEMGTH: 260 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 

(ii) llOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGIMAL SOURCE: 

(A) ORGANIStl: Porphyromonas gingival is 



iix) FEATURE: 
35 (A) UAI IE/KEY: misc_feature 

(B) LOCATIOII 1 ... 260 

(xi) SEQUENCE DESCRIPTIOM: SEQ ID HO: 4 88 

40 11-t Lys Leu Leu Leu Tyr Leu Leu Leu Val Leu Ser Thr Leu Ser Pro 
15 10 15 

net Tyr Ser Gin Met Leu Phe Ser Glu Asn Leu Thr Met Asn He Asp 

2 0 2 5 30 

Ser Thr Lys Thr He Gin Gly Thr He Leu Pro Val Leu Asp Phe Lys 
45 35 40 45 

Thr Glu L:i-s Glu Asn Val Phe Thr Phe Lys Asn Thr Ala Asn Leu Asn 

50 55 60 

Leu Leu He Lys His Gly Gin Val He Asn Leu He Asn Lys Leu Glu 
65 7 0 7.'^ 8 0 

50 Phe Ser Thr Tyr Gly Asn Lys Val Thr Val Ser Gly Gly Tyr Val His 

85 90 95 

Thr Glu Tyr Arg Tyr Leu Leu His His Val Phe Glu Val Tyr Pro Tyr 

100 105 110 

Val Glu Ser Gin Trp Ala Glu Ser Arg Gly Met Lys Tyr Lys Val Ser 
55 115 120 125 

Thr Gly Leu Gin Ser Arg Tyr Arg Leu Val Asn Ser Asp Asn Cys Leu 

130 135 140 

I!-t Phe Ala Thr Leu Gly Val Phe Phe Glu Phe Glu Lys Trp Glu Gin 
145 150 155 160 

60 Pro Ala Thr Ser Leu Phe Ala Gly Thr Tyr Ala Tyr Ser Arg Ser lie 

165 170 175 

Lys Ser His Leu Ser He Ser Phe Arg His Arg Leu Gly Glu His Trp 

180 • 185 190 

Glu Phe Thr Thr Thr Ala He His Gin Gly Lys Pro Asp Ser Tyr Phe 
65 195 200 205 

Lys Lys Ala Arg Phe Gly Gly Ala He Asp Leu Lys Tyr His He Thr 

210 215 220 

Pro Thr He Gly He Ara Gly Ala Tyr Arg He He Tyr Asp Thr Ala 
225 230 235 240 

70 Pro He Val Pro Val Arg Lys Asp Tyr Asn Thr Val Asp Val Gly He 

245 250 255 

Asp He Ser Phe 
260 

75 (2) IllFORnATIOn for SEQ id MO: 489 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1J_> 



wo 99/29870 



PCT/AU98/01023 



440/ 490 



10 



15 



20 



25 



40 



45 



55 



65 



75 



(i) SEQUEMCE CHARACTERISTICS: 

(A) LEIJGTH: 834 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear . 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIIIAL SOURCE: 

(A) ORGAIJI SH: Porphyromonas gingivaiis 

ii>:) FEATURE: 

(A) MAME/KEY: misc__f eature 

(B) LOCATION 1 ... 834 

(xi) SEQUEHCE DESCRIPTION: SEQ ID lJO:4 89 



Met Lys Arg He Val Leu Ser Ser Phe Leu Phe Val Leu Ser He Leu 
1 ^ 10 15 

Ser Leu Met Ala Gin Asn Asn Thr Leu Asp Val His He Ser Gly Thr 

20 25 30 

He Lys Asp Ala Ser Ser Gly Glu Pro Val Pro Tyr Ala Thr Val Ser 

35 40 45 

He Arg Leu Thr Gly Ala Asp Thr Thr Gin Val Phe Arg Gin Val Thr 

-C- 55 60 

Asp Gly Asn Gly Tyr Phe Val He Gly Leu Pro Ala Ala Pro Ser Tyr 
on "^^ 75 80 

CJU His Leu Thr Ala Ser Phe Val Gly Met Lys Thr His Thr Met Gin He 

85 90 95 

Ser Arg Gly Asn Gly Gin His Asp He Lys Ser He Asp He Ser Leu 

100 105 110 

Glu Ser Glu Asp Lys Gin Leu Ser Thr Val Thr Val Ser Ala Ala Ara 
35 115 120 125 

Pro Leu Val Lys Met Glu He Asp Arg Leu Ser Tyr Asn Met Lys Asp 

130 135 140 

Asp Pro Ala Ala Lys Thr Asn Asn Leu Leu Glu Met Leu Arg Asn Val 
l'*^ ISO 155 160 

Pro Leu Val Thr Val Asp Gly Gin Gly Asn He Gin Val Lys Gly Ser 

165 170 175 

Ser Asn Phe Lys He His Leu Asn Gly Arg Pro Ser Thr Met Val Ser 

180 185 190 

Ser Asn Fro Lys Glu Val Phe Arg Ser He Pro Ala His Thr He L'-s 

155 200 205 

Arg Val Glu Val He Thr Asp Pro Gly Val Lys Tyr Asp Ala Glu Gl-- 

210 215 220 

Thr Ser Ala He Leu Asp He Val Thr Glu Glu Gly Lys Lys Leu Glu 
t-n 235 24n 

OU Gly Tyr Ser Gly Ser He Thr Ala Ser Val Ser Asn Asn Pro Thr Ala 

245 250 255 

Asn Gly Ser He Phe Leu Thr Ala Lys Ser Gly Lys Val Gly Leu Thr 

260 265 270 

Thr Asn Tyr Asn Tyr Tyr Gly Gly Lys Asn Lys Gly Ser Arg Tyr Ph- 

275 280 285 

Thr Glu Arg Thr Thr Ser Met Leu Gin Thr He Glu Glu Gly Lys Gl- 

295 300 
Gin Glu Thr Phe Gly G3 y His Phe Gly Asn Ala Leu Leu Ser Phe Glu 
RH 315 320 

DU He Asp Ser Leu Asn Leu Phe Thr Val Gly Gly Asn Val Arg Leu Trp 

325 330 335 

L.lu Met Thr Thr Asp Arg Asn Ser Val Glu Lys Ser Phe Ala Gly S^r 

3-1^ 345 350 

Asn Leu Met Ser Tyr He Asp Arg Lys Leu Lys Thr Gin Met Asp Ala 

355 360 365 

Gly Ser Tyr Glu Leu Asn Ala Asp Tyr Gin His Ser Thr Arg Leu Pro 

3'^^ 375 380 

Gly Glu Leu Leu Thr Val Ser Tyr Arg Phe Thr His Asn Pro Asn Asn 

70 c^^ . 395 400 

/U Ser <^lu Thr Phe He Asp Gin Trp Lys Arg Asp Pro Leu Asn Thr Ala 

'IQS 410 415 

Asn Thr He Gin Tyr Ala Gly Gin His Ser Lys Ser Asp Ala Gl- M-t 

420 42b 430 

Asp .olu His Thr Ala Gin Val Asp Tyr Thr Ara Pro Leu Gly Gin Ala 
435 440 445 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCiD; <WO 9929870A1_I_> 



wo 99/29870 PCT/AU98/01023 

441/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



60 



65 



His 


Ser 


Leu 


Glu 


Ala 


Gly 


Leu 


Lys 


Tyr 


He 


Tyr 


Arg 


His 


Ala 


Thr 


Ser 




450 










455 










4 60 










Asp 


Pro 


Leu 


Tyr 


Glu 


He 


Arg 


Pro 


Ser 


Glu 


Asp 


Ala 


Pro 


Trp 


Gin 


Pro 


4 65 










410 










475 










480 


Gl y 


Ser 


Leu 


Tyr 


Ala 


Gin 


Asn 


Pro 


Ser 


Asn 


Gly 


Lys 


Phe 


Arg 


His 


Asp 








485 










4 90 










495 




Gin 


Tyr 


He 


Gly 


Ala 


Ala 


Tyr 


Ala 


Gly 


Tyr 


Asn 


Tyr 


Arg 


Lys 


Asp 


Gin 






500 










505 










510 






Tyr 


Ser 


Leu 


Gin 


Thr 


Gly 


Leu 


Arg 


Val 


Glu 


Ser 


Ser 


Arg 


Leu 


Lys 


Ala 






515 










520 










525 








Leu 


Phe 


Pro 


Glu 


Asn 


Ala 


Ala 


Ala 


Asp 


Phe 


Ser 


His 


Asn 


Ser 


Phe 


Asp 




530 










535 










540 










Trp 


Val 


Pro 


Gin 


Leu 


Thr 


Leu 


Gly 


Tyr 


Thr 


Pro 


Ser 


Pro 


Met 


Lys 


Gin 












550 










555 










560 


Leu 


Lys 


Leu 


Al ci 


Tyr 


Asn 


Phe 


Arg 


He 


Gin 


Arg 


Pro 


Ala 


He 


Gly 


Gin 










565 










570 










575 




Leu 


Asn 


Pro 


Tyr 


Arg 


Leu 


Gin 


Thr 


Asn 


Asp 


Tyr 


Gin 


Val 


Gin 


Tyr 


Gly 








580 










505 










590 






Asn 


Pro 


Asp 


Leu 


Lys 


Ser 


Glu 


Lys 


Arg 


His 


His 


Va 1 


Gly 


Leu 


Ser 


Tyr 






595 










600 










605 








A.-n 


Gin 


Tyr 


Gly 


Ala 


Lys 


Val 


Met 


Leu 


Thr 


Ala 


Ser 


Leu 


Asp 


Tyr 


Asp 




610 










615 










62 0 










Phe 


Cys 


Asn 


Asn 


Ala 


He 


Gin 


Asn 


Tyr 


Thr 


Phe 


Ser 


Asp 


Pro 


Ala 


Asn 


625 










6 30 










635 










6 4 0 


Pro 


Asn 


Leu 


Phe 


His 


Gin 


Thr 


Tyr 


Gly 


Asn 


He 


Gly 


Aro 


Glu 


His 


Ser 










64 5 










650 










655 




Pho 


Ser 


Leu 


Asn 


Thr 


Tyr 


Ala 


Met 


Tyr 


Thr 


Pro 


Ala 


Val 


Trp 


Val 


Arg 








660 










665 










670 






lie 


I'let 


Leu 


Asn 


Gly 


Asn 


He 


Asp 


Arg 


Thr 


Phe 


Gin 


Lys 


Ser 


Glu 


Ala 






675 










680 










685 








Leu 


Gly 


He 


Asp 


Val 


Asn 


Ser 


Trp 


Ser 


Gly 


Met 


Val 


Tyr 


Ser 


Gly 


Leu 




690 










695 










700 










Met 


Phe 


Thr 


Leu 


Pro 


Lys 


Asp 


Trp 


Thr 


Val 


Asn 


Leu 


Phe 


Gly 


Gly 


Tyr 


7 05 










710 










715 










720 


Tyr 


His 


Gly 


Gly Arg 


Ser 


Tyr 


Gin 


Thr 


Lys 


Tyr 


Asp 


Gly Asn 


Val 


Phe 








725 










730 










735 




Asn 


Asn 


He 


Gly 


He 


Ala 


Lys 


Gin 


Leu 


Phe 


Asp 


Lys 


Lys 


Leu 


Arg 


Val 








740 










745 










750 






Ser 


Leu 


Ser 


Ala 


Asn 


As n 


He 


His 


Ala 


Lys 


Tyr 


Ser 


Thr 


Trp 


Lys. 


Ser 






755 










760 










765 








Arg 


Thr 


He 


Gly 


Asn 


Gly 


Phe 


Thr 


He 


Tyr 


Ser 


Glu 


Asn 


Ala 


Gly 


He 


770 










775 










780 










Gin 


Arg 


Ser 


Val 


Ser 


Leu 


Ser 


Leu 


Thr 


Tyr 


Ser 


Phe 


Gly 


Lys 


Met 


Asn 


785 








790 










795 










800 


Thr 


Gin 


Val 


Arg 


Lys 


Val 


Glu 


Arg 


Thr 


He 


Val 


Asn 


Asp 


Asp 


Leu 


Lys 










805 










810 










815 




Gin 


Thr 


Ser 


Ser 


Gin 


Gly 


Gin 


Gin 


Gly 


Gly 


Gly 


Gin 


Gly 


Asn 


Pro 


Thr 








8 20 










825 










830 






Gly 


Asn 































(2) IN FORMAT I Oli FOR SEQ ID 110:490 



ti) SEQUEHCE CHARACTERISTICS: 
55 (A) LEIIGTH: 39 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGV: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

( vi ) ORIG J 1 lAL SOURCE : 

'(A) ORGAtilSM: Porphyromonas gingivalis 

(i::) FEATURE: 

(A) HA) IE/KEY : misc_feature 
{ B) LOCATIOn 1 ... 399 



70 (::il SEQUEHCE DESCRI PTIOJ J : SEQ ID HO: 490 

Met Arg Leu Ser Ala He Leu He Ala Leu He Val Met Leu Pro Ala 

1 ' 5 10 15 

Val Leu Ser Gly Gin His Tyr Tyr Ser Met Ala Gly Glu Arg Leu Glu 

75 20 25 30 



BNSDOCID: <WO 9929870A1_I_> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



PCT/AU98/01023 



442/490 



65 





Asp 


Ser 


He 


Arg 


Pro 


Asn 


Glu 


Leu 


Ser 


Ala 


Ser 


He 


Arg 


Ser 


Ala 






35 










40 










45 








Leu 


Phe 


Phe 


Arg 


Asn 


Asn 


Glu 


Tyr 


Asn 


Ala 


Arg 


Ser 


Val 




Gly 


Tyr 




50 










55 










60 










Thr 


Leu 


Pro 


Gly 


Ala 


Arg 


Val 


Ser 


Ala 


Phe 


Ala 


Ser 


Tyr 


Ser 


Leu 


Pro 


65 










70 










75 










80 


Al a 


Ala 


Hi£' 


Gly 


Val 


Lys 


Leu 


Ser 


Leu 


Gly 


Val 


Ser 


Thr 


Leu 


Asn 


Tyr 










85 










90 










95 




Trp 


Giy 


Al a 


Ser 


Arg 


Tyr 


Pro 


Ala 


Gly 


He 


Al a 


Tyr 


Ser 


Asp 


Leu 


Pro 








1 00 










105 










110 






Tyr 


Trp 


Tlir 


A£-p 


T y r 


Asn 


Asp 


Tyr 


Val 


Arg 


Leu 


Arg 


He 


Leu 


Pro 


Ty^ 






115 










120 










125 








Va 1 


Gin 


Al a 


Met 


Leu 


Lys 


Pro 


Thr 


Ala 


Thr 


Thr 


Ala 


Leu 


Met 


Leu 


Gly 




130 










1 35 










i 4 0 










A5n 


lie 


Ala 


G 1 y 


Gly 


Thr 


Ala 


His 


Gly 


Leu 


He 


Gl u 


Pro 


He 


Tyr 


Asn 


14 5 










1 50 










155 










160 


Px"o 


("'111 




ep 


T £31 1 


Thr 


Al 3 


Asp 


Pro 


Glu 


Ala 


o 1 y 


val 


Gin 


Phe 


Arg 










165 










17 0 










17 5 




Gly 


A^p 


Trp 


Thr 


Arg 


P)ie 


Arg 


14et 


Asp 


Va 1 


T rp 


Val 


Ae n 


Trp 


!-Iet 


Ser 








180 










185 










1 yO 






I let 


lie 


PJie 


Lys 


Asn 


Asp 


Asn 


His 


Gin 


Glu 


Ser 


Phe 


Val 


Phe 


Gly 


Leu 






195 










200 










205 








Ser 


Thr 


Thr 


Ser 


Lys 


Leu 


Leu 


Ser 


Gly 


Glu 


Gly 


Lys 


Trp 


Arg 


Leu 


Glu 




2 1 0 










215 










220 










Leu 


Pro 


Leu 


Gin 


Ala 


He 


Ala 


Thr 


His 


Arg 


Gly 


Gly 


Glu 


Tyr 


Asn 


Trp 


2 5 










230 










235 










2 4 0 


Ala 


Gin 


Giri 


Asp 


Thr 


Val 


His 


Thr 


Trp 


Val 


Asn 


Gly 


Ala 


'v" a 1 


Gly 


Leu 










245 










250 










255 




Ly^ 


Leu 


Ser 


Tyr 


Arg 


Pro 


Arg 


Thr 


Asp 


Lys 


Pro 


Met 


Gin 


He 


Trp 


Gly 








260 










265 










27 0 






Ser 


Ala 


Tyr 


'21 y 


Val 


Ala 


Ala 


Leu 


Ser 


Ser 


Gly 


Gly 


Tyr 


Fhe 


Pro 


Tyr 






275 










280 










285 








Glu 


Arg 


Gly 


Trp 


Gly 


Gly 


Tyr 


Leu 


Ser 


Leu 


Gly 


Met 


Asp 


Leu 


Glu 


His 




290 










2 95 










300 










Phe 


Ala 


Phe 


Arg 


Thr 


Asp 


Tyr 


Trp 


Tyr 


Gly 


Arg 


His 


Tyr 


V3l 


Ser 


Pro 


305 










310 










315 










320 


Phe 


Ala 


Ala 


Pro 


Phe 


Ala 


Asn 


Ser 


Leu 


Thr 


Tyr 


Asp 


Lys 


Gin 


Pro 


Leu 










325 










330 










335 




Thr 


As-n 


Gly 


Trp 


Gly 


Asp 


Tyr 


He 


Arg 


Leu 


Tyr 


Ala 


Asp 


Tyr 


Ser 


Trp 








340 










345 










350 






Arg 


net 


Ala 


Arg 


Ser 


Val 


Ser 


Leu 


Ala 


Ala 


Val 


Ala 


Arg 


Val 


Trp 


Phe 






355 










360 










365 








Gin 


Pro 


Ser 


Asp 


Arg 


Phe 


Ala 


Met 


Ser 


His 


Ala 


Leu 


Glu 


Leu 


Thr 


Met 




370 










375 










380 










Arg 


He 


As-p 


Pro 


Lys 


Phe 


Pro 


He 


Ala 


Phe 


Leu 


Lys 


Gly 


Asn 


His 




385 










390 










3 95 












(C) 


II I FORMAT I on 


FOR 


SEQ 


ID MO: 4 91 

















10 



15 



20 



25 



30 



35 



40 



45 



50 (i) SEQUEliCE CHARACTERISTICS: 

(A) LEHGTH: 382 amino acids 
{B) TYPE: amino acid 
(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: protein 

Uii) HYPOTHETICAL: YES 

(vi) OR I GI UAL SOURCE: 
60 (A) ORGAHISM: Porphyromonas gingival is 



(i.x) FEATURE: 

(A) MAI IE/ KEY: misc_f eature 
(E) LOCATION 1 . . . 382 

(xH SEQUEMCE DESCRIPTIOH: SEQ ID MO: 4 91 



Met Asn Lys Ser Leu Leu Ser Leu Ala Cys Leu He Leu Cys Gly Met 
1 5 10 15 

70 Pro Ala He Ala Gin Gin Thr Gly Pro Ala Glu Arg Ser Gly Glu Pro 
2 0 25 3'- 

Ser Leu Ala Glu Arg Val Phe Gly Leu Glu Gin Lys Gin Lys Lys Leu 

35 40 45 

Lys Val Tyr Leu Gly He Gin Ser Phe Tyr Asp Gin Pro Leu Val Asp 
75 50 55 60 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 PCT/AU98/01023 

443/490 





Asp 


Giu 


Ser 


His 


He 


Gly 


His 


Phe 


Lys 


Val 


Gin 


Glu 


Leu 


Arg 


Met 


Ser 




65 










70 










75 










80 




Ala 


His 


Gly 


Glu 


Leu 


Asn 


Arg 


His 


Leu 


Ser 


Phe 


Asp 


Trp 


Arg 


Gin 


Arg 










85 










90 










95 




5 


Leu 


Asn 


Arg 


Ala 


Ala 


Asp 


Gly 


Tlir 


Ser 


Phe 


Ala 


Asp 


Asn 


Leu 


Ser 


Asn 








100 










105 










110 








Ala 


He 


Asp 
115 


He 


Ala 


Gly 


Val 


Asp 
120 


Trp 


His 


Pro 


Asn 


Asp 

125 


Lys 


Val 


Ser 




Phe 


Phe 


Phe 


Gly 


Arg 


Gin 


Tyr 


Ala 


Arg 


Phe 


Gly 


Gly 


He 


Glu 


Tyr 


Asp 


10 




130 










135 










14 0 












Met 


Asn 


Pro 


Val 


Glu 


He 


Tyr 


Gin 


Tyr 


Ser 


Asp 


Leu 


Val 


Asp 


Tyr 


Met 




1^5 










150 










155 










1(30 




Thr 


Cys 


Tyr 


Thr 


Ser 


Gly 


Val 


Asn 


Phe 


Ala 


Trp 


Asn 


Phe 


His 


Pro 


Glu 








165 










170 










175 




15 


Gin 


Gin 


Leu 


Gin 
180 


Leu 


Gin 


Val 


Leu 


Asn 
185 


Ala 


Tyr 


Asn 


Asn 


Arg 
190 


Phe 


Ala 




Asp 


Arg 


Tyr 


His 


Val 


Tiir 


Pro 


Asp 


Val 


Ala 


Thr 


Ala 


Thr 


Ser 


Tyr 


Pro 






195 










200 










205 










Leu 


Leu 


Tyr 


Ser 


Ala 


Gin 


Trp 


Asn 


Gly 


Thr 


Leu 


Leu 


Gly 


Gly 


Ala 


Leu 


20 




210 








215 










220 












His 


Met 


Arg 


Tyr 


Ala 


Val 


Ser 


Met 


Ala 


His 


Gin 


Ala 


Gin 


Glu 


Arg 


Asn 




2 25 








230 










235 










240 




Met 


Trp 


Tyr 


Phe 


Thr 
245 


Ala 


Gly 


Asn 


Leu 


Phe 
250 


Asn 


Pro 


Gly 


Lys 


Arg 

255 


He 


25 


Asn 


Gly 


Tyr 


Leu 

2 60 


Asp 


Leu 


Thr 


Tyr 


Ser 
265 


He 


Glu 


Gly 


Leu 


Asp 
270 


Asp 


Lys 




01 y 


He 


Met 
275 


Thr 


Ala 


Arg 


Tyr 


Gly 
280 


Lys 








Leu 
285 


Thr 


Asp 


Val 




Ly^ 


Tyr 


Tyr 


Ala 


Leu 


Val 


Ser 


Lys 


Trp 


Asn 


Phe 


Arg 


He 


Phe 


Asp 


Gin 


30 


2 90 










295 










300 












Val 


Asn 


Leu 


Phe 


Leu 


Lys 


Gly 


Met 


Tyr 


Glu 


Asn 


Gly 


Tyr 


Ala 


Pro 


Ala 




305 










310 










315 










32 0 




Gin 


Tyr 


Gly 


Glu 


Ser 


Ser 


His 


Thr 


Arg 


His 


Ser 


Tyr 


Gly 


Tyr 


Met 


Gly 










325 










330 










335 




35 


Gly 


Val 


Glu 


Tyr 
340 


Tyr 


Pro 


Thr 


Glu 


Thr 
345 


Asn 


Phe 


Arg 


Leu 


Phe 
350 


Val 


Thr 




Tyr 


He 


Gly 


Arg 


His 


Tyr 


Arg 


Tyr 


Ser 


Ala 


Thr 


Glu 


Thr 


Glu 


Ser 


Thr 






355 










360 










365 










Asn 


Ala 


Leu 


Arg 


Ala 


Gly 


Leu 


He 


Tyr 


Gin 


He 


Pro 


Phe 


Leu 






40 




37 0 










37 5 










380 











50 



55 



70 



75 



(2) IMFORMATIOII FOR SEQ ID 1J0:4 92 

(i) SEQUEMCE CHARACTERISTICS: 
45 (A) LEMGTH: 222 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIIiAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 



(ix) FEATURE: 

(A) NAI^E/P:eY: mis cofeature 

(B) LOCATIOll 1 . , . 222 

60 (Mi) SEQUEllCE DESCRIPTIOIi: SEQ ID 110:4 92 



65 



Met 


Lys 


His 


Leu 


Phe 


Lys 


Ser 


Thr 


Leu 


Val 


Leu 


Leu 


Cys 


Ala 


Leu 


Ser 


1 






5 










10 










15 




Phe 


Ser 


Gly 


Thr 


Tyr 


Thr 


Phe 


Ala 


Gin 


Glu 


Asn 


Asn 


Thr 


Glu 


Lys 


Ser 






20 










25 










30 






Arg 


Phe 


Asp 


Phe 


Ser 


Val 


Arg 


Leu 


Gly 


Gin 


Gly 


Tyr 


He 


Ala 


Gly 


Ser 






35 










40 










45 








Thr 


Thr 


Asn 


Leu 


Met 


Tyr 


Gly 


Tyr 


Thr 


Ser 


Ala 


Asn 


Asp 


Arg 


Leu 


Leu 




50 










55 










60 










Ser 


'31 y 


Ala 


He 


Tyr 


Leu 


Gly 


Leu 


Thr 


Pro 


Ser 


Lys 


Lys 


Glu 


Asn 


Ala 


65 










70 










75 










80 


Thr 


Gly 


Val 


Ala 


Phe 


Arg 


Phe 


Leu 


Ser 


Pro 


Ser 


Pro 


Gly 


Tyr 


Tyr 


Val 










85 










90 










95 




Asp 


He 


Ser 


Gly 


Lys 


Glu 


Asn 


Thr 


Leu 


Asn 


Tyr 


Ala 


Phe 


Tyr 


Val 


Val 






100 








105 










110 
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15 



35 



40 



45 



50 



55 



60 



65 



70 



75 
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Gly 


Ala 


Tyr 
1 1 -J 


Asn 


Arg 


He 


Ala 


He 


Pro 


He 


Arg 


Pro 


He 
12 5 


Lys 


Asn 


Phe 




Asn 


Phe 
130 


lie 


Phe 


Ser 


Thr 


Glu 
135 


Val 


Gly 


Met 


Ala 


Trp 
140 


riet 


Ser 


Arg 


His 


5 


Glu 
1-15 


Gin 


He 


Tyr 


Asn 


Ser 
150 


Thr 


Ser 


Gin 


Thr 


Trp 
155 


Asp 


Lys 


Gin 


Arg 


Lys 

160 




Ser 


Arg 


Ser 


Gly 


Leu 
165 


Asp 


Phe 


Gly 


Leu 


Gly 
170 


ilet 


His 


Leu 


Gin 


Unk 
175 


His 


10 


He 


As 11 


Lys 


Thi: 


Val 


Tyr 


Phe 


Met 


Ala 


Gly 


Thr 


Asp 


Leu 


Thr 


Ser 


Cys 








180 










185 










190 








Met 


Phe 


Gly 
195 


hys 


Arg 


He 


Asn 


Asp 
200 


Tyr 


Gin 


Gin 


Lys 


Asp 
205 


Arg 


Thr 


Phe 




He 


Ala 

210 


Leu 


He 


Asp 


Asn 


Ser 
215 


He 


Gly 


He 


Gly 


Leu 

220 


Asn 


Leu 







(2) Hi FORMAT I Oil FOR SEQ ID HO: 4 93 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 391 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGV: linear 

(ii) MOLECULE TYPE: protein 

25 (iiii HYPOTHETICAL: YES 

(vi) ORIGIMAL SOURCE: 

(A) ORGAIIISM: Porphyroinonas gingival i 5 

30 (ix) FEATURE: 

( A ) 1 1 AI-IE / KEY : mi s c__f e a t u r e 

(B) LOCATION 1 ... 391 



SEQUEMCE DESCRIPTIOM: SEQ ID MO: 4 93 



Met 


He 


He 


Lys 


Lys 


Met 


Leu 


Lys 


Asn 


Lys 


Leu 


Ala 


Pro 


Leu 


Ala 


He 


1 








5 










10 










15 




Leu 


Phe 


Leu 


Phe 


Ala 


Pro 


Lys 


Ala 


Met 


Lys 


Ala 


Gin 


Glu 


Gin 


Leu 


Asn 








20 










25 










30 






Val 


Val 


His 


Thr 


Ser 


Val 


Pro 


Ser 


Leu 


Asn 


He 


Ser 


Pro 


Asp 


Ala 


Arg 






35 










40 










45 








Ala 


Ala 


Gly 


Met 


Gly 


Asp 


He 


Gly 


Val 


Ala 


Thr 


Thr 


Pro 


Asp 


Ala 


Tyr 




50 










55 










60 










Ser 


Gin 


Tyr 


Trp 


Asn 


Pro 


Ser 


Lys 


Tyr 


Ala 


Phe 


Met 


Asp 


Thr 


Lys 


Ala 


65 










70 










75 










80 


Gly 


He 


Ser 


Phe 


Ser 


Tyr 


Thr 


Pro 


Trp 


Leu 


Ser 


Lys 


Leu 


Val 


Asn 


Asp 










85 










90 










95 




He 


Ala 


Leu 


Met 


Gin 


Met 


Thr 


Gly 


Phe 


Tyr 


Lys 


Leu 


Gly 


Thr 


Asp 


Glu 








100 










105 










110 






Asn 


Gin 


Ala 


He 


Ser 


Ala 


Ser 


Leu 


Arg 


Tyr 


Phe 


Thr 


Leu 


Gly 


Lys 


Leu 






115 










120 










125 








Glu 


Thr 


Phe 


Asp 


Glu 


Leu 


Gly 


Glu 


Ser 


Met 


Gly 


Glu 


Ala 


His 


Pro 


Asn 




130 










135 










140 










Glu 


Phe 


Ala 


Val 


Asp 


Leu 


Gly 


Tyr 


Ser 


Arg 


Gin 


Leu 


Ser 


Glu 


Asn 


Phe 


145 










150 










155 










160 


Ser 


Met 


Ala 


Val 


Ala 


Leu 


Arg 


Tyr 


He 


Arg 


Ser 


Asp 


Gin 


Ser 


Thr 


His 










165 










170 










175 




Asn 


Thr 


Gly 


Glu 


Asn 


Gin 


Ala 


Gly 


Asn 


Ala 


Phe 


Ala 


Ala 


Asp 


He 


Ala 








180 










185 










190 






Gly 


Tyr 


Leu 


Gin 


Lys 


Tyr 


Val 


Leu 


Leu 


Gly 


Asn 


Ala 


Glu 


Ser 


Leu 


Trp 






195 










200 










205 








Ser 


Leu 


Gly 


Phe 


Asn 


Val 


Lys 


Asn 


He 


Gly 


Thr 


Lys 


He 


Ser 


Tyr 


Asp 




210 










215 










22 0 










Gly 


Gly 


Val 


Thr 


Ser 


Phe 


Phe 


He 


Pro 


Thr 


Ser 


Leu 


Asn 


Leu 


Gly 


Thr 


225 










230 










235 










24 0 


Gly 


Leu 


Leu 


Tyr 


Pro 


lie 


Asp 


Asp 


Tyr 


Asn 


Ser 


li- 


Asn 


Phe 


Asn 


Leu 










24 5 










2 50 










255 




Glu 


Leu 


Ser 


Lys 


Leu 


Leu 


Val 


Pro 


Thr 


Fro 


Pro 


ne 


Met 


Asp 


Gin 


Asn 








260 










265 










270 






Asp 


Gin 


Ala 


Gly 


Tyr 


Glu 


Ala 


Ala 


Leu 


Lys 


Lys 


Tyr 


Gin 


Glu 


Thr 


Ser 






275 










280 










285 








Ser 


He 


Ser 


Gly 


He 


Phe 


Ser 


Ser 


Phe 


Gly 


Asp 


Ala 


Pro 


Gly Gly 


Leu 




290 










295 










300 










Lys 


Glu 


Glu 


Phe 




Glu 


He 


Thr 


Trp 


Gly 


Leu 


Gly 


Ala 


Glu 


Tyr 


Ser 


305 










310 










315 










320 
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20 



25 





Asp 


Asp 


Lys 


Phe 


Phe 


Val 


Arg 


Ala 


Gly 


Tyr 


Ser 


Tyr 


Leu 


His 


Pre 






325 










330 










335 




Tiii- 


Lys 


Gly 


Asn 


Leu 


Gin 


Tyr 


Phe 


Thr 


Ala 


Gly 


Ala 


Gly 


Phe 


hya 


Met 






3 AO 










3^5 










350 






A^fn 


lie 


Phe 


Arg 


lie 


Asp 


Ala 


Ser 


Tyr 


Leu 


Leu 


Ser 


Thr 


He 


Gin 


Ser 






355 










360 










365 








Asn 


Pro 


Leu 


A^p 


Gin 


Thr 


Leu 


Arg 


Phe 


Thr 


Leu 


Ala 


Phe 


Asp 


Met 


Asp 




370 










375 










380 










Gly 


Leu 


Arg 


Asn 


Leu 


Phe 


His 





















10 385 390 

(2) IIIFORMATIOM FOR SEQ ID MO:4 9<; 

(i) SEQUEIICE CHARACTERISTICS: 
15 (A) LEIIGTH: •J46 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIIJAL SOURCE: 

(A) ORGAHISM: Porphyromona 3 gingival if 

{ix) FEATURE: 

(A) llAME/KEY: misc_feature 
( B ) LOCATIOI i 1 ... 4 4 6 



30 (:-:i) SEQUEriCE DESCRI PTIOI I : SEQ ID nO:4 94 

Met Lys Tlir Thr Val Gin Gin He He Leu Cys Leu Ala Leu Met Met 
15 10 15 

Ser Gly Val Leu Gly Gly Asn Ala Gin Ser Phe Trp Glu Glu He Ala 
35 20 25 30 

Pro Pro E'he He Ser Asn Glu Pro Asn Val Lys Tyr He He Pro Asn 

35 40 45 

Met Gly He Asp Ser Lys Gly Thr He Tyr Val Thr Val Thr Lys Arg 
50 55 60 

40 He Gin Gin Gly Ala Asn Tyr Thr Ser Glu Gin Leu Gly Met Tyr Tyr 
65 70 75 80 

Arg Pro Leu Gly Asp Asn Glu Gin Trp Trp Lys His Asp Pro Tyr Phe 

85 90 95 

Asp A^p Lys He Val Ala Asp He Gin Thr Asp Ala Tyr Gly Arg Val 
45 100 105 HO 

Tyr Val Cys Thr Thr Ser Ser Arg Asp Gin Glu Tyr Gin Leu Tyr He 

115 120 125 

Asn Glu Gin Asn Glu Trp Arg Cys He Phe Lys Thr Ser Val Ser Thr 
130 135 140 

50 Tyr Glu His Gly Met Ala Val Phe Arg Ser Ser Thr Gly Val Thr Tyr 
145 150 155 160 

He Gly Thr Arg His His He Phe Ala Ser Gly Val Asn Asp Phe Glu 

' 165 170 17 5 

Phe Asn Thr He Tyr Glu Asp Ser Thr Pro Met Ser Cys Arg Phe Ala 
55 180 185 190 

Glu Ala Thr Asn Ser Gly Thr He Tyr Leu Ala Leu Met His Glu Thr 

195 200 205 

Thr Met Ser Thr Thr He Leu Thr Tyr Gin Asn Gly Glu Phe Val Asp 
210 215 220 

60 He Ser Glu Ser Glu Leu Ser Asn Ser He He Ala Ser Met Cys Ser 
225 230 235 24 0 

Asn Lys Glu Gly Asp He He Ala Leu Val Thr Ser Tyr Thr Gly Phe 

24 5 250 255 

Met Ser GJ y Thr Leu Ala He Ara Lys Ala Asp Glu Gly Lys Trp Gin 
65 260 ' 265 2^0 

Leu Val Gly Gl -y Asp He Gin Asn Ala He Val Gin Asn He Cys Me". 

27 5 2 80 2 85 

Met Asp Asp Asn Lys lie Ala Cys Glu Val Phe Gly Thr Pro Asn Gly 
290 295 300 

70 Val Asp Gly Arg Thr Arg Val Cys Val Ser Asp Ala Ser Val Phe Asp 
305 310 315 320 

Phe Glu Trp Tyr Glu Asp Glu He Tyr Gly Gly Leu He Phe Asp Thr 

325 330 335 

Phe Phe Tyr Ser Pro Trp Asp Lys Leu Leu Tyr Ala Lys Phe Gly Gly 
75 34 0 34 5 350 
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10 



15 



20 



45 



50 



55 



65 



70 



He 


Met 


Leu 
355 


Arg 


Ser 


Lys 


Glu 


Ser 
360 


Phe 


He 


Thr 


Ser 


Phe 
365 


He 


Ser 


Pro 


Thr 


Val 




Gin 


Gl y 


VaJ 


Asp 


Val 


Tyr 


Thr 


Leu 


Ala 


Gly 


Lys 


lie 


Arg 




37 0 










375 










380 






He 


Glu 


Ser 


Glu 


Thr 


Pro 


Val 


Ser 


Glu 


Val 


Leu 


Leu 


Phe 


Asp 


Leu 


Ala 


385 










390 










395 










4 00 


Gly 


Arg 


Met 


Val 


Leu 
40b 


Arg 


Gin 


Thr 


He 


Asp 
410 


A^-n 


Lys 


He 


Tyr 


Ser 
415 


Asp 


He 


Asp 


Thr 


Asn 


Gly 


Leu 


Lys 


Arg 


Ser 
425 


Gly 


He 


Tyr 


Val 


Val 
430 


Ser 


Val 


Arg 


Leu 


Ser 
435 


Ser 


Gly 


Gin 


Val 


Phe 

440 


Ser 


His 


Lys 


Val 


Gin 
44b 


Val 







(2) IIJE-ORMATIOM FOR SEQ ID IIO:4 95 

(i) SEQUEIICE CHARACTERISTICS: 

(A) LEHGTH: 308 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGV: linear 

(ii) IIOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

25 (vi) ORTGIIIAL SOURCE: 

(A) ORGAIIISM: Porphyromonas gingival is 

(iM) FEATURE: 

(A) tJAI IE/KEY: misc_feature 
30 (B) LOCATIOH 1,..308 

(:-:i) SEQUEIICE DESCRI PTIOi I : SEQ ID MO: 4 95 

Met He He Arg Cys Leu He Arg Arg Pro Arg Thr Val Leu Phe Gly 
OO 1 5 10 15 

Leu He Phe Val Val Gly Leu Phe Ser Ala Met Ala Gin Glu Lys Lys 

20 25 30 

Asp Ser Leu Ser Thr Val Gin Pro Val Pro Asn Ser Ser Met Val Glu 
35 40 45 

4U Gin Thr Pro Leu Leu Ser He Asp His Pro Val Leu Pro Ala Ser Phe 
50 55 60 

Gin Asn Thr Arg Thr Leu Lys Arg Phe Arg Asp Lys His Leu Ser Asp 
^5 70 75 80 

Ala Leu Leu Asn Gly Leu Lys Pro His Arg Ser Ser Leu Gin Leu Asn 

85 90 95 

Glu Glu Leu Asn Phe Ala Ala Glu Arg Arg Asp Phe Val Ser Pro Leu 

100 105 110 

Leu Gin Thr Arg His Ala Ala Gly Val Leu Ser Trp Arg Pro Thr Asp 

115 120 125 

Arg Met His Phe Tyr Thr Ser Gly Asn He Gly Leu Gly His Asp Leu 

130 135 140 

Leu Thr Gly Val Arg Lys Asp Phe Gly Trp Asn Ala Gly Ala Asp Phe 
I'l^ 150 155 igo 

Leu Leu Ser Gin Asn Leu Thr Ala His Val Gin Gly Gly Trp Gin Gin 

1^5 170 175 

Asn Phe Gly Phe He Pro Met Thr Ala Val Asn Gly Gin Leu Arg Trp 

180 185 190 

Gin Ala Thr Glu Arg Leu Ser Phe Thr Thr Gly He Asp Tyr Arg Gin 
cn 200 205 

DU Val Gin Trp Asn Ala Phe Asp Asn Arg Thr Phe Ser Leu Lys Gly S-r 
210 215 220 

Ala Arg Tyr Glu Val Met Asp Asn Val Phe Val Asn Gly Phe Gly Ser 
225 230 235 240 

Tyr Pro Leu Tyr Ser Ser Thr Arg Ser Gly Leu Asn Met Ala Val Pro 

245 250 255 

Met His Gly Phe Gly Pro Gin Tyr Gly Gly Ser Leu Glu Leu Lys Val 

260 265 270 

Ser Glu Arg Phe Gly Phe Ala Val Gly Met Glu Arg Glu Tyr Asn II- 

275 280 285 

Trp Thr Arg Arg Trp Glu Thr His Tyr Phe Ala Tyr Pro Val Ph- T-r 

2?0 295 300 

Gly Asp Lys Lys 
305 



75 (2) II J FORMATION FOR SEQ ID NO: 4 96 
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10 



60 
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( i ) SEQUEMCE CHARACTERISTICS : 

(A) LEIIGTH: 315 amino acid^ 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIHAL SOURCE: 

( A ) ORGAN I Sn ; Po r ph y r omo n a .'v g i n g i v a 1 i s 



(ix) FEATURE: 
15 (A) DAI IE/KEY: mi£c_feature 

(B) LOCATION 1 . . .315 

(xi) SEQUEMCE DESCRIPTION: SEQ ID UOr-l^G 

20 Met hys Thv A^n Arq Arg Tyr Ala Phe Val Leu Pro Leu Leu Leu Leu 
i 5 10 15 

Thr Gly Leu Leu Ala Trp Gly Gin Asp Ser Ser His Gly Ser Asn Thr 

20 25 30 

Ala Phe Ala Thr Asp Ser Ser Ser Arg Glu Leu Pro Thr Glu Gin Ser 
25 35 40 45 

Ala Tyr Arg lie His Ser Ala Tyr Met Vai Gly Gly Gly Gly Ser lie 

50 55 60 

Thr Arg Asp Thr Tyr Leu Ser Pro Leu Arg Tyr Gly Gly Trp Thr Leu 
65 7 0 7 5 80 

30 Asn Leu Leu Gly Glu Lys Thr Phe Pro Leu Lys Ala Ser Asp Ser Arg 

85 90 95 

Trp Met lie Arg Thr Gly His Glu Leu Asp Fhe Ala Leu Met Asp Asn 

100 105 110 

Pro Ala Asn Asn Ala His Phe Tyr Ser Leu Leu Tyr Asn Gly Ser Ala 
35 115 120 125 

Ala Ala Leu Tyr Arg Leu Gly Ala Lys His Leu Arg Ala Ala Trp Met 

130 135 140 

Asp Asn Leu Arg Leu Ala Phe Gly Pro Gly Leu Glu lie Gly Leu Gly 
145 150 155 160 

40 Gly lie Tyr Ser Thr Arg Asn Gly Asn Asn Pro Ala Thr Leu Lys Leu 

165 " 170 175 

Tyr Thr Asn Ala He Ala Gin Ala Ser He Gly Tyr Tyr Val Pro Ser 

180 185 190 

Glu Thr Fhe Pro Leu Tyr Phe Arg Leu Leu Ser Gin He Asn Leu Phe 
45 195 200 205 

Gly He Ala Tyr Gly Asn Gly Phe Gly Glu Ser Tyr Tyr Glu Asn Phe 

210 215 220 

Leu Leu Asn Asn Gly He Ala Gly Ser Leu His Phe Thr Tyr Pro Gly 
225 230 235 240 

50 Lys Phe Thr Ara Phe Thr Thr Leu He Thr Ala Asp He Pro He Arg 

245 250 255 

Asn Phe Cys Thr Leu Arg Val Gly Tyr Arg Tyr Ser His Leu Gly Ser 

260 265 270 

Ser Leu Asn Ala Leu Asp Thr Arg He His Ser His Thr Ala Phe He 
55 275 280 285 

Gly Phe Val Thr Glu Phe Tyr Arg Phe Arg Gl*^* Arg Lys Ala Met Asn 

2 90 2 95 300 

Thr Gly Arg Arg Thr Ser Leu Tyr Tyr His Asp 
305 310 315 



(2) IMFORMATIOH FOR SEQ ID 110:4 97 



(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 285 amino acids 
65 (P) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

70 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

75 (ix) FEATURE: 
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BNSDOCID; <WO 9929870A1_L> 
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55 



60 



65 



70 



75 



(A) MAl'IE/KEY: misc_f eature 

(B) LOCATION 1 . . .2 85 

(Xi) SEQUEIJCE DESCRIPTIOM: SEQ ID 1)0:497 



Met 


Val 


Val 


Thr 


Leu 


Leu 


Val 


He 


Val 


Gly 


He 


Val 


Val 


Val 


Val 


Arg 


1 








5 










10 










15 




Tyr 


Sei- 


Leu 


Arg 

20 


Val 


His 


Val 


His 


Lys 
25 


Thr 


Gly 


Thr 


Val 


Val 

30 


Ser 


Ala 


Ala 


ne 


Phe 


Gly 


Phe 


He 


Leu 


Leu 


Gly 


Lys 


Thr 


Val 


Pro 


Cys 


Asp 


Thr 






35 










40 










45 




Arg 


Asn 


Phe 


Phe 


Ser 


Ser 


Glu 


Ser 


Asp 


Glu 


Pro 


Glu 


Ser 


Arg 


Val 


Ala 




50 










55 










60 








Thr 


Glu 


lie 


Ala 


His 


Leu 


Cys 


Glu 


He 


Gly 


Phe 


Gin 


He 


His 


Ala 


Ser 


65 










70 










75 










80 


Ser 


He 


His 


Val 


Ala 


Val 


Arg 


Thr 


Asp 


Phe 


Gly 


Gin 


Ala 


Gly 


He 


His 










85 










90 








95 




Cys 


Pro 


Met 


Ala 
100 


Thr 


Asp 


Ala 


Ser 


Ala 
105 


Thr 


Glu 


Phe 


Asp 


Arg 
110 


Ser 


Ala 


Glu 


Cys 


Ala 


Glu 


Arg 


Thr 


Ser 


Ala 


Gin 


He 


Asp 


Thr 


Ala 


He 


Arg 


Ser 






115 










120 










125 






Gill 


Ser 


Gin 


lie 


He 


Arg 


Thr 


His 


He 


Asp 


Thr 


Cys 


Pro 


Lys 


Ser 


Ser 




130 










135 










140 








Gly 


Thr 


lie 


Gly 


Gly 


Ser 


Thr 


His 


Thr 


Ser 


Leu 


His 


Leu 


Lys 


Val 


Phe 


14b 










150 










155 








160 


Asp 


Gly 


Arg 


Gly 


Glu 


Val 


Gly 


His 


He 


His 


Pro 


Lys 


Asp 


Gly 


Leu 


Arg 










165 










170 










175 


Phe 


Gly 


Val 


Val 
180 


Glu 


Gly 


Tyr 


Ser 


He 
185 


Gly 


Ser 


Tyr 


Val 


Asp 
190 


Ala 


He 


G3 y 


lie 


Gly 


Ala 


Thr 


His 


Ala 


Lys 


Ala 


Gly 


He 


Ser 


Asp 


Thr 


Arg 


Thr 






195 










200 










205 






Gly 


l]e 


Ala 


Gly 


Gly 


Tyr 


His 


Gly 


Gly 


Ser 


Gl n 


His 


Gin 


Gin 


He 


Gly 




21.0 










215 










220 








Asp 


Val 


Thr 


Thr 


He 


He 


Gly 


Leu 


Gly 


Lys 


Phe 


Gly 


Leu 


Ala 


Asn 


Val 


225 










230 










235 










24 0 


Gly 


Val 


Ser 


Asp 


Arg 
245 


Gly 


Phe 


Arg 


Arg 


Gly 
250 


Ala 


Ser 


Ser 


Tyr 


Asp 
255 


Leu 


Tyr 


Gly 


Leu 


Glu 


Leu 


His 


He 


Ala 


Lys 


Thr 


He 


Tyr 


Leu 


Ala 


Val 


Gly 








260 










265 










270 




Asp 


Gly 


Cys 

275 


Leu 


Cys 


Arg 


Gin 


Ala 
280 


Glu 


Arg 


Gin 


Asp 


Gly 
285 









10 



15 



20 



25 



30 



35 



40 



(2) II ] FORMATION FOR SEQ ID HO: 4 98 

45 U) SEQUEIICE CHARACTERISTICS: 

(A) LEI4GTH: 599 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGII-JAL SOURCE: 

(A) ORGAMISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) HAME./KEY: misc_feature 
{B) LOCATIOIJ 1 ... 599 

(xi) SEQUEJJCE DESCRIPTIOM: SEQ ID MO: 4 96 



Met 


Glu 


Asn 


Leu 


Lys 


Asn 


He 


Gin 


Pro 


Arg 


Glu 


Asp 


Phe 


Asn 


Trp 


Glu 


1 








5 










10 










15 




Glu 


Phe 


Glu 


Ala 


Gly 


Gly 


Val 


His 


Ala 


Ala 


Val 


Ser 


Arg 


Gin 


Glu 


Gin 








20 










25 








30 






Glu 


Ala 


Ala 


Tyr 


Asp 


Lys 


Thr 


Leu 


Asn 


Thr 


He 


Lys 


Glu 


Lys 


Glu 


Val 






35 










40 








45 








Val 


Met 


Gly 


Arg 


Val 


Thr 


Ala 


He 


Asn 


Lys 


Arg 


Glu 


Val 


Val 


He 


Asn 




50 










55 








60 










Val 


Gly 


Tyr 


Lys 


Ser 


Glu 


Gly 


Val 


Val 


Pro 


Ala 


Thr 


Glu 


Phe 


Arg 


Tyr 


65 










70 










75 








80 


Asn 


Pro 


Glu 


Leu 


Lys 


Val 


Gly 


Asp 


Glu 


Val 


Glu 


Val 


Tyr 


He 


Glu 


Asn 










85 










90 










95 




Gin 


Glu 


Asp 


Lys 


Lys 


Gly 


Gin 


Leu 


Val 


Leu 


Ser 


His 


Arg 


Lys 


Gly 


Arg 
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100 105 110 

Ala Ala Arg Ser Trp Glu Arg Val Asn Glu Ala Leu Glu Lys Asp Glu 

115 120 125 

He Val Lys Gly Tyr Val Lys Cys Arg Thr Lys Gly Gly Met He Val 
5 130 135 140 

hsp Val Phe Gly He Glu Ala Plie Leu Pro Gly Ser Gin lie /\sp Val 
1^5 150 155 160 

Arg Pro He Arg A^p Tyr Asp Ala Phe Val Glu Lys Thr Met Glu Phe 
165 170 175 

10 Lys He Val Lys He Asn Gin Glu Tyr Lys Asn Val Val Val Ser His 
180 105 190 

Lys Val Leu He Glu Ala Glu Leu Glu Gin Gin Lys Lys Glu He He 

195 200 205 

Gly Lys Leu Glu Lys Gly Gin Val Leu Glu Gly He Val Lys Asn He 
15 210 215 220 

Thr Ser Tyr Gly Val Phe He Asp Leu Gly Gly Val Asp Gly Leu He 
225 230 235 2-30 

His He Thr Asp Leu Ser Trp Gly Arg Val Ala His Pro Glu Glu He 
2^5 250 255 

20 Val Gin Leu Asp Gin Lys He Asn Val Val He Leu Asp Phe Asp Glu 
260 265 270 

Asp Arg Lys Arg He Ala Leu Gly Leu Lys Gin Leu Met Pro His Pro 

275 280 285 

Trp Asp Ala Leu Asp Ser Glu Leu Lys Val Gly Asp Lys Val Lys Gly 
25 2 90 2 95 300 

Lys Val Val Val I let Ala Asp Tyr Gly Ala Phe Val Glu He Ala Gin 
305 310 315 320 

Gly Val Glu Gly Leu He His Val Ser Glu Met Ser Trp Thr Gin His 
325 330 335 

30 Leu Arg Ser Ala Gin Asp Phe Leu His Val Gly Asp Glu Val Glu Ala 
340 345 350 

Val He Leu Thr Leu Asp Arg Glu Glu Arg Lys Met Ser Leu Gly Leu 

355 360 365 

Lys Gin Leu Lys Pro Asp Pro Trp Ala Asp He Glu Thr Arg Phe Pro 
35 370 375 380 

Val Gly Ser Arg His His Ala Arg Val Arg Asn Phe Thr Asn Phe Gly 
385 390 395 -400 

Val Phe Val Glu He Glu Glu Gly Val Asp Gly Leu He His He Ser 
405 410 415 

40 Asp Leu Ser Trp Thr Lys Lys He Lys His Pro Ser Glu Phe Thr Glu 
420 425 430 

Val Gly Ala Asp He Glu Val Gin Val He Glu He Asp Lys Glu Asn 

435 440 445 

Arg Arg Leu Ser Leu Gly His Lys Gin Leu Glu Glu Asn Pro Trp Asp 
45 450 455 460 

Val Phe Glu Thr Val Phe Thr Val Gly Ser He His Glu Gly Thr Val 
465 470 475 480 

He Glu Val Met Asp Lys Gly Ala Val Val Ser Leu Pro Tyr Gly Val 
485 490 495 

50 Glu Gly Phe Ala Thr Pro Lys His Met Val Lys Glu Asp Gly Ser Gin 
500 505 510 

Ala Val Leu Glu Glu Lys Leu Pro Phe Lys Val He Glu Phe Asn Lys 

515 520 525 

Asp Ala Lys Arg He He Val Ser His Ser Arg Val Phe Glu Asp Glu 
55 530 535 540 

Gin Lys Met Ala Gin Arg Glu Ala Asn Ala Glu Arg Lys Ala Glu Ala 
545 550 555 560 

Lys Ala Ala Gin Lys Glu Ala Ala Ala Glu Ala Ala Asn Pro Ala Gin 
565 570 575 

60 Ala Val Glu Lys Ala Thr Leu Gly Asp Leu Gly Glu Leu Ala Ala Leu 
580 585 590 

Lys Glu Lys Leu Ser Glu Asn 
595 

65 (2) IHFORMATIOM FOR SEQ ID MO: 4 99 

(i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 amino acids 

(B) TYPE: amino acid 
70 (D) TOPOLOGY: linear 

(ii} MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



75 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID; <WO 9929870A1_I_> 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



WO 99/29870 PCT/AU98/01 023 

450/490 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromona^ gingival is 

(ix) FEATURE: 

(A) NAI-IE/KEY: misc_f eature 

(B) LOCATION I , . . 550 

SEQUENCE DESCRIPTION: SEQ ID NO: 499 



I let 


Gly 


Arg 


Val 


Thr 


Ala 


He 


Asn 


Lys 


Arg 


Glu 


Val 


Val 


He 


Asn 


Val 


1 








5 










10 










15 




Gly 


Tyr 


Lys 


Ser 


Glu 


Gly 


Val 


Val 


Pro 


Ala 


Thr 


Gl u 


Phe 


Arg 


Tyr 


Asn 








20 










25 










30 




Pro 


Glu 


Leu 


Lys 


Val 


Gly 


Asp 


Glu 


Val 


Glu 


Val 


Tyr 


He 


Glu 


Asn 


Gin 






35 










40 










4 5 








Glu 


Asp 


Lys 


Lys 


Gly 


Gin 


Leu 


Val 


Leu 


Ser 


His 


Arg 


Lys 


Gly 


Arg 


Ala 




50 










55 










60 










Ala 


Arg 


Ser 


Trp 


Glu 


Arg 


Val 


Asn 


Glu 


Ala 


Leu 


Glu 


Lys 


Asp 


Glu 


He 


65 










70 










75 








80 


Val 


Lys 


Gly 


Tyr 


Val 


Lys 


Cys 


Arg 


Thr 


Lys 


Gly 


Gly 


riet 


He 


Val 


Asp 










85 










90 










95 




Val 


Phe 


Gly 


He 


Glu 


Ala 


Phe 


Leu 


Pro 


Gly 


Ser 


Gin 


He 


Asp 


Val 


Arg 








100 










105 










110 




Pro 


He 


Arg 


Asp 


Tyr 


Asp 


Ala 


Phe 


Val 


Glu 


Lys 


Thr 


Met 


Glu 


Phe 


Lvs 






115 










120 










125 








He 


Val 


Lys 


He 


Asn 


Gin 


Glu 


Tyr 


Lys 


As n 


Val 


Val 


Va 1 


Ser 


His 






130 










135 










140 










Val 


Leu 


He 


Glu 


Ala 


Glu 


Leu 


Glu 


Gin 


Gin 


Lj-s 


Lys 


Glu 


He 


He 


Gl \^ 


145 










150 










155 










160 


Lys 


Leu 


Glu 


Lys 


Gly 


Gin 


Val 


Leu 


Glu 


Glv 


He 


Val 


Lys 


Asn 


He 


Thr 










165 










170 










175 




Ser 


Tyr 


Gly 


Val 


Phe 


He 


ASD 


Leu 


Gly 




Val 


Asp 


Gl V 


L^u 


He 


His 








180 










185 










190 






He 


Thr 


Asp 


Leu 


Ser 


Trp 


Glv 


Arg 


Val 


Ala 


His 


Pro 


Glu 




He 


Va 1 






195 










200 










2 05 








Gin 


Leu 


Asp 


Gin 


Lys 


He 


Asn 


Val 


Val 


He 


Leu 


Asp 


Phe 


Asp 


Glu 


ASp 




210 










215 










220 










Arg 


Lys 


Arg 


He 


Ala 


Leu 


Gly 


Leu 


Lys 


Gin 


Leu 


Met 


Pro 


His 


Pro 


Tro 


r> r> 










230 










235 










24 0 


Asp 


Ala 


Leu 


Asp 


Ser 


Glu 


Leu 


Lys 


Val 


Gl\- 




Lvs 


Val 


Lys 


Gly 


Lys 










245 










250 










255 




Val 


Val 


Val 


Met 


Ala 


Asp 


Tvr 


Gly Ala 


Phe 


Val 


Glu 


He 


Al a 


Gin 


Gl 








260 










265 










270 




Val 


Glu 


Gly 


Leu 


He 


His 


Val 


Ser 


Glu 


Met 


Ser 


T rr> 


Thr 


Gin 


His 


Leu 






27 5 










280 










285 








Arg 


Ser 


Ala 


Gin 


Asp 


Phe 


Leu 


His 


Val 


Gl V 
^x J 


Asp 


Glu 


Val 


Glu 


Ala 


Va 1 




2 90 










295 








300 










He 


Leu 


•I'hr 


Leu 


Asp 


Arg 


Glu 


Glu 


Arg 


Lys 


Met 


Ser 


Leu 


Glv 


Leu 


Lvs 


305 










310 










315 










320 


Gin 


Leu 


Lys 


Pro 


Asp 


Pro 


Trp 


Ala 


Asp 


He 


Glu 


Thr 


Arg 


Phe 


Pro 


Va 1 










325 










330 










335 




Gly 


Ser 


Arg 


His 


His 


Ala 


Arg 


Val 


Arg 


Asn 


Phe 


Thr 


Asn 


Phe 


Gly 


Val 








340 










345 










350 






Phe 


Val 


Glu 


He 


Glu 


Glu 


Gly 


Val 


Asp 


Gly 


Leu 


He 


His 


He 


Ser 


Asp 






355 










360 










365 








Leu 


Ser 


Trp 


Thr 


Lys 


Lys 


He 


Lys 


His 


Pro 


Ser 


Glu 


Phe 


Thr 


Glu 


Val 




370 










375 










380 










Gly 


Ala 


Asp 


He 


Glu 


Val 


Gin 


Val 


He 


Glu 


He 


Asp 


Lys 


Glu 


Asn 


Arg 


385 










390 










395 








400 


Arg 


Leu 


Ser 


Leu 


Gly 


His 


Lys 


Gin 


Leu 


Glu 


Glu 


Asn 


Pro 


Trp 


Asp 


Val 










405 










410 










415 




Phe 


Glu 


Thr 


Val 


Phe 


Thr 


Val 


Gly 


Ser 


He 


His 


Glu 


Gly 


Thr 


Val 


He 








420 










4 2 5 










430 






Glu 


Val 


Met 


Asp 


Lys 


Gly 


Ala 


Val 


Val 


Ser 


Leu 


Pro 


Tyr 


Gly 


Val 


Glu 






435 










440 










445 






Gly 


Phe 


Ala 


Thr 


Pro 


Ly^ 


His 


Met 


Val 


Lys 


Glu 


Asp 


Gly 


Ser 


Gin 


Ala 




4 50 










455 










4 60 








Val 


Leu 


Glu 


Glu 


Lys 


Leu 


Pro 


Phe 


Lys 


Val 


He 


Glu 


Phe 


Asn 


Lys 


Asp 


465 










470 










475 










480 


Ala 


Lys 


Arg 


He 


He 


Val 


Ser 


His 


Ser 


Arg 


Val 


Phe 


Glu 


Asp 


Glu 


Gin 










485 










4 90 










495 




Lys 


Met 


Ala 


Gin 


Arg 


Glu 


Ala 


Asn 


Ala 


Glu 


Arg 


Lys 


Ala 


Glu 


Ala 


Lys 








500 










505 






510 




Ala 


Ala 


Gin 


Lys 


Glu 


Ala 


Ala 


Ala 


Glu 


Ala 


Ala 


Asn 


Pro 


Ala 


Gin 


Ala 






515 










520 










525 









SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1J_> 



wo 99/29870 



PCT/AU98/01023 



451/ 490 



Val G] u Lys Ala Thr Leu Gly Asp Leu Gly Glu Leu Ala Ala Leu Lys 

530 535 5^0 

Glu Lys Leu Ser Glu Asn 
5^5 550 

5 

{2} li 1 FORI lATI Oil FOR SEQ ID NO: 500 

(i) SEQUEUCE C}jARACTERISTlCS : 

(A) LEllGTH: 458 amino acid^ 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

15 (iii) HYPOTHETICAL: YES 

(vi) OR I GI HAL SOURCE: 

(A) ORGAIIISM: Porphyromona s gingival is 

20 (ix) FEATURE: 

(A) rJAJIE/KEY: misc_feature 

( B) LOCATIOM 1 ... 458 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(xi) SEQUEUCE DESCRIPTIOU : SEQ ID 110:500 



net 


11 e 


Val 


Asp 


Val 


Phe 


Gly 


He 


Glu 


Ala 


Phe 


Leu 


Fro 


Gly 


Ser 


Gin 


1 








c 










10 










15 




lie 


Asp 


Val 


Arg 


Pro 


He 


Aug 


Asp 


Tyr 


Asp 


Ala 


Phe 


Val 


Glu 


Lys 


Thr 








2 0 










25 










30 






1 let 


Glu 


Phe 


Lys 


He 


Val 


Lys 


He 


Asm 


Gin 


Glu 


Tyr 


Lys 


Asn 


Val 


Val 






35 










40 










4 5 








Val 


Ser 


His 


Lys 


Val 


Leu 


He 


Glu 


Ala 


Glu 


Leu 


Glu 


Gin 


Gin 


Lys 


Lys 




50 




















60 










Glu 


He 


li- 


Gly 


Lys 


Leu 


Glu 


Lys 


Gly 


Gin 


Val 


Leu 


Glu 


Gly 


He 


Val 


65 








7 0 










75 










80 


Lys 


Asn 


ne 


Thr 


Ser 


Tyr 


Gly 


Val 


Phe 


He 


Asp 


Leu 


Gly 


Gly 


Val 


Asp 










85 










90 










95 




Gly 


Leu 


li- 


His 


He 


Thr 


Asp 


Leu 


Ser 


Trp 


Gly 


Arg 


Val 


Ala 


His 


Pro 








100 










105 










110 






Glu 


Glu 


ne 


Val 


Gin 


Leu 


Asp 


Gin 


Lys 


He 


Asn 


Val 


Va 1 


He 


Leu 


Asp 






115 










120 










125 








Pile 


Asp 


Glu 


Asp 


Arg 


Lys 


Arg 


He 


Ala 


Leu 


Gly 


Leu 


Lys 


Gin 


Leu 


l-Iet 




130 










135 










140 










Pro 


His 


Pro 


Trp 


Asp 


Ala 


Leu 


Asp 


Ser 


Glu 


Leu 


Lys 


Val 


Gly 


Asp 


Lys 


14 5 










150 










155 










160 


Val 


Lys 


Gly 


Lys 


Val 


Val 


Val 


Met 


Ala 


Asp 


Tyr 


Gly 


Ala 


Phe 


Val 


Glu 










165 










170 










175 




He 


Ala 


Gin 


Gly 


Val 


Glu 


Gly 


Leu 


He 


His 


Val 


Ser 


Glu 


I4et 


Ser 


Trp 








180 










185 










1 90 






Thr 


Gin 


His 


Leu 


Arg 


Ser 


Ala 


Gin 


Asp 


Phe 


Leu 


His 


Val 


Gly 


Asp 


Glu 






1 95 










200 










205 








Val 


Glu 


Ala 


Val 


He 


Leu 


Thr 


Leu 


Asp 


Arg 


Glu 


Glu 


Arg 


Lys 


Met 


Ser 




210 










215 










220 










Leu 


Gly 


Leu 


Lys 


Gin 


Leu 


Lys 


Pro 


Asp 


Pro 


Trp 


Ala 


Asp 


He 


Glu 


Thr 


225 










230 










235 










240 


Arg 


Phe 


Pro 


Val 


Gly 


Ser 


Arg 


Hi s 


His 


Ala 


Arg 


Val 


Arg 


Asn 


Phe 


Thr 










245 










2 50 










255 






Phe 


Gly 


Val 


Phe 


Val 


Glu 


He 


Glu 


Glu 


Gly 


Val 


Asp 


Gly 


Leu 


He 








2 60 










265 










27 0 






Hi r 


He 


Ser 


Asp 


Leu 


Ser 


Trp 


Thr 


Lys 


Lys 


Tie 


Lys 


His 


Pro 


Ser 


Glu 






275 










200 










285 








Phe 


Thr 


Glu 


Val 


Gly 


Ala 


Asp 


He 


Gl u 


Val 


G i n 


Val 


He 


Glu 


He 


Asp 




2D0 










295 










300 










Lys 


Glu 


Asn 


Arg 


Arg 


Leu 


Ser 


Leu 


Gly 


His 


Lys 


Gin 


Leu 


Gl u 


Glu 


Asn 


305 










310 










315 










320 


Pro 


Trp 


Asp 


\' a 1 


Phe 


Glu 


Thr 


Val 


Phe 


Thr 


Val 


Gly 


Ser 


He 


His 


Glu 










325 










330 










335 




Gly 


Thr 


Val 


He 


Glu 


Val 


net 


Asp 


Lys 


Gly 


Ala 


Val 


Val 


Ser 


Leu 


Pro 








3-1 0 










345 










350 






Tyr 


Gly 


Va 1 


Glu 


Gly 


Phe 


Ala 


Thr 


Pro 


Lys 


His 


net 


Val 


Lys 


Glu 


Asp 






355 










360 










3 65 








Gly 


Ser 


Gin 


Ala 


Val 


Leu 


Glu 


Glu 


Lys 


Leu 


Fro 


Phe 


Lys 


Val 


He 


Glu 




370 










375 










380 










Phe 


Asn 


Lys 


Asp 


Ala 


Lys 


Arg 


He 


He 


Val 


Ser 


His 


Ser 


Arg 


Val 


Phe 


385 










390 










395 










4 00 



BNSDOCID: <WO 9929870A1J_> 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



wo 99/29870 



PCT/AU98/01023 



452/ 490 

Glu Asp Glu Gin Lys Met Ala Gin Arg Glu Ala Asn Ala Glu Arg Lys 

4 05 410 415 

Ala Glu Ala Lys Ala Ala Gin Lys Glu Ala Ala Ala Glu Ala Ala Asn 

420 ^25 430 

5 Pro Ala Gin Ala Val Glu Lys Ala Thr Leu Gly Asp Leu Gly Glu Leu 

435 440 445 

A3 a Ala Leu Lys Glu Lys Leu Ser Glu Asn 

450 455 

10 (2) IIIFORI lATIOII FOR SEQ ID HO; 501 

(i) SEQUEIICE CHARACTERISTICS: 

(A) LEtlGTH: 426 amino acids 

( B) TYPE : amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



20 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(vi) ORIGIIIAL SOURCE: 

(A) ORGAMISM: Forphyromor.as gingiva lis 



(i::) FEATURE: 
25 (A) MAJ IE/KEY: misc_f£atur = 

(B) LOCATION 1. . .426 



(xi) SEQUEUCE DESCRI PTIOIJ : SEQ ID IJO: 501 



Met 


Glu 


Phe 


Lys 


He 


Val 


Lys 


He 


As n 


Gin 


Glu 


Tvr 


Lvs 


Asn 


Va 1 


Va 1 


1 








5 










10 








15 




Val 


Ser 


His 


Lys 


Val 


Leu 


He 


Glu 


Ala 


Glu 


Leu 


Glu 


Gin 


Gin 


Lvs 


Lvs 








20 










25 










30 


Glu 


He 


He 


Gly 


Lys 


Leu 


Glu 


Lys 


Gly 


Gin 


Val 


Leu 


Glu 


Gly 


He 


Val 






35 










40 










45 






Ly^ 


A£-n 


He 


Thr 


Ser 


Tyr 


Glv 


Val 


Phe 


He 


Asp 


Leu 


Gly 


Gly 


Val 


Asp 




50 










55 










60 








Gly 


Leu 


He 


His 


He 


Thr 


Asp 


Leu 


Ser 


Trp 


Gly 


Arg 


Val 


Ala 


His 


Pro 


65 










70 








7 5 








80 


Glu 


Glu 


He 


Val 


Gin 


Leu 


Asp 


Gin 


Lys 


He 


Asn 


Val 


Val 


He 


Leu 


Asp 










85 










90 










95 




Phe 


Asp 


Glu 


Asp 


Arg 


Lys 


Arg 


He 


Ala 


Leu 


Gly 


Leu 


Lys 


Gin 


Leu 


Met 








100 










105 










110 






Pro 


His 


Pro 


Trp 


Asp 


Ala 


Leu 


Asp 


Ser 


Glu 


Leu 


Lys 


Val 


Gly 


Asp 


Lys 






115 










120 










125 








Val 


Lys 


Gly 


Lys 


Val 


Val 


Val 


Met 


Ala 


Asp 


Tyr 


Gly 


Ala 


Phe 


Val 


Gl u 




130 










135 










14 0 










He 


Ala 


Gin 


Gly 


Val 


Glu 


Gly 


Leu 


He 


His 


Val 


Ser 


Glu 


Met 


Ser 


Trp 


145 










150 










155 










160 


Thr 


Gin 


His 


Leu 


Arg 


Ser 


Ala 


Gin 


Asp 


Phe 


Leu 


His 


Val 


Gly 


Asp 


Glu 










165 










170 








175 




Val 


Glu 


Ala 


Val 


He 


Leu 


Thr 


Leu 


Asp 


Arg 


Glu 


Glu 


Arg 


Lys 


Met 


Ser 








180 










185 










190 






Leu 


Gly 


Leu 


Lys 


Gin 


Leu 


Lys 


Pro 


Asp 


Pro 


Trp 


Ala 


Asp 


He 


Glu 


Thr 






195 










200 








205 








Arg 


Phe 


Pro 


Val 


Gly 


Ser 


Arg 


His 


His 


Ala 


Arg 


Val 


Arg 


Asn 


Phe 


Thr 




210 










215 










220 










Asn 


Phe 


Gly 


Val 


Phe 


Val 


Glu 


He 


Glu 


Glu 


Gly 


Val 


Asp 


Gly 


Leu 


He 


2 2 5 










230 










235 








240 


His 


He 


Ser 


Asp 


Leu 


Ser 


Trp 


Thr 


Lys 


Lys 


He 


Lys 


His 


Pro 


Ser 


Glu 










245 










250 










255 




Phe 


Thr 


Glu 


Val 


Gly 


Ala 


Asp 


He 


Glu 


Val 


Gin 


Val 


He 


Glu 


He 


Asp 








2 60 










265 










270 




Lys 


Glu 


Asn 


Arg 


Arg 


Leu 


Ser 


Leu 


Gly 


His 


Lys 


Gin 


Leu 


Glu 


Glu 


Asn 






275 










280 










285 








Pro 


Trp 


Asp 


Val 


Phe 


Glu 


Thr 


Val 


Phe 


Thr 


Val 


Gly 


Ser 


He 


His 


Glu 




2 90 










295 










300 










Gly 


Thr 


Val 


He 


Glu 


Val 


Met 


Asp 


Lys 


Gl y 


Ala 


Val 


Val 


Ser 


Leu 


Pro 


305 










310 








315 










320 


Tyr 


Gly 


Val 


Glu 


Gly 


Phe 


Ala 


Thr 


Pre 


Lys 


His 


Met 


Val 


Lys 


Glu 


Asp 










325 










330 










335 


Gly 


Ser 


Gin 


Ala 


Val 


Leu 


Glu 


Glu 


Lys 


Leu 


Pro 


Phe 


Lys 


Val 


He 


Glu 








340 










34 5 










350 






Phe 


Asn 


Lys 


Asp 


Ala 


Lys 


Arg 


He 


He 


Val 


Ser 


His 


Ser 


Arg 


Va 1 


Phe 






355 










360 










365 







SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I_> 



20 



30 



35 



40 



45 



50 



55 



60 



WO 99/29870 PCT/AU98/01023 

453/ 490 



Glu 


Asp 

37 0 


Glu 


Gin 


Lys 


Het 


Al a 
375 


Glti 


Arg 


Glu 


Ala 


380 


Ala 


Glu 


Arg 


Ly^ 


Ala 


Giu 


Ala 


Lys 


Ala 


Ala 


Gin 


Lys 


Gl u 


Ala 


Ala 


Aia 


Glu 


Ala 


Ala 


A^n 


3G5 










390 










395 










4 00 


Pro 


Ala 


Gl n 


Ala 


Val 
4 05 


Glu 


hys 


Ala 


Thr 


Leu 
410 


Gly 


A.-^p 


Leu 


Gly 


Glu 
415 


Leu 


Ala 


Ala 


Leu 


Ly- 
420 


Glu 


hys 


Leu 


Ser 


Glu 

425 


A^n 















10 (2) IlIFORI lATlOti FOR SEQ ID 110:502 

(i) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 24 0 amino acids 

(B) TYPE: amino acid 
15 iO) TOPOLOGY: linear 

(ii) IIOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 



(vi) ORTGIilAL SOURCE: 

(A) ORGAIIISM: Porphyromonas gingiva lis 



(ix) FEATURE: 
25 {A) IIAIiE/KEY: mi sc_f eatur-E 

(B) LOCATIOM 1 ... 240 



(xi) SEQUEIICE DESCRIPTION: SEQ ID NO: 502 



Met 


Lys 


Lys 


Ala 


He 


Leu 


Ser 


Gly 


Ala 


Ala 


Leu 


Leu 


Leu 


Gly 


Leu 


Cys 


1 








5 










10 










15 




Ala 


Asn 


Ala 


Gin 


Asn 


Val 


Gin 


Leu 


His 


Tyr 


Asp 


Phe 


Gly 


His 


Ser 


He 








20 










25 










30 






Tyr 


A^p 


Glu 


Leu 


Asp 


Gly 


Arg 


Pro 


Lys 


Leu 


Thr 


Thr 


Thr 


Val 


Glu 


Asn 






35 










4 0 










45 








Phe 


Thr 


Pro 


Asp 


Lys 


Trp 


Gly 


Ser 


Thr 


Phe 


Phe 


Phe 


He 


Asp 


Met 


Asp 




50 










55 










60 












Thr 


Gly 


Lys 


Gly 


lie 


Gin 


Ser 


Ala 


Tyr 


Trp 


Glu 


He 


Ser 


Arg 


Glu 


65 










70 










75 










80 


Leu 


Lys 


Phe 


Trp 


Gin 


Ala 


Pro 


Val 


Ser 


He 


His 


Leu 


Glu 


Tyr 


Asn 


Gly 










85 










90 










95 




Gly 


Leu 


Ser 


Thr 


Ser 


Phe 


Thr 


Phe 


Gly 


His 


Asp 


Ala 


Leu 


He 


Gly 


Ala 








100 










105 










HO 






Thr 


Tyr 


Thr 


Tyr 


Asn 


Asn 


Pro 


Ser 


Phe 


Thr 


Arg 


Gly 


Phe 


Thr 


He 


Thr 






115 








120 










125 








Pro 


Met 


Tyr 


Lys 


His 


Leu 


Gly 


Ala 


His 


Asp 


Phe 


His 


Thr 


Tyr 


Gin 


He 




130 










135 










14 0 










Thr 


Gly 


Thr 


Trp 


Tyr 


net 


His 


Phe 


Leu 


Asp 


Gly 


Leu 


Leu 


Thr 


Phe 


Asn 


14 5 










150 










155 










160 


Gly 


Phe 


Leu 


Asp 


Leu 


Trp 


Gly 


Phe 


Pro 


Gin 


Glu 


Asn 


Pro 


He 


Gly 


Gly 










165 










170 










175 




Pro 


Val 


Leu 


Lys 


Glu 


Gly 


Asp 


Lys 


Phe 


Val 


Phe 


Leu 


Ser 


Glu 


Pro 


Gin 








180 










185 










190 






Phe 


Trp 


He 


Asn 


Leu 


Asn 


Arg 


He 


Lys 


Gly 


He 


Asp 


Lys 


Asp 


Phe 


Asn 






195 










200 










205 








Leu 


Ser 


He 


Gly 


Thr 


Glu 


Met 


Glu 


He 


Ser 


Arg 


Asn 


Phe 


Ala 


Arg 


Met 




210 










215 










220 










A£-p 


Lys 


Phe 


Ser 


Cys 


He 


Pro 


Thr 


Leu 


Ala 


Val 


Lys 


Trp 


Thr 


Phe 


Asn 


22 5 










2 30 










235 










2 10 



{2} INFORilATIOH FOR SEQ ID MO: 503 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 amino acids 
65 (B) TYPE: amino acid 

( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

70 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival ii 

75 (ix) FEATURE: 



BNSDOCID; <WO 9929e70Al J_> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



PCT/AU98/01023 



454/ 490 



(A) IIAIIE/ KEY : mi sc_f ea ture 

(B) LOCATIOIJ 1 . , .A3A 

(xi) SEQUENCE DESCRI PTIO! I : SEQ ID 110:503 

5 

Ilet Tyr Lys Asp Tyr Lys Gly Leu Tyr Ala Ser Leu Arcj Trp Tyr Ala 
1 5 10 15 

Leu lie lie Gly Leu Leu Phe Ala Ala Asp Gly lie Gin Ala Gin Asn 
20 25 30 

10 Asn Asn Phe Thr Giu Ser Pro Tyr Thr Arg Phe Gly Leu Gly Arg Leu 
35 40 45 

Gly Glu Arg Thr Thr lie Ser Gly His Ser 1 let Gly Gly Leu Gly Val 

50 55 6 0 

Gly Leu Acq Gin Gly Thr Tyr Val Asn Ala V^l Am Pro Ala Ser Tyr 
15 66 70 75 80 

Ser Ala Val Asp Ser Viet Thr Phe lie Phe Asp Phe Gly Ala Ser Thr 

85 90 95 

Glv He Thr Trp Tyr Ala Glu Asn Gly Lys Lys Asp Asn Arg Lys Met 
100 105 110 

20 Gly Asn He Glu Tyr Phe Ala Met Leu Phe Pro He Ser Lys Ser He 
115 120 125 

Ala Met Ser Ala Gly Val Leu Pro Tyr Ser Ala Ser Gly Tyr Gin Phe 

130 135 1*10 

Gly Ser Val Asp Gin Val Glu Gly Gly Ser Val Gin Tyr Thr Arg Lys 
25 1-15 150 155 160 

Tyr Leu Gly Thr Gly Asn Leu Asn Asp Leu Tyr Val Gly He Gly Ala 

165 17 0 175 

Thr Pro Phe Lys Asn Phe Ser He Gly Ala Asn Ala Ser Ser Leu Phe 
180 185 190 

30 Gly Arg Phe Thr His Ser Arg Gin Val He Phe Ser Thr Glu A.la Pro 
195 200 205 

Tyr Asn Pro Val His Leu Ser Thr Leu Tyr Leu Lys Ala Ala Lys Phe 

210 215 220 

Asp Phe Gly Met Gin Tyr His Leu Leu Leu Lys Ser Asp Arg Ser Leu 
35 225 230 235 2-10 

Val He Gly Ala Val Tyr Ser Pro Arg Val Lys Met His Ser Glu Leu 

245 250 255 

Thr Gin He Lys Asn Gin Val Gin Asn Giy Val Val Val Glu Ser Glu 
260 265 270 

40 Thr Gin Glu Tyr He Lys Gly Met Asp Tyr Tyr Thr Leu Pro His Thr 
275 280 285 

Leu Gly He Gly Phe Ser Tyr Glu Lys Lys Asp Lys Leu Leu Leu Gly 

290 295 300 

Ala Asp Val Gin Tyr Ser Lys Trp Lys Gly Glu Lys Phe Tyr Lys Ser 
45 305 310 315 320 

Asp Cys L-^-s Phe Gin Asp Arg He Arg Val Ser Leu Gly Gly Glu He 

325 " 330 335 

He Pro Asp He Asn Ala Val Gly Met Trp Pro Lys Val Arg Tyr Arg 
3^10 345 350 

50 Phe Gly Leu His Gly Glu Asn Ser Tyr Leu Lys Val Pro Thr Lys Gly 
355 360 365 

Gly Val Tyr Gin Gly Tyr His He Val Gly Ala Val Phe Gly He Gly 

370 375 380 

He Pro Leu Asn Asp Arg A.rg Ser Phe Val Asn Val Ser Leu Glu Tyr 
55 385 390 395 400 

Asp Arg Leu He Pro Lys Glu Gly Met He Lys Glu Asn Ala Leu Lys 

405 410 415 

Leu Thr Phe Gly Leu Thr Phe Asn Glu Ser Trp Phe Lys Lys Leu Lys 
420 425 430 

60 Leu Asn 

(2) 1 11 FORMAT I OM FOR SEQ ID MO: 504 

65 (i) SEQUEnCE CHARACTERISTICS: 

(A) LEMGTH: 926 amino acids 
(E) TYPE: amino acid 
( D) TOPOLOGY : linear 

70 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIIIAL SOURCE: 
75 (A) OFcGAtllSM: Porphyromonas gingival is 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID; <WO 9929870A1_I_> 



wo 99/29870 



PCT/AU98/01023 



455/490 



(ix) FEATURE: 

(A) HAI-IE/KEY: mis cofeature 

(B) LOCATIOIJ 1 ... 926 

5 

} SEQUENCE DESCRI PTIOII : SEQ ID MO: 504 

Met Arg Ser He Tyr Gin Leu Leu Leu Ser He Leu Leu Ala Ser Leu 
1 5 10 15 

10 Gly Phe Val Gly Leu Glu Ala Gin Gin Ala Gly Val Ala Gly Arg Val 
20 2 5 30 

Leu Asp Glu Glu Gly Asn Pro Het He Gin Ala Ai?n Val Gin Leu Val 

35 40 
Gin Ser Thr Gly Gin Val Ala Val Ala Ala Gly Ala Thr A-n Glu Ly^ 
15 50 55 60 

Gly Leu Phe Ser Leu Lys Thr Ser Gin Glu Gly Asp Tyr He Leu Arg 
65 70 -75 80 

Val Ser Tyr Val Gly Tyr Thr Thr His Asp Glu Lys He Ser Leu Arg 
85 90 95 

20 Asn Gly Gin Thr He Thr Leu Lys Asp He Ser Het Asn Glu Asp Ala 
100 105 110 

Aro Leu Leu Gin Ser Val Thr Val Gin Ala Lys Ala Ala Glu Val Val 

115 120 125 

Val Arc) Asn A^p Thr Leu Glu Phe Asn Ala Gly Ser Tyr Thr Val Ala 
25 130 135 140 

Gin Gly Ala Ser He Glu Glu Leu He Lys Lys Leu Pro Gly Ala Glu 
1^5 150 155 160 

He Gly Ser Asp Gly Lys He Thr He Asn Gly Lys Asp He Ser Lys 
165 1*70 175 

30 He Leu Val Asp Gly Lys Glu Phe Phe Ser Lys Asp Pro Gin Val Ala 
180 185 190 

He Lys Asn Leu Pro Ala Asp Met Val Asn Lys Val Gin Val Leu Asn 

195 200 205 

Lys Leu Ser Glu Leu Ser Arg Met Ser Gly Phe Asp Asp Gly Glu Glu 
35 210 215 220 

Glu Thr Val He Asn Leu Thr Val Lys Pro Glu Lys Lys Lys Gly Leu 
225 230 235 240 

Phe Gly Thr Leu Gin Ala Gly Tyr Gly Thr Asp Gin Arg Tyr Met Ala 
24 5 2 50 2 55 

40 Gly Gly Asn Val Asn Arg Phe Asp Gly Asn Lys Gin Trp Thr Leu He 
260 265 2-70 

Gly Ser Ala Asn Asn Thr Asn Asn Met Gly Phe Ser Glu Met Asp Ser 

275 280 285 

Glu net Gly Ser Met Thr Phe Phe Ser Pro Gin Gly Gly Gly Arg Arg 
45 290 295 300 

Gly Phe Gl^y Asn Ser Gly Gly Val Thr Ser Ser Ser Met Leu Gly Gly 
305 310 315 320 

Asn Phe Ser Val Glu Plie Ser Ser Ala Leu Asn Thr Gly Gly Asp Ala 
325 330 335 

50 Arg Tyr Gly Tyr Asn Asp Lys Ala He Glu Thr Thr Lys Arg Val Glu 
340 345 350 

Asn He Leu Ala Glu Gly Asn Thr Tyr Met Asp Glu Asn He Leu Glu 

355 360 365 

Arg Ser Phe Ser His Asn Gly Gin Ala Arg Phe Arg Met Gin Trp Lys 
55 370 375 380 

Pro Ser Glu Arg Thr Glu Val Val Phe Glu Pro Asp Leu Ser He Ser 
385 390 395 400 

Lys He Asp Gly Phe Phe Asn Asp Thr Tyr Glu Thr Lys Asp Ala Thr 
405 410 415 

60 Gly He Ser He Asn Lys Gly Ser He His Gin Thr Thr Gin Gly Asn 
4 20 4 25 4 30 

Asn Phe Arg Leu Asn Gly Glu Leu Asp He Ser His Lys Leu Asn Asp 

435 440 445 

Glu Gly Arg Thr He Ser Ala Ser Val Ser Gly Gly Leu Thr Asp Glu 
65 '150 4 55 4 60 

Asp Gly Asp Gly He Tyr Gin Ala Val Leu Gin Ser Val Glu Thr Asn 
465 470 475 ABO 

Gin Lys Gin Phe Asn Asp Asn Ser Asn Leu Gin Tyr Arg Leu Arg Leu 
485 490 495 

70 Ser Tyr Val Glu Pro Leu Gly Lys Asn Tyr Phe Ala Gin Ala He Leu 
500 505 510 

Asn Ara Arg Phe Ser Arg Arg Asn Ser Asp Arg Glu Val Tyr Arg Leu 

515 520 525 

Gly Asp Asp Gly Gin Tyr Ser He Leu Asp Ser Gin Tyr Gly Leu Ser 
75 530 535 540 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDCX:iD: <WO 9929870A1_I_> 



60 



70 



75 



WO 99/29870 PCT/AU98/01023 

456/ 490 



Tyr 


Ser 


As n 


Glu 


Phe 


Thr 


Gin 


Tyr 


Arg 


He 


Giy 


Leu 


Asn 


Le u 


Lys 


Lys 


545 










550 










555 








560 


He 


Ala 


Lys 


Thr 


Trp 


Asp 


Tyr 


Thr 


Val 


Gly 


Phe 


Asn 


Val 


Asp 


Pro 


Asn 










565 










570 










575 




Arg 


Tlir 


Va 1 


Ser 


Tyr 


Arg 


Ser 


Val 


Ala 


Gly 


Val 


Glu 


Gl n 


Asp 


Lys 


Leu 








58 0 










585 










590 




Ala 


Phe 


Asn 


Arg 


Val 


Asn 


Leu 


Ser 


Pro 


Met 


Leu 


Arg 


He 


Asn 


Tyr 


Lys 






5 95 










600 










605 






Pro 


Ser 


Arg 


Thr 


Thr 


Asn 


Leu 


Arg 


Val 


Asp 


Tyr 


Arg 


Gly 


Arg 


Thr 


Thr 




6i0 










615 










620 










Gin 


Pro 


Ser 


He 


Asn 


Gin 


He 


Ala 


Pro 


Val 


Gin 


Asp 


He 


Thr 


Asn 


Pro 


625 










630 










635 










64 0 


Leu 


Fhe 


Val 


Thr 


Glu 


Gly 


Asn 


Pro 


Gly 


Leu 


Lys 


Pro 


Ser 


Tyr 


Ser 


Asn 










645 










650 










655 




Asii 


Vol 


Met 


Ala 


Met 


Phe 


Ser 


Asp 


Phe 


Asp 


Ala 


Lys 


Ser 


Gin 


Arg 


Ala 








660 










665 










670 






Phe 


Asn 


He 


Val 


Phe 


Phe 


Gly 


Asn 


Tyr 


Thr 


Phe 


Asp 


Asp 


He 


Val 


Pro 






675 










680 










685 








hsn 


Thr 


Hie 


Tyr 


Asp 


Pro 


Ser 


Thr 


Gly 


He 


7\rg 


Thr 


Thr 


Arg 


Tyr 


Glu 














655 










7 00 










Asn 


Ala 


Ser 


Giy 


Thr 


Trp 


Gin 


Ala 


Asn 


Leu 


His 


Gly 


Thr 


Leu 


Ser 


Leu 


7 05 










710 










715 










7 20 


Pro 


Leu 


Ly^ 


As n 


Arg 


Ala 


Phe 


Ser 


Phe 


Arg 


Met 


Ser 


Leu 


Phe 


Asn 


Arg 










725 










730 










7 35 




Leu 


hlh 


Glu 


Gly 


Gin 


Ser 


Phe 


He 


Asn 


ASD 


Aso 


L* -s 


Asn 


■'-'1 - 


Al a 


Leu 








74 0 










7 4 5 










7 50 






Ser 


Phe 


Arg 


Thr 


Arg 


Gl u 


Arg 


Leu 


Thr 


Leu 


Thr 


T"r 


Arg 


Asn 


Asn 


1 rp 






755 










7 6 0 










7 65 








He 


Asp 


Thr 


Ser 


He 


Gl V 


Gly 


Asn 


He 


I'^l \ - 
•j± ^ 


Phe 




f let 


Ala 


A'^n 


M.:: n 




77 0 










775 










7 80 










Ser 


Leu 


Ser 


Gly 


Gin 


Lys 


Asp 


Ser 


Ara 


Thr 




Asp 


Phe 


Gly 


Gl V 


Asn 


785 










7 90 










7 95 










800 


Tyr 


Gin 


Val 


Ala 


Leu 


Thr 


Leu 


Pro 


Tyr 


Gly 


Phe 


Arg 


He 


Asp 


Ser 


ASD 










805 










810 










815 


Val 


Glu 


Tyr 


Asn 


Thr 


Asn 


Ser 


Gl y 


Tvr 


Ser 


Glv 


Gl" 
J J. J 


Phe 


Ser 


Leu 










820 










825 










830 






Glu 


Trp 


Leu 


Trp 


Asn 


Ala 


Ser 


Leu 


Ser 


Tyr 


Ser 


Phe 


Leu 


Arg 


Asp 


Lys 






835 










84 0 










845 






Ala 


Gly 


Thr 


Leu 


Arg 


Val 


Asn 


Gly 


Tyr 


Asp 


He 


Leu 


Gly 


Gin 


Arg 


Ser 




850 










855 










860 










Ser 


He 


Ser 


Arg 


Ser 


Ala 


Ser 


Ala 


He 


Asn 


He 


Glu 


Glu 


Ser 


Met 


Ser 


865 










870 










875 










880 


A^n 


Thr 


He 


Gly 


Arg 


Tyr 


Val 


Met 


Val 


Asp 


Phe 


He 


Tyr 


Arg 


Phe 


Asn 










885 










8 90 








895 




Ala 


Phe 


Ser 


Gly 


Gly 


Gly 


Ser 


Arg 


Ser 


Asp 


His 


Gin 


Arg 


Gly 


Asn 


Met 








900 










905 










910 








Arg 


Pro 


Gly 


Pro 


Pro 


Phe 


Gly 


Gly 


Gly 


Arg 


Arg 


Pro 


Ser 










915 










920 










925 









10 



15 



20 



25 



30 



35 



40 



45 



50 (2) in FORMATION FOR SEQ ID HO: 505 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 amino acids 

(B) TYPE: amino acid 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



{ vi ) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 



FEATURE: 

65 (A) NAME/KEY: misc_feature 

t B) LOCATION 1 ... 400 



SEQUENCE DESCRIPTIOM: SEQ ID 110:505 

Met Val Met Lys Leu He Lys Arg Ser Leu Leu Leu Leu Gly Ala Val 

1 5 10 15 

Leu Leu He Thr Leu Pro Ala Tyr Ser Gin Asn Asp Asp He Phe Glu 

20 25 30 

Asp Asp He Tyr Thr Ser Arg Lys Glu He Arg Lys Gin Asn Gin Val 
35 40 45 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929670A1_L> 



• 



10 



15 



20 



25 



30 



35 



40 



45 



65 



WO 99/29870 PCT/AU98/01023 

457/490 



Lys 


A3p 


Trp 


Gin 


Asn 


Gin 


Glu 


Asp 


Gly 


Tyr 


Gly 


Asp 


Asp 


Thr 


Gl u 


Tyr 




50 










55 










60 










Thr 


Val 


Ala 


Ser 


Asp 


Arg 


Asp 


He 


Asp 


Ala 


Tyr 


Asn 


Arg 


Arg 


Asp 


Gly 


65 










7 0 










75 










80 


Gin 


Ser 


Tyr 


Asp 


Gly 


Lys 


Lys 


Leu 


Ser 


Lys 


Asp 


L-ys 


Lys 


Arg 


Asp 


Ser 










85 










90 










95 




Thr 


Arg 


Ser 


Ser 


Val 


Pro 


Gly 


Arg 


Tyr 


Ser 


Arg 


Arg 


Leu 


Ala 


Arg 


Phe 








100 










105 










110 






Tyr 


Lys 


Pro 


Asn 


Thr 


He 


Val 


He 


Ser 


Gly 


Ala 


Asp 


Asn 


Val 


Tyr 


Val 






115 










120 










125 








Thr 


Asp 


Asp 


Gly 


Glu 


Tyr 


Phe 


Val 


Tyr 


Gly 


Asp 


Glu 


Tyr 


Tyr 


Asp 


Asp 




130 










135 










140 










Ala 


Ser 


Ser 


Val 


Asn 


He 


Tyr 


He 


Asn 


Ser 


Pro 


Trp 


J - 


Asp 


Pro 


Phe 


145 










150 










155 










160 


Pro 


Tyr 


Thr 


Ser 


Trp 


Tyr 


Pro 


Ser 


Phe 


Ser 


Gly 


Trp 


Tyr 


Asn 


Tyr 


Thr 








165 










17 0 










175 




Trp 


Asn 


Tyr 


Pro 


Trp 


Phe 


Tyr 


Tyr 


Gly 


Ser 


His 


He 


Gly 


Trp 


Gly 


Gly 








180 










185 










190 






Tyr 


Tyr 


Pro 


Gly 


Tyr 


Asn 


Trp 


Tyr 


Trp 


Ser 


Tyr 


Tyr 


Tyr 


Asp 


Pro 


Phe 






195 










200 










205 








Tyr 


As ri 


Pro 


Tyr 


Gly 


He 


Gly 


net 


Gly 


Trp 


Gly 


Tyr 


Pro 


Tyr 


Gly 


Trp 




210 










215 










220 










Gly 


Ser 


Tyr 


Tyr 


Gly 


Trp 


Gly 


Gly 


Tyr 


Pro 


Gly 


Val 


He 


His 


His 


Tyr 


^ 5 










2 30 










235 










24 0 


His-. 


Hi^ 


Tyr 


Pro 


Lys 


Lys 


Thr 


Tyr 


Ser 


Asn 


Gly 


Gin 


His 


Ser 


Gly 


Ala 










245 










250 










255 




Tyr 


Tyr 


Ser 


Tyr 


Gly 


Arg 


Pro 


Asn 


Arg 


He 


Lys 


Gly 


Gly 


Thr 


Ser 


Gly 


















265 










27 0 






Al3 


Lys 


Leu 


Gly 


Thr 


Gly 


Arg 


Tyr 


Asp 


Arg 


He 


Gin 


Asn 


Ser 


Ser 


Ser 






275 










280 










285 








Gin 


Lys 


Asn 


Lys 


Phe 


Gly 


Leu 


Gin 


Ser 


Asn 


Lys 


Pro 


Asn 


Asn 


Asn 


Leu 




2 90 










295 










300 










Gin 


Asn 


Val 


Lys 


Ser 


Gly 


Arg 


Thr 


Gly 


Arg 


Ala 


Asn 


Arg 


Asp 


Arg 


Asn 


305 










310 










315 










320 


lie 


Glu 


Thr 


Val 


Thr 


Pro 


Asn 


Asn 


Gly 


Gin 


Lys 


Gin 


Asn 


Arg 


Pro 


Val 










325 










330 










335 




Phe 


Gin 


Gin 


Asn 


Gin 


Ser 


Gly 


Asn 


Asp 


Arg 


Pro 


Thr 


Gly 


Arg 


Asn 


He 








3-^0 










345 










350 






Arg 


Ser 


Glu 


Arg 


Gin 


Gly 


Glu 


Asn 


As n 


Asp 


Arg 


Thr 


Phe 


Ser 


Thr 


Pro 






355 










360 










365 








Ser 


Arg 


Ser 


Asn 


Ser 


Asn 


Gly 


Gly 


Phe 


Ser 


Thr 


Pro 


Ser 


Arg 


Ser 


Ser 




370 










375 










380 










Sec 


Gly 


Ser 


Met 


Ser 


Gly 


Gly 


Gly 


Gly 


Arg 


Ser 


Gly 


Arg 


Gly 


Arg 


Asn 


385 










390 










395 










4 00 



(2) IIIFORilATIOM FOR SEQ ID 110:506 



(i) SEQLfEtlCE CHARACTERISTICS: 

(A) LEllGTH: 3 93 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

55 (iii) HYPOTHETICAL: YES 

(vi) ORIGIMAL SOURCE: 

(A) ORGAIIISM: Porphyromonas gingival i£ 

60 (ix) FEATURE: 

<A) 1 1 AT IE /'KEY: misc_feature 

(E) LOCATIOIl 1 . . -398 



(xi) SEQUENCE DESCRIPTIOH: SEQ ID 110:506 



Met Lys Leu He Lys Arg Ser Leu Leu Leu Leu Gly Ala Val Leu Leu 
1 5' 10 15 

He Thr Leu Pro Ala Tyr Ser Gin Asn Asp Asp He Phe Glu Asp Asp 
20 25 30 

70 He Tyr Thr Ser Arg Lys Glu He Arg Lys Gin Asn Gin Val Lys Asp 
35 4 0 4 5 

Trp Gin Asn Gin Glu Asp Gly Tyr Gly Asp Asp Thr Glu Tyr Thr Val 

50 55 60 

Ala Ser Asp Arg Asp He Asp Ala Tyr Asn Arg Arg Asp Gly Gin Ser 
75 65 7 0 7 5 80 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1 J_> 



10 



15 



20 



25 



30 



35 



40 



50 



55 



70 



75 



WO 99/29870 PCT/AU98/01023 

458/490 





Asp 


Gly 






Leu 


Ser 








■'-'j - 


Arg 


p 


Ser 


Thr 


Arg 










85 










90 










95 




Ser 


Ser 


Val 


Pro 


Gl V 


Arg 




Ser 


Arg 


Arg 


Leu 


Ala 




Phe 


T v r 


- 








100 










105 










HO 






Pro 


A5:n 


Thr 


He 


Val 


He 


Ser 


Gly 


Ala 


Asp 


Asn 


Va 1 


Tyr 


Val 


Thr 


Asp 






115 










120 










125 






Asp 


Gly 


Glu 


Tyr 


Phe 


Val 


Tyr 


Gly 


Asp 


Glu 


Tyr 


Tyr 


Asp- 


Asp 


Ala 


Ser 




130 










135 










1^0 










Ser 


Val 


Asn 


He 


Tyr 


He 


As n 


Ser 


Pro 


Tro 


-J 




Pro 


Phe 


Pro 




145 










150 










155 










160 


Thr 


Ser 


Trp 


Tyr 


Pro 


Ser 


Phe 


Ser 


Gl V 


Tro 


^ _i ■'- 


Asn 


T^'r 


Thr 


Tro 


Asn 










1 65 










170 










175 




T\-r 

1 J a. 


Pro 


Tro 


Phe 


Tvr 

i. y ^ 


Tvr 


Gl V 


Ser 


His 


He 


Gl V 


1 i. p 


Gl - 


Gl V 




J. y i. 








1 80 










18 5 










1 90 






Pro 


Gl V 




As n 


I rp 


T^'r 


1 i"p 


S^ r 






rp. . 
I J i 


/\j:P 


Pro 


r ne 


i i r 


As n 






195 










200 










2 fi 5 








Pro 






lie 


y 


M*^ t 




1 rp 




rp. , 


Pro 






Trp 


Gi J 


Se r 




i n 










^ J. J 




















T - -r 


T \.- r 
i y L 


G I " 


1 1'p 


J. j 






P JL O 




V a 1 


Tic 

ire 


n r s 


U - c- 

n X £? 




His 


His 


IOC 










2 30 










o ::j c. 












rp. , 


Pro 


i->j ^ 


L* ":f 


Thr 


■I J ^ 


Se r 


Asn 




Gin 


His 




Gl ' ' 


Al a 




^ J i- 










2 4 5 










250 










2 55 




Ser 






A r <9 


Pro 


Asn 


Arg 


He 




'or ^ 


'J J. J- 


Thr 


S-- 
er 


ur J 


Al a 


-L-i S 








2 60 










2 65 










"^7 0 






L«u 


Gl \' 


Thr 


G 1 \- 


Arg 




Asp 


Arg 


H 




As n 


S^* r 


S *=■ r 


S *=" r 


Gin 


Jjj' s 
















2 S i'' 










^ o ^ 








n 




Phe 


G 1 • 


Leu 


Gl n 


Ser 




L' 's 


Pro 


Asn 


Asn 


As n 


eu 


Gin 


sn 




2 90 










2 95 










3 no 










Val 


Lys 


Ser 


Gly 


Arg 


Thr 


Gly 


Arg 


Ala 


Asn 


Arg 


Asp 


Arg 


Asn 


He 


Glu 


3 05 










310 










315 










320 


Thr 


Val 


Thr 


Pro 


Asn 


Asn 


Gly 


Gin 


Lys 


Gin 


Asn 


Arg 


Pro 


Val 


Phe 


Gin 










325 










330 










335 




Gin 


Asn 


Gin 


Ser 


Gly 


Asn 


Asp 


Arg 


Pro 


Thr 


'31 y 


Arg 


Asn 


He 


Arg 


Ser 








3^10 










345 










350 






Glu 


Arg 


Gin 


Gly 


Glu 


Asn 


Asn 


Asp 


Arg 


Thr 


Phe 


Ser 


Thr 


Pro 


Ser 


Arg 






355 










360 










365 








Ser 


Asn 


Ser 


Asn 


Gly 


Gly 


Phe 


Ser 


Thr 


Pro 


Ser 


Arg 


Ser 


Ser 


Ser 


Gly 




37 0 










375 










380 










Ser 


Met 


Ser 


Gly 


Gly 


Gly 


Gly 


Arg 


Ser 


Gly 


Arg 


Gly 


Arg 


Asn 






385 










390 










395 













(2) IlIFORMATIOtl FOR SEQ ID [40:507 



(i) SEQUEtlCE CHARACTERISTICS: 
45 (A) LEHGTH: 581 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) IIOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIUAL SOURCE: 

(A) ORGAIIISI'I: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAIIE/KEY: misc_feature 
(E) LOCATIOII 1 ... 581 

60 (Mi) SEQUEUCE DESCRIPTION: SEQ ID HO: 507 



65 



net 


He 


Arg 


Lys 


Leu 


He 


Leu 


Leu 


Leu 


Ala 


Leu 


Met 


Pro 


Val 


Aia 


Ser 


1 








5 










10 










15 




Val 


Ala 


Phe 


Ala 


Val 


Pro 


Thr 


Asp 


Ser 


Thr 


Glu 


Ser 


Lys 


Asp 


Asn 


Arg 








2 0 










25 










30 






He 


Leu 


Thr 


Ser 


Met 


Gin 


Ser 


Ser 


Ser 


Leu 


Asn 


Arg 


Asp 


Asp 


Ala 


Pro 






35 




















45 








Asp 


Lys 


Trp 


Gl n 


Pro 


Met 


His 


Ala 


Asn 


Phe 


Ser 


He 


Gin 


Ser 


Asp 


Met 




50 










55 










60 








Leu 


Leu 


Ser 


Thr 


Ala 


Gin 


Lys 


Ser 


Lys 


Asn 


Thr 


Trp 


Phe 


Gly 


Asn 


Ser 


65 










70 










75 






80 




He 


I let 


Gly 


He 


He 


Lys 


Asn 


Asn 


Tyr 


Leu 


Glu 


Phe 


Gly 


Ala 


Arg 










85 










90 










95 




Phe 


Glu 


Asp 


Leu 


Tyr 


Lys 


Pro 


Leu 


Pro 


Gly 


His 


Glu 


Pro 


Glu 


Met 


Gly 








1 00 










105 








110 
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Arg 


Gly 


Val 


Pro 


His 


Met 


Tyr 


Val 


Lys 


Gly 


Ser 


Tyr 


His 


Trp 


Ala 


Glu 


ai5 










120 










125 








Leu 


Thr 


net 


Giy 


Asp 


Phe 


Tyr 


Asp 


Gin 


Phe 


Gly 


Ser 


Gly 


Met 


Val 


Phe 




130 










135 










14 0 










Arg 


Thr 


Tyr 


Glu 


Gl u 


Arg 


As n 


Leu 


Gly 


He 


Asp 


Asn 


Ala 


Val 


Arg 


Gly 










150 










155 










160 


Gl y 


Arg 


He 


Val 


Leu 


Thr 


Pro 


Phe 


Asp 


Gly 


Val 


Arg 


Val 


Lys 


Gly 


He 






165 










170 










175 




Ala 


Gly 


Gin 


Gin 


Arg 


Asn 


Tyr 


Phe 


Asp 


Arg 


Thr 


Gly 


Lys 


Val 


Phe 


Asn 








180 










185 










190 






Ser 


Gly 


Arg 


Giy 


Tyr 


Leu 


Leu 


Gly 


Ser 


Asp 


Leu 


Glu 


Leu 


Asn 


Val 


Glu 






195 










200 










205 








Arg 


Trp 


Ser 


Ser 


Ala 


Met 


Arg 


Asp 


Asn 


Asp 


Tyr 


His 


Leu 


Ala 


He 


Gly 




210 










215 










22 0 










Gly 


Ser 


Phe 


Val 


Ser 


Lys 


His 


Glu 


Ala 


Asp 


Glu 


Asp 


He 


Phe 


Val 


Gly 


225 










2 30 










235 










2 40 


Val 


Gly 


Glu 


Asp 


Arg 


Lys 


Arg 


Leu 


Asn 


Leu 


Pro 


Leu 


Asn 


Val 


Pro 


He 










245 










250 










2 55 




net 


Gly 


Leu 


Arg 


Thr 


Asn 


Phe 


Gin 


Lys 


G 1 y 


Gly 


Leu 


Ala 


Leu 


Tyr 


Ala 








260 










265 










27 0 






Gl u 


Tyr 


Gly 


Tyr 


Lys 


Tyr 


Asn 


Asp 


Pro 


Ser 


Ala 


Asp 


Asn 


Asp 


Tyr 


He 




275 










280 










205 








Tyr 


His 


Asp 


Gly 


Gin 


Ala 


Ala 


Leu 


Leu 


Ser 


Ala 


Ser 


Tyr 


Ser 


Lys 


Lys 




290 








295 










300 










Gly 


Met 


Ser 


He 


Leu 


Leu 


Gin 


Ala 


Lys 


Arg 


Cys 


Glu 


Asn 


Phe 


Ala 


Phe 


305 










310 










315 










320 


Arg 


Ser 


Ly^ 


Arg 


Ser 


Ala 


Gin 


Leu 


Thr 


Pro 


Leu 


Met 


He 


Asn 


Tyr 


Met 








325 










330 










335 




fro 


Ala 


Phe 


Tlir 


Gin 


Ala 


His 


Thr 


Tyr 


Thc 


Leu 


Ala 


Ala 


He 


Tyr 


Pro 








340 










345 










350 






Tyr 


Alu 


Thr 


Gin 


Pro 


Gin 


Gly 


Glu 


Trp 


Ala 


Phe 


Gl ri 


Gly 


Glu 


Leu 


Arg 






355 










360 










365 








Tyr 


Asn 


Phe 


Ala 


Arg 


Arg 


Thr 


Ala 


Leu 


Gly 


Gly 


Arg 


Tyr 


Gly 


Thr 


Gly 




37 0 










375 










380 










Leu 


Arg 


J le 


Asn 


Val 


Ser 


His 


Val 


Arg 


Gly 


Leu 


Asp 


Lys 


Lys 


Met 


Leu 


385 










390 










395 










4 00 


hys 


Glu 


Asn 


Pro 


Asp 


Glu 


Leu 


He 


Gly 


Thr 


Asp 


Gly 


Tyr 


Thr 


Val 


Ser 








405 










410 










415 




Phe 


Phe 


Gly 


Met 


Gly 


Asp 


Leu 


Tyr 


Tyr 


Ser 


Asp 


He 


Asp 


Val 


Glu 


He 








420 










425 










430 






Thr 


hys 


Lys 


Val 


Ser 


Pro 


Gly 


Phe 


Asn 


Phe 


Thr 


Leu 


Thr 


Tyr 


Leu 


Asn 






435 










440 










445 








Gin 


lie 


Tyr 


Asn 


Asn 


Lys 


Val 


Leu 


His 


Gly 


Ala 


Ala 


Gly 


Glu 


Lys 


Pro 




4 50 










455 










460 










Glu 


Lys 


He 


Tyr 


Ala 


Asn 


He 


Phe 


Val 


Tyr 


Asp 


Gly 


Lys 


Tyr 


Lys 


Leu 


4 65 










470 










4 75 










4 SO 


Ser 


Asn 


Lys 


V^il 


Ala 


Leu 


Arg 


Thr 


Glu 


Leu 


Gin 


Tyr 


Leu 


His 


Thr 


Lys 










4 85 










A 90 










4 95 




Gin 


A.-P 


Gin 


Gly 


Asp 


Trp 


He 


Tyr 


Gly 


!Iet 


Ala 


Glu 


Leu 


Ser 


He 


Leu 








500 










505 










510 






Pro 


Ser 


Leu 


Met 


Leu 


Ser 


Leu 


Ser 


Glu 


Gin 


Tyr 


Asn 


He 


Gly 


Glu 


Thr 






515 










520 










525 








hys 


hys 


Hi 3 


Tyr 


Val 


Met 


Gly 


Ser 


Val 


Thr 


Tyr 


Thr 


His 


Gly 


Ala 


His 


530 










535 










54 0 










Arg 


Val 


Al a 


Phe 


Ser 


Ala 


Gly 


Lys 


Thr 


Arg 


Ala 


Gly 


Met 


Asn 


Cys 


Ser 


545 










550 










555 










560 


Gly 


Gly 


Val 


Cys 


Arg 


Val 


Val 


Pro 


Glu 


Thr 


Gin 


Gly 


Phe 


Tyr 


Leu 


Ser 










565 










570 










57 5 




Tyr 


Ser 


Thr 


Asn 


Leu 






























580 



























(2) I H FORMAT I on FOR SEQ ID HO: 508 



(i) SEQUEIICE CHARACTERISTICS: 
65 (A) LEHGTH: 239 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 



(ii) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(vi) ORIGIllAL SOURCE: 

(A) CRGAHISM: Porphyromonas gingivalif 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID; <WO 9929S70A1_ 



55 



60 



65 



70 



75 



WO 99/29870 PCT/AU98/01023 

460/490 

(ix) FEATURE: 

(A) IJAIJE/KEV: misc_feature 
(B ) LOCATIOII 1 . . . 239 

(xi) SEQUEMCE DESCRIPTION: SEQ ID 110:508 



Met 


Arg 


Ser 


Leu 


Phe 


Leu 


Ser 


Ala 


Leu 


Arg 


Ser 


Ser 


Ser 


Leu 


His 


Gly 


1 








5 










1 0 " 










15 


Ser 


Glu 


Arg 


Arg 


Ser 


Arg 


He 


Ser 


Ser 


Ser 


Val 


Val 


Ilet 


Ser 


He 


Arg 








20 










25 










30 






Gin 


Li- 


He 


Arg 


Leu 


Phe 


His 


Leu 


Ser 


Val 


Cys 


Ala 


Gin 


Thr 


His 


Asp 






35 










AO 










45 








His 


Leu 


He 


Gl u 


He 


His 


Leu 


Val 


Cys 


He 


Glu 


Phe 


Gly 


Ala 


He 


Asp 




50 










55 










60 










Thr 


Asp 


Glu 


Phe 


Arg 


Leu 


Ser 


Ser 


His 


Ala 


Tyr 


Thr 


Thr 


Ser 


Pro 


Thr 


65 










70 










75 










80 


His 


Thr 


Gly 


Ala 


He 


His 


His 


Asn 


Cys 


He 


Gl U 


Arg 


Ser 


Tyr 


Gly 


Arg 










85 










90 










95 




Tyr 


L€-U 


Val 


Thr 


Phe 


Gly 


Gin 


Glu 


Arg 


Asn 


Glu 


Leu 


His 


His 


His 


Ser 








1 0 0 










105 










110 






Arg 


Pro 


Asp 


Arg 


Asn 


Ala 


Glu 


Val 


Tyr 


Arg 


Phe 


Pro 


Phe 


Asp 


As n 


Ala 






lis 










120 










125 








The 


His 


Ser 


He 


Arg 


Tyr 


Glu 


Ala 


Phe 


Arg 


Pro 


He 


Arg 


Pro 


He 


Val 




1 30 










135 










140 












His 


Asp 


Asp 


His 


Phe 


He 


Ala 


He 


Gly 


Ser 


His 


Leu 


Phe 


Phe 


Lys 


1^]5 










150 










155 










160 


Asp 


Asn 


Gin 


He 


Phe 


Ser 


Ser 


Gly 


Ser 


Gin 


Tyr 


Asp 


Asn 


Tyr 


Thr 


Va 1 










165 










17 0 










175 




Ala 


Cys 


Phe 


Yal 


Glu 


Ser 


Leu 


His 


Asp 


Arg 


Gl u 


Gin 


Arg 


Ser 


His 


Thr 








180 










185 










190 






His 


Thr 


Ala 


Ser 


Gly 


Thr 


Asn 


His 


cys 


Ala 


Asp 


Leu 


Phe 


Asp 


Met 


Arg 






195 










200 










205 








Thr 


Leu 


Ser 


Gin 


Arg 


Thr 


Tyr 


His 


He 


Arg 


Asp 


He 


Val 


Ala 


Asp 


Phe 




210 










215 










22 0 










Glu 


Phe 


Gly 


Gin 


Phe 


Leu 


Gly 


Arg 


Phe 


Ala 


Mis 


Gly 


Leu 


Asn 


His 




2 2 5 










230 










2 35 













10 



15 



20 



25 



30 



35 



(2) IDPORi^ATIOH FOR SEQ ID NO: 50 9 

40 (i) SEQUE!JCE CHARACTERISTICS: 

(A) LEMGTH: 211 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

45 (ii) IIOLECULE TYPE: protein 

(ill! H V POT H ET I C AL : Y E S 

{ V i ) OR I G 1 1 1 AL SOU RCE : 
50 (A) ORGAIIIS/I: Po rphyromonas gingival is 

(ix} FEATURE: 

(A) IIAIIE/KEY: misc_feature 

(B) LOCATION 1 . , . 211 



{xi} SEQUENCE DESCRIPTION: SEQ ID NO: 50 9 



llet 


Ser 


He 


Arg 


Gin 


Lys 


He 


Arg 


Leu 


Phe 


His 


Leu 


Ser 


Val 


Cys 


Ala 


1 








5 










10 










15 




Gin 


Thr 


His 


Asp 
2 0 


His 


Leu 


He 


Glu 


He 

25 


His 


Leu 


Val 


Cys 


He 

30 


Glu 


Phe 


Gly 


Ala 


He 
3 5 


Asp 


Thr 


Asp 


Glu 


Phe 
^0 


Arg 


Leu 


Ser 


Ser 


His 
45 


Ala 


Tyr 


Thr 


Thr 


Ser 


Pro 


Thr 


His 


Thr 


Gly 


Ala 


He 


His 


His 


Asn 


Cys 


He 


Glu 


Arg 




50 










55 










60 






Ser 


Tyr 


Gly 


Arg 


Tyr 


Leu 


Val 


Thr 


Phe 


Gly 


Gin 


Glu 


Arg 


Asn 


Glu 


Leu 


65 










70 










75 










80 


His 


His 


His 


Ser 


Arg 

85 


Pro 


Asp 


Arg 


Asn 


Ala 

90 


Glu 


Val 


Tyr 


Arg 


Phe 
95 


Pro 


Phe 


Asp 


Asn 


Ala 

100 


Phe 


His 


Ser 


He 


Arg 
105 


Tyr 


Gl u 


Ala 


Phe 


Arg 
110 


Pro 


He 


Arg 


Pro 


He 


Val 


Cys 


His 


Asp 


Asp 


His 


Phe 


He 


Ala 


He 


Gly 


Ser 


His 






115 










120 










125 






Leu 


Phe 


Phe 


Lys 


Asp 


Asn 


Gin 


He 


Phe 


Ser 


Ser 


Gly 


Ser 


Gin 


Tyr 


Asp 




130 










135 










1^0 









SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929B70A1_I_> 
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461/490 





Tyr 


Thr 


Val 


Ala 


Cys 


Fhe 


Val 


Glu 


Ser 


Leu 


His 


Asp 


Arg 


Glu 


Gin 


1-15 










150 










155 










160 


Arcj 


Ser 


His 


Thr 


His 


Thr 


Ala 


Ser 


Gly 


Thr 


Asn 


His 


Cys 


Ala 


Asp 


Leu 








165 










170 










175 




Phe 


Asp 


Met 


Arg 
180 


Thr 


Leu 


Ser 


Gin 


Arg 
185 


Thr 


Tyr 


His 


He 


Arg 
190 


Asp 


He 


Val 


Ala 


Asp 
195 


Phe 


Glu 


Fhe 


Gly 


Gin 
200 


Fhe 


Leu 


Gly 


Arg 


Phe 
205 


Ala 


His 


Gly 


Leu 


Asn 
210 


His 





























(2} lilFORUATIOtl FOR SEQ ID MO: 510 



(i) SEQUEIICE CHARACTERISTICS: 
15 (A) LENGTH: 781 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

{vi} ORIGINAL SOURCE: 

(A) ORGAIIIStI: Porphyromonas gingival is 



(i::) FEATURE: 

(A) MAI IE/ KEY: n\isc_f eature 

(B) LOCATION I ... 781 

30 (::i) SEQUEMCE DESCRI PTIOtl : SEQ ID HO: 510 





I!et 


Asp 


Arg 


Pro 


Lys 


Pro 


Ser 


Tyr 


He 


Val 


Arg 


He 


Ala 


Ala 


He 


Leu 




1 


















10 










15 






Cys 


Leu 


Phe 


Val 


Gly 


Arg 


Pro 


Leu 


Phe 


Ala 


Gin 


Ser 


Tyr 


Val 


Asp 


Tyr 


35 








20 










25 










30 








Val 


Asp 


Pro 


Leu 


He 


Gly 


Thr 


Leu 


Ser 


Ser 


Phe 


Glu 


Leu 


Ser 


Ala 


Gly 






35 










AO 










^5 










Asn 


Thr 


Tyr 


Pro 


Val 


He 


Gly 


Leu 


Pro 


Trp 


Gly 


Met 


Asn 


Ser 


Trp 


Thr 






50 










55 










60 










40 


Pro 


Met 


Thr 


Gly 


Val 


Pro 


Gly 


Asp 


Gly 


Trp 


Gin 


Tyr 


Thr 


Tyr 


Ser 


Ala 




65 








70 










75 










80 




His 


Lys 


He 


Arg 


Gly 


Phe 


Lys 


Gin 


Thr 


His 


Gin 


Pro 


Ser 


Pro 


Trp 


He 










85 










90 










95 






Asn 


Asp 


Tyr 


Gly 


Gin 


Phe 


Ser 


Leu 


Leu 


Pro 


Leu 


Thr 


Ala 


Pro 


Gin 


Lys 


45 








1 00 










105 










110 








Pro 


Ser 


Ser 


Asn 


Asp 


Ser 


He 


Ala 


Leu 


Thr 


Lys 


Trp 


Cys 


Lys 


Gin 


Leu 








115 










120 










125 










Piie 


Ser 


Asp 


Glu 


Gin 


Thr 


Ser 


Trp 


Phe 


Ser 


His 


Lys 


Ala 


Gl u 


Thr 


Ala 






130 








135 










1 -I 0 










50 


Thr 


Pro 


Tyr 


Tyr 


Tyr 


Ser 


Val 


Tyr 


Leu 


Ala 


Asp 


Tyr 


Asp 


Thr 


Arg 


Val 




1^5 










150 










155 










160 




Glu 


Met 


Ala 


Pro 


Thr 


Glu 


Arg 


Ala 


Ala 


He 


Phe 


Arg 


He 


Arg 


Tyr 


Ser 












165 










17 0 










175 






Gly 


Asn 


Thr 


Glu 


Ser 


Gly 


Ser 


Gly 


Arg 


Trp 


Leu 


Arg 


Leu 


Asp 


Ala 




55 






180 










185 










190 








Tlir 


Gly 


Gly 


Ser 


Glu 


He 


Ser 


He 


Val 


Asp 


Pro 


His 


Thr 


Val 


Val 


Gly 








195 










200 










205 










He 


Ser 


Arg 


Lys 


Asn 


Ser 


Gly 


Gly 


Val 


Pro 


Ala 


Asn 


Phe 


Ala 


Cys 


Tyr 


60 




210 










215 










220 










Pl-.e 


He 


Leu 


Gin 


Ser 


Asp 


Thr 


Pro 


Met 


Ala 


Asp 


Val 


Leu 


Leu 


Glu 


Thr 














2 30 










235 










240 




Asp 


Thr 


01 y 


Lys 


Ser 


Asp 


Glu 


Gly 


Thr 


A.rg 


Ala 


Trp 


Ala 


Ala 


Cys 


Arg 












2-15 










250 










255 






Phe 


Asp 


Ser 


Gin 


Glu 


Val 


Thr 


Val 


AL'g 


Val 


Ala 


Ser 


Ser 


Phe 


He 


Ser 


65 






260 










265 










270 








Val 


Glu 


Gin 


Ala 


Glu 


Arg 


Asn 


Leu 


Ala 


Glu 


Val 


Lys 


Gly 


Gin 


Ser 


Phe 








275 










280 










285 










Asp 


Arg 


He 


Arg 


Leu 


Ala 


Gly 


Arg 


Glu 


Al a 


Trp 


Asn 


Lys 


Val 


Leu 


Gly 




2 90 










295 










300 










70 


Arg 


He 


His 


Val 


Glu 


Gly 


Gly 


Thr 


Lys 


Asp 


Glu 


Arg 


Thr 


Thr 


Phe 


Tyr 




305 










310 










315 










320 




Ser 


Ala 


Leu 


Tyr 


Arg 


Cys 


Leu 


Leu 


Phe 


Pro 


Arg 


Arg 


Phe 


Tyr 


Glu 


Glu 












325 










330 










335 






Asp 


Ala 


Ser 


Gly 


Asn 


Phe 


Val 


His 


Tyr 


Ser 


Pro 


Tyr 


Asn 


Gly 


Glu 


Val 


75 






3-10 










3*J5 










350 







BNSDOCID: <WO 9929870A1_L> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



75 



Leu 


Pro 


Gl y 


Tyr 


Leu 


Tyr 


Thr 


Asp 


Thr 


Gly 


Phe 


Trp 


Asp 


Thr 


Phe 


Arg 






355 










360 










365 








Ala 


Leu 


Phe 


Pro 


Leu 


Leu 


Asn 


Leu 


Leu 


Tvr 


Pro 


ASD 


Glu 


Asn 


He 


Lvs 




370 










37 5 










380 










lie 


Gin 


Glu 


Gly 


Leu 


Leu 


Asn 


Val 


Tyr 


Arg 


Glu 


Ser 


Gly 


Phe 


Phe 


Pro 


385 










390 










395 










400 


Glu 


Trp 


Ala 


Ser 


Pro 


Gly 


His 


Arg 


Asp 


Cys 


Met 


He 


Gly 


Asn 


Asn 


Ser 










405 










410 










415 




Al a 


Ser 


Val 


Leu 


Ala 


Asp 


Ala 


Tvr 


Leu 


Lvs 


Gl V 


Val 


Arg 


Val 


Glu 


Asp 








420 










425 










4 30 






Thr 


Arg 


Thr 


Leu 


Met 


Asn 


Gl V 


Leu 


Leu 


His 


Ala 


Thr 


L^''=^ 

Uj — 


Ala 


Val 


His 






4 35 










4 4 0 










4 4 5 








Pro 


Lys 


lie 


Ser 


Ser 


Thr 


Gl^' 


Arg 




Gl V 


T rn 


Gl u 




T\"r 
i y i. 


Asn 


Ser 




4 50 










4 55 










4 60 










Leu 


Gly 




Val 


Pro 


Ala 


Asp 


Ala 


Gly 


He 


Asp 


Glu 


Ser 


Ala 


Ala 


Ara 


4 65 










470 










475 










4 80 


Thr 


Leu 


Glu 


Tyr 


Ala 


Tyr 


Asn 


ASD 


Trn 


Cvs 


He 


Leu 


Ara 


Leu 


Gl V 


Arg 










485 










4 90 










4 95 




Thr 


Leu 


Gl-y 


Trp 


Asp 


Arq 


Ala 


Ala 


Leu 


ASD 


Thr 


Leu 


Al a 


His 


Arg 


Ser 








500 










505 










51 0 






I let 


Asn 


Tyr 


Arg 


H is 


Leu 


Phe 


ASD 


Pro 


Gl u 


Thr 




Leu 


Met 


Arg 


Gl V 






515 










520 










52 5 








Arg 


A en 


G 1 n 


Asp 


Gly 


Ser 


Phe 


Arg 


Thr 


Pro 


Phe 


Ser 


Pro 


Phe 


Lys 


Trp 




530 










535 










54 0 










Giy 


Asp 


Val 


Plie 


Thr 


Glu 


Gly 


Asn 


Ala 


Trp 


His 


Tyr 


Thr 


Trp 


Ser 


Val 












550 










555 










560 


Phe 


His 


Asp 


Va 1 


Gin 


Gly 


Leu 


lie 


Asp 


Leu 


Met 


Gly 


Gly 


Asp 


Arg 


Pro 










565 










570 










575 




Phe 


Vai 


Ser 


llet 


Leu 


Asp 


Ser 


Val 


Phe 


As n 


Thr 


Pro 


Pro 


Met 


Phe 


Asp 








580 










585 










590 






Giu 


Ser 


Tyr 


Tyr 


Gly 


Phe 


Val 


lie 


His 


Glu 


He 


Arg 


Glu 


Met 


Gin 


He 






595 










600 










605 








A3 a 


Asp 


I iet 




Asn 


Tvr 


Ala 


His 




Asn 


Gin 


Pro 


He 


Gin 


His 


Met 




610 










61 5 










620 










lie 


T V r 


Leu 


T vr 


Asn 


His 


Al a 


'jj. y 


His 


ro 


1 rp 




Al a 


Gl n 


u u 


Arg 


625 










630 










635 










64 0 


Leu 


Arg 


Glu 


Val 


Met 


Gl V 


Arg 


L^u 


i r 


Arg 


Pro 


Thr 




MSp 




Tyr 










64 5 










650 










655 




Cys 


Gly 


Asp 


Glu 


Asn 


As n 


Gl V 


Gin 


Thr 


Ser 


Al a 


T rn 


T^' r 


Val 


Phe 


Ser 








660 










665 










67 0 






Ala 


Leu 


Gl V 


Phe 


T vr 

J. jf i_ 


Pro 


Val 


Tlir 


Pro 


Ala 


Thr 


rt^ p 


Gln 




Va 1 


L=u 






67 5 










680 










6 85 






Gly 


Ser 


Pro 


He 


Phe 


Ser 


Lys 


Val 


He 


Leu 


Ser 


Phe 


Pro 


Asp 


Gly 


His 




690 










695 










700 










Lys 


Thr 


Val 


Leu 


His 


Ala 


Pro 


Ala 


Asn 


Ser 


Al a 


Asp 


Thr 


Pro 


Tyr 


He 


705 










710 










715 








720 


Arg 


Ser 


lie 


Ser 


Val 


Glu 


Gly 


Lys 


Glu 


Trp 


Ser 


C\'s 


Asn 


Tyr 


Leu 


Thr 










725 










730 










735 




His 


Glu 


Gin 


Leu 


Arg 


Ser 


Ser 


Ala 


Ser 


He 


Gin 


Trp 


Met 


Met 


Asp 


Thr 








740 










745 










7 50 






Lys 


Pro 


Asn 


Tyr 


Asn 


Arg 


Gly 


Met 


Lys 


Glu 


Ser 


Asp 


Arg 


Pro 


Tyr 


Ser 






755 










760 










765 








Phe 


Ser 


Thr 


Glu 


Gin 


Gin 


Arg 


Arg 


Ala 


Asn 


His 


Ser 


Asn 










770 










775 










7G0 











(2) It (FORMAT ION FOR SEQ ID IJO: 511 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

65 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

70 (ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATIOIi 1. . .271 



Cci) SEQUENCE DESCRIPTION: SEQ ID HO: 511 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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35 



55 



60 



65 



70 



75 
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Met 


Met 


hys 


Ser 


IJet 


Arg 


Ser 


Val 


Leu 


Leu 


Leu 


Leu 


Phe 


Pro 


Leu 


Ser 




1 








5 










10 










15 






Leu 


He 


Thr 


Ala 

20 


Leu 


Gly 


Cys 


Ser 


Asn 
25 


Asn 


Lys 


Ala 


Ala 


Glu 
30 


Ser 


Lys 


5 


Ser 


Val 


Ser 
35 


Phe 


Asp 


Ser 


Ala 


Tyr 
40 


Leu 


Glu 


Arg 


Tyr 


He 
4 5 


Pro 


Leu 


Arg 




Al a 


Asp 
50 


He 


Asp 


Thr 


Pro 


Ser 
55 


Leu 


His 


Val 


Met 


He 

60 


Ser 


Tyr 


Val 


Tyr 




Pro 


Ser 


Gly 


Asp 


Asp 


Met 


Leu 


Thr 


Glu 


He 


Phe 


Asn 


Gly 


Leu 


Leu 


Phe 


10 


65 






70 










75 










80 




Gly 


Asp 


Ser 


Leu 


Met 


Asp 


Ser 


Ser 


Ser 


Pro 


Glu 


Asn 


Ala 


Met 


Glu 


Gly 










85 










90 










95 






Tyr 


Ala 


Gin 


Met 
100 


Leu 


Gly 


Glu 


Asp 


Tyr 
105 


Arg 


Ser 


Asn 


Asn 


Ala 
110 


Glu 


Ala 


15 


As n 


Leu 


Gin 
115 


Gly 


Leu 


Pro 


Ser 


Asp 
120 


Leu 


Leu 


Asp 


Tyr 


He 
125 


Tyr 


Lys 


Gin 




Glu 


A^n 
130 


Thr 


He 


Ala 


Tyr 


Cys 
135 


Asp 


Thr 


Gly 


Leu 


He 
140 


Ser 


Thr 


Arg 


He 




As I ) 


Thr 


Tyr 


Thr 


Tyr 


Glu 


Gly 


Gly 


AlEi 


His 


Thr 


Gl u 


Asn 


Thr 


Val 


Arg 


20 


1^5 










150 










155 










160 




Phe 


Ala 


Asn 


He 


Leu 
165 


Arg 


Thr 


Thr 


Gly 


Lys 
1 7 0 


Val 


Leu 


Glu 


Glu 


Arg 
175 


Asp 




He 


Phe 


Lys 


He 
180 


Asp 


Tyr 


Ala 


Glu 


Arg 
105 


Leu 


Ser 


Ala 


Leu 


He 
190 


He 


Gly 


25 


Gin 


Leu 


Val 
1?5 


His 


Asp 


Phe 


Gly 


Lys 
2 00 


Thr 


T»ir 


Pro 


Ala 


Gl u 

205 


Leu 


Asp 


Ala 




He 


Gly 
210 


Phe 


Phe 


Asn 


Ala 


Glu 
215 


Glu 


He 


Gin 


Pro 


Asn 
220 


Gly 


Asn 


Phe 


Met 




He 


Asp 


Asp 


Lys 


Gly 


Leu 


Thr 


Tyr 


Cys 


Phe 


Asn 


Glu 


Tyr 


Gin 


He 


Ala 


30 


225 








230 










235 










240 




Ala 


Tyr 


Ala 


Arg 


Gly 
245 


Ala 


Val 


Tyr 


Val 


Arg 
2 50 


Leu 


Gly 


Tyr 


Asp 


Val 
255 


Leu 




Ala 


Pro 


Leu 


Leu 
260 


Arg 


Asp 


Asp 


Ser 


Pro 
265 


Leu 


Lys 


Arg 


Tyr 


Leu 

270 


Pro 





(2) It] FORMATION FOR SEQ ID HO: 512 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

45 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

50 (i:-. ) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION 1 ... 270 



(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 512 



Met 


Lys 


Ser 


Met 


Arg 


Ser 


Val 


Leu 


Leu 


Leu 


Leu 


Phe 


Pro 


Leu 


Ser 


Leu 


1 








5 










10 










15 




He 


Thr 


Ala 


Leu 


Gly 


Cys 


Ser 


Asn 


Asn 


Lys 


Ala 


Ala 


Glu 


Ser 


Lys 


Ser 








20 










25 










30 






Val 


Ser 


Phe 


Asp 


Ser 


Ala 


Tyr 


Leu 


Glu 


Arg 


Tyr 


He 


Pro 


Leu 


Arg 


Ala 






35 










40 










45 








Asp 


He 


Asp 


Thr 


Pro 


Ser 


Leu 


His 


Val 


Met 


He 


Ser 


Tyr 


Val 


Tyr 


Pro 




50 










55 










60 










Ser 


Gly 


Asp 


Asp 


Met 


Leu 


Thr 


Glu 


He 


Phe 


Asn 


Gly 


Leu 


Leu 


Phe 


Gly 


65 










70 










75 










80 


Asp 


Ser 


Leu 


Met 


Asp 


Ser 


Ser 


Ser 


Pro 


Glu 


Asn 


Ala 


Met 


Glu 


Gly 


Tyr 










85 










90 










95 




Ala 


Gin 


Met 


Leu 


Gly 


Glu 


Asp 


Tyr 


Arg 


Ser 


Asn 


Asn 


Ala 


Glu 


Ala 


Asn 








100 










105 










110 






Leu 


Gin 


Gly 


Leu 


Pro 


Ser 


Asp 


Leu 


Leu 


Asp 


Tyr 


He 


Tyr 


Lys 


Gin 


Glu 






115 










120 










125 








Asn 


Thr 


He 


Ala 


Tyr 


Cys 


Asp 


Thr 


Gly 


Leu 


He 


Ser 


Thr 


Arg 


He 


Asn 




130 










135 










140 










Thr 


Tyr 


Thr 


Tyr 


Glu 


Gly 


Gly 


Ala 


His 


Thr 


Glu 


Asn 


Thr 


Val 


Arg 


Phe 


145 










150 










155 










1 60 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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10 



15 



35 



40 



45 



50 



55 



60 



65 



70 



Ala 


Asn 


lie 


Leu 


Arg 

J. O J 


Thr 


Thr 


Gly 


Lys 


Val 
J. / u 


Leu 


Glu 


Glu 


Arg 


Asp 
17 5 


He 


Phe 


Lys 


He 


Asp 
180 


Tyr 


Ala 


Glu 


Arg 


Leu 
185 


Ser 


Ala 


Leu 


He 


He 
ISO 


Gly 


Gin 


Leu 


Val 


His 
195 


Asp 


Phe 


Gly 


Lys 


Thr 

200 


Thr 


Pro 


AI a 


Glu 


Leu 

205 


Asp 


Ala 


He 


Gly 


Phe 
210 


Phe 


Asn 


Ala 


Glu 


Glu 
215 


He 


Gin 


Pro 


Asn 


Gly 
220 


Asn 


Phe 


Met 


He 


A^rp 


A^P 


Lys 


Gly 


Leu 


Thr 


Tyr 


Cys 


Phe 


Asn 


Glu 


Tyr 


Gin 


He 


Ala 


Ala 


225 










230 










235 










2 .1 0 


Tyr 


Ala 


Arg 


Gly 


Ala 
245 


Val 


Tyr 


Val 


Arg 


Leu 

2 50 


Gly 


Tyr 


Asp 


Val 


Leu 

255 


Ala 


Pro 


Leu 


Leu 


Arg 

2 60 


Asp 


Asp 


Ser 


Pro 


Leu 
265 


Lys 


Arg 


Tyr 


Leu 


Pro 
27 0 







(2) IMFORMATIOII FOR SEQ ID NO: 51 3 



(i) SEQUEIiCE CHARACTERISTICS: 

(A) LE?JGTH: 2 67 amino acids 
20 (B) TYPE: amino acid 

{ D ) TOPOLOG V : linear 

(ii) HOLECULE TYPE: protein 

25 (iii) HYPOTHETICAL: YES 

(vi) ORIGUIAL SOURCE: 

(A) ORGAMISIl: Porphyromonas gingiva li^ 

30 {ix) FEATURE: 

(A) MAIIE/KEY: misc_feature 

(B) LOCATIOI! 1 . - . 267 



(:-:i) SEQUEIJCE DESCRIPTIOll: SEQ ID MO: 513 



I let 


Arg 


Ser 


Val 


Leu 


Leu 


Leu 


Leu 


Phe 


Pro 


Leu 


Ser 


Leu 


He 


Thr 


AI a 


1 








5 










10 










15 




Leu 


Gly 


Cys 


Ser 


Asn 


Asn 


Lys 


Ala 


Ala 


Glu 


Ser 


Lys 


Ser 


Val 


Ser 


Phe 








20 










25 










30 






Asp 


Ser 


Ala 


Tyr 


Leu 


Glu 


Arg 


Tyr 


He 


Pro 


Leu 


Arg 


Ala 


Asp 


He 


Asp 






35 










4 0 










45 








Thr 


Pro 


Ser 


Leu 


His 


Val 


Met 


He 


Ser 


Tyr 


Val 


Tyr 


Pro 


Ser 


Gly 


Asp 




50 










55 










60 










Asp 


Ilet 


Leu 


Thr 


Glu 


He 


Phe 


Asn 


Gly 


Leu 


Leu 


Phe 


Gly 


Asp 


Ser 


Leu 


65 










70 










75 










80 


Met 


Asp 


Ser 


Ser 


Ser 


Pro 


Glu 


Asn 


Ala 


Met 


Glu 


Gly 


Tyr 


Ala 


Gin 


Met 










05 










90 










95 




Leu 


Gly 


Glu 


Asp 


Tyr 


Arg 


Ser 


Asn 


Asn 


Ala 


Glu 


Ala 


As n 


Leu 


Gin 










100 










105 










110 






Leu 


Pro 


Ser 


Asp 


Leu 


Leu 


Asp 


Tyr 


He 


Tyr 


Lys 


Gin 


Glu 


Asn 


Thr 


He 






115 










120 










125 








Ala 


Tyr 


r"-.-o 
-'J ^ 


Asp 


Thr 


Gly 


Leu 


He 


Ser 


Thr 


Arg 


He 


Asn 


Thr 


Tyr 


Thr 




130 










135 










J 4 0 










Tyr 


Gl Vi 


Gly 


Gly 


Ala 


His 


Thr 


Glu 


Asn 


Thr 


Val 


Arg 


Phe 


Ala 


Asn 


He 


145 










150 










155 










160 


Leu 


Arg 


Thr 


Thr 


Gly 


Lys 


Val 


Leu 


Glu 


Glu 


Arg 


Asp 


He 


Phe 


Lys 


He 










1 65 










170 










175 




Asp 


Tyr 


Ala 


Glu 


Arg 


Leu 


Ser 


Ala 


Leu 


He 


He 


Gly 


Gin 


Leu 


Val 


His 








180 










185 










190 






Asp 


Phe 


Gly 


Lys 


Thr 


Thr 


Pro 


Ala 


Glu 


Leu 


Asp 


Ala 


He 


Gly 


Phe 


Phe 






195 










200 










205 








Asn 


Ala 


Glu 


Glu 


He 


Gin 


Pro 


Asn 


Gly 


Asn 


Phe 


Met 


He 


Asp 


Asp 


Lys 




210 










215 










220 






Gly 


Leu 


Thr 


Tyr 


Cys 


Phe 


Asn 


Glu 


Tyr 


Gin 


He 


Ala 


Ala 


Tyr 


Ala 


Arg 


225 










230 










235 










240 


Gly 


Ala 


Val 


Tyr 


Val 


Arg 


Leu 


Gly 


Tyr 


Asp 


Val 


Leu 


Ala 


Pro 


Leu 


Leu 










245 










2 50 










<L ^ ^ 




Arg 


Asp 


Asp 


Ser 


Pro 


Leu 


Lys 


Arg 


Tyr 


Leu 


Pro 













2 60 2 65 

(2) IIIFORMATIOIJ FOR SEQ ID 110:514 



(i) SEQUEIICE CHARACTERISTICS: 

{A) LENGTH: 259 amino acids 
75 {B) TYPE: amino acid 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_L> 
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465/490 



(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

5 (iii) HYPOTHETICAL: YES 

( V i ) OR I G 1 N AL SOU RCE : 

(A) ORG AM ISM: Porphyromonas gingivalis 

10 (ix) FEATURE: 

(A) MAfiE/KEY: mi sc_f eatu re 

(B) LOCATION 1 ... 259 



70 



(xi) SEQUENCE DESCRIPTIOH: SEQ ID MO: 51^ 



15 



20 



25 



30 



35 



40 



45 



(2) IMFORI lATIOU FOR SEQ ID HO: 515 

(i) SEQUEUCE CHARACTERISTICS: 

(A) LENGTH: 1266 amino acids 
55 (B) TYPE: amino acid 

{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

60 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Po rphyromonas gingivalis 

65 (ix) FEATURE: 

(A) NAME/ KEY: mi£C_feature 

(B) LOCATION 1 . . .1266 



Met 


Lys 


Lys 


Thr 


Thr 


Leu 


Thr 


Gly 


Ser 


He 


Cys 


AJ a 


Leu 


Leu 


Leu 


Pt-ie 


1 








c 










10 










15 




Leu 


Gly 


Leu 


Ser 


Ala 


Asn 


Ala 


Gin 


Ser 


Lys 


Leu 


Lys 


He 


Lys 


Ser 


He 








2 0 










25 










30 






Glu 


Ala 


Aia 


Thr 


Thr 


Phe 


Ser 


Ser 


Ala 


Thr 


Ala 


Gly 


Asn 


Gly 


Phe 


Gly 






35 




















45 








Gly 


Asn 


He 


Phe 


Gly 


Met 


Asp 


Met 


Ser 


He 


Arg 


Met 


Arg 


Val 


His 


His 




50 










c c 










60 










Ser 


He 


Leu 


Pro 


Glu 


Gly 


Leu 


Asp 


P).e 


Ser 


Val 


Gly 


He 


His 


Glu 


Arg 


65 










7 0 










7 S 












Arg 


Ala 


His 


Trp 


Glu 


Glu 


Ala 


'3iy 


Ser 


Pro 


Lys 


Leu 


Met 


Tyr 


Thr 


Asn 










05 










90 














Val 


Fro 


Ser 


He 


He 


Gly 


He 


Val 


Glu 


Lys 


Val 


He 


Val 


Ph- 


Glu 


Asp 








100 










105 










il 0 






Ala 


Glu 


Asp 


Phe 


Phe 


Asp 


Lys 


Lys 


Ala 


Leu 


Gly 


Arg 


Phe 


Leu 


He 


Ser 






115 










120 










125 








Leu 


Gly 


He 


Ser 


Tyr 


Thr 


Lys 


His 


Leu 


Gly 


Ala 


Tyr 


Trp 


Gly 


Trp 


Thr 




130 










135 










140 










Asn 


Asp 


Ala 


His 


He 


Leu 


Phe 


Ser 


Pro 


He 


Pro 


Lys 


Ser 


Lys 


Val 


His 


145 










150 










155 










160 


Tyr 


Asp 


Thr 


Tyr 


Thr 


Arg 


Ala 


Gly 


Ser 


Asp 


Leu 


Val 


Leu 


Gin 


Ser 


Glu 
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Asp 


Val 


Ala 


Thr 


Val 


Ser 


Asn 


Gly 


Phe 


Ser 


Pro 


Gly 


He 


Gly 


Leu 


Lys 
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He 
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Lys 
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Arg 






195 
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205 
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215 
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Ser 
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Ser 
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Leu 


Glu 
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Gl u 
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Ser 
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He 


Arg 
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Glu 
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Leu 


Glu 


Arg 
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Leu 
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215 
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Glu 


Gly 


Leu 


Gl u 


Arg 


Leu 


Thr 


Ser 


Leu 


Thr 


Lys 


Leu 


Arg 


Leu 


Arg 


Ser 


Z2S 










230 










235 
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A-n 


Gln 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Aro 


Leu 


Thr 


Ser 


Leu 


Ala 










24 5 










2 50 










255 




Tlir 


Leu 


Glu 


Leu 


Ser 


Gly 


Asn 


Gin 


He 


Arg 


Lj-s 


Leu 


Glu 


G 1 V 


Leu 


Glu 








260 
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270 






Arg 


Leu 
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Ala 


Thr 


Leu 


Glu 


Leu 


Ser 


Gl ■ ' 


Asn 




He 
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Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Ser 


Ser 


L-^u 


Thr 


±jj ^ 


L^u 


Arg 


Leu 
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Arg 


Ser 


Asn 


Gin 


He 


Ser 


Lys 


Leu 


Glu 


G 1 Y 


Leu 


Glu 


Ar a 


L^u 


Thr 


Ser 


305 
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315 
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Leu 


Thr 




Leu 


Ser 


Leu 


Ser 


Asp 


Asn 


Gin 


He 


Ser 


Lys 


Leu 


Gl u 


G 1 \- 
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330 










335 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Ala 


Glu 


Leu 


Tyr 


Leu 


Leu 


ASD 


Asn 


Gin 








340 










34 5 
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He 


Arg 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Thr 


Lys 


Leu 






355 
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Arg 


Leu 


Arg 


Ser 


Asn 


Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Asp 


Ser 


Leu 
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Thr 


Ser 


Leu 


Thr 
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Ser 


Leu 


Ser 
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Gl n 


He 


Ser 


Lys 


Leu 
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Glu 
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Leu 


Glu 
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Thr 
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Leu 
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Glu 


Leu 


Tyr 


Leu 


Leu 


Asp 
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He 
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Leu 
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Gly 
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Gly 


Leu 


Ala 
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Leu 


Thr 
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Arg 
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Arg 


Arg 


Asn 
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He 
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Lys 
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Leu 


Asp 
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Arg 
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Val 
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Val 
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G 1 y 


Asn 
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He 
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Ser 


He 


Asp 


Asp 


He 


Lys 


Leu 


Leu 


Ala 
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He 


Leu 


Glu 


Gin 


Thr 


Leu 


465 
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Glu 


Lys 


Leu 


Arg 


He 
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Asp 
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Pro 


Phe 
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Ser 
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Leu 


Ser 


Pro 


Tyr 


Asp 


Asn 
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Pro 


Glu 


He 


Lys 


Ala 


Leu 


Leu 
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505 
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Glu 


Lys 


Glu 


Lys 


Glu 


Lys 


Gin 


Lys 


Lys 


Thr 


Ser 


Val 


Glu 


T' -r 


His 


Pro 
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Phe 


Cys 


Lys 


Val 


Met 


Leu 


Leu 


Gly 


Asn 


His 


Ser 


Ser 


Gly 




Thr 


Thr 
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535 
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Phe 


Leu 


Ser 


Gin 


Tyr 


Asp 


Thr 


Asn 


Tyr 


Thr 


Tyr 


Gin 


Lys 


Asn 


Thr 


His 


545 










550 










555 










560 


Val 


Leu 


Ser 


He 


His 


Arg 


Ser 


Asn 


Asn 


Pro 


Asn 


Ala 


He 


Phe 


Tyr 


Asp 










565 










57 0 










575 


Phe 


Gly 


Gly 


Gl n 


Asp 


Tyr 


Tyr 


His 


Gly 


He 


Tyr 


Gin 


Ala 


Phe 


Phe 


Thr 








580 










505 










590 






Thr 


Gin 


S^l 


Leu 


Tyr 


Leu 


Leu 


Phe 


Trp 


Asp 


Ala 


Lys 


Lys 


Asp 


Arg 


Asn 






5 95 










600 










605 




Phe 


Val 


Sec 


Val 


Asp 


Asp 


Lys 


Glu 


Tyr 


Gin 


Thr 


Leu 


Asn 


Phe 


Asn 


Arg 




610 










615 










62 0 








Pro 


Tyr 


Trp 


Leu 


Gly 


Gl n 


He 


Ala 


Tyr 


Ala 


C' 's 


Asn 


Arg 


n- -c- 


Met 


Ser 


625 










630 










655 








640 
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Asp 
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Thr 
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Gin 


Thr 


Tlir 


Asp 
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650 










655 




Thr 


He 


He 


He 


Gin 


Thr 


His 


Ala 


Asp 


Glu 


Thr 


Gly 


Ala 


Lys 


Gin 


Gl n 








660 










665 










670 






Thr 


Leu 


Gly 


Cys 


Ala 


Ala 


Glu 


Asn 


Gly 


Val 


Leu 


Glu 


Glu 


He 


Tyr 


Val 






G7 5 










660 










685 








Ser 


Leu 


Glu 


Pro 


Lys 


Ala 


Asn 


Ser 


Ala 


Val 


His 


Ala 


Leu 


Asn 


Tyr 


Leu 




690 










695 










7 00 










A^n 


Glu 


Arg 


Val 


Arg 


Glu 


Val 


Val 


Ala 


Ser 


Arg 


Ser 


Lys 


Ser 


He 


Gin 


705 








710 










715 










720 


He 


Thr 


Glu 


Lys 


Asp 


Lys 


Gly 


Leu 


Tyr 


Glu 


Ala 


Leu 


Pro 


Thr 


He 


Ala 










725 










730 










735 




Gly 


Asp 


Asn 


Lys 


His 


He 


Pro 


He 


Ser 


Leu 


Gl u 


Ala 


Leu 


Ala 


Ala 


Gin 






740 










7 45 










750 






Leu 


Asn 


Lys 


Gly 


Arg 


Ala 


Glu 


Asn 


Asp 


Leu 


Tyr 


Thr 


He 


Glu 


Tyr 


Leu 






755 










760 










765 








Gl n 


Thr 


Glu 


Leu 


Asn 


Gin 


Leu 


Ser 


Leu 


Arg 


Gly 


Glu 


Val 


Leu 


Tyr 


Tyr 




770 










775 










7 80 










Arg 


Glu 


Asn 


Glu 


Lys 


Leu 


Asn 


Asn 


Tyr 


Val 


Trp 


Leu 


Asp 


Pro 


Ala 


Ala 


785 










790 










795 










800 


Phe 


Val 


Gin 


Met 


He 


His 


Gly 


Glu 


He 


Leu 


Gin 


Lys 


Asp 


Asn 


He 


Asn 










805 










810 










815 




Arg 


Gly 


Thr 


Val 


Pro 


Lys 


Asp 


He 


Phe 


Glu 


Cys 


Lys 


Leu 


His 


Asn 


Leu 






820 










825 










830 






Ser 


SeL 


Giy 


Set- 


He 


Phe 


Glu 


Glu 


Asp 


Gly 


Gin 


Asn 


Gly 


Asn 


Met 


He 






9 35 










S40 










845 








Leu 


Gin 


Leu 


Leu 


Leu 


Gl u 


Gl u 


Leu 


He 


Val 


Tyr 


Glu 


Asp 


Lys 


Asp 


Cys 




850 










855 










860 










Tyr 


Val 


He 


Pro 


Gly 


Tyr 


Leu 


Pro 


Leu 


His 


Ser 


Asp 


Asp 


Glu 


Ala 


Tyr 


B65 










870 










875 










88 0 


Ly£- 


Trp 


Leu 


Thr 


Leu 


Gly 


Phe 


Glu 


Arg 


Pro 


Asn 


Phe 


Val 


Leu 


Lys 


Phe 








885 










890 










895 




Glu 


Arg 


Phe 


He 


Pro 


Phe 


Gly 


Leu 


He 


Asn 


Gin 


He 


He 


Ala 


Tyr 


Tyr 








900 










905 










910 






Gly 


Arg 


Glu 


Glu 


Gly 


Ala 


Leu 


Lys 


Arg 


Tyr 


Trp 


Arg 


Asp 


Gin 


Val 


He 






915 










920 










925 








Phe 


Thr 


Ala 


Gly 


Arg 


Glu 


Met 


Asp 


Arg 


Gin 


Thr 


Leu 


Glu 


Gin 


Glu 


Glu 




930 










935 










940 










Glu 


Lys 


Glu 


Gly 


Leu 


Pro 


Lys 


Thr 


Asn 


Ala 


Glu 


Asp 


Tyr 


Gin 


He 


Trp 


945 








950 










955 










960 


He 


Lys 


Leu 


Asp 


Phe 


Thr 


Asp 


Leu 


Ala 


lie 


Ser 


Val 


Phe 


He 


Lys 


Glu 






965 










970 










975 




Gin 


Arg 


Lys 


Thr 


Ser 


Ala 


Lys 


Asp 


Met 
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Arg 


Lys 


Glu 


Ala 


Thr 
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980 
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Leu 


Ser 


Asp 


Met 


Leu 


Asp 


Met 


Tyr 


Trp 


Asn 


Asn 


He 


Pro 


Pro 


Arg 


Glu 
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1000 
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Gin 


He 


Gly 


Asp 


Lys 


Asp 


Thr 


Glu 


Gin 


Thr 


Arg 


Ser 


Thr 


He 


Arg 


Glu 
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Thr 
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Arg 
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Gin 
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Thr 








1045 
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Leu 


Asp 


Asp 


Glu 


Ser 
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Thr 


Thr 


Ala 


Arg 
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Ala 
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Tyr 


Pro 


Leu 








1060 
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Gly 


Val 


He 


Asp 


Lys 


Glu 


Arg 


Val 


Arg 
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Ser 


Thr 


Arg 
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Gin 
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1095 
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Tyr 
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Lys 


Glu 
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Thr 


Glu 


Leu 


Glu 


Thr 


Cys 
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1110 
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Leu 


Gin 


Phe 


Phe 


Lys 


Pro 


Leu 


Glu 


Lys 
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Gly 


Gin 


He 


Glu 


He 


Tyr 
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1130 
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Tyr 


Asp 


Lys 


Leu 
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Lys 


Phe 
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Pro 


He 
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Pro 


Glu 


He 
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Val 
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Ala 
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He 
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He 
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Gin 
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Thr 


Phe 
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Glu 
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Asp 
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Lys 
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Arg 
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Arg 
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Lys 


Leu 










245 
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Leu 
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Arg 


Leu 


Ser 


Ser 


Leu 


Thr 


Lys 


Leu 


Arg 


Leu 


Arg 


Ser 
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270 
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Ser 
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Leu 


Glu 


Gly 


Leu 


Glu 


Arg 
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Thr 


Ser 


Leu 


Thr 
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280 










285 








Lys 


Leu 


Ser 


Leu 
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Asp 


Asn 


Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 




2 90 










2 95 










300 










Arg 


Leu 


Thr 


Ser 


Leu 


Ala 


Glu 


Leu 


Tyr 


Leu 


Leu 


Asp 


Asn 


Gin 


He 


Arg 


305 










310 










315 










320 


Lys 


Leu 


Glu 


Gly 


Leu 


Gl u 


Arg 


Leu 


Thr 


Ser 


Leu 


Thr 


Lys 


Leu 


Arg 


Leu 










325 










330 










335 




Arg 


Ser 


Asn 


Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Asp 


Ser 


Leu 


Thr 


Ser 








3'10 










345 










350 






Leu 


Thr 


Lys 


Leu 


Ser 


Leu 


Ser 


Asp 


Asn 


Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 






355 










360 










365 








Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Ala 


Glu 


Leu 


Tyr 


Leu 


Leu 


Asp 


Asn 


Gin 




370 










375 










300 










He 


Arg 


Lys 


Leu 


Glu 


Gly 


Leu 


Asp 


Gly 


Leu 


Ala 


Ser 


Leu 


Thr 


Arg 


Leu 
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385 










390 










395 










4 00 




Ser 


Leu 


Arg 


Arg 


Asn 
405 


Gin 


He 


Ser 


Lys 


Leu 
410 


Giu 


Gly 


Leu 


Asp 


Arg 
415 


Leu 




Lys 


Val 


Leu 


Arg 


Lys 


Leu 


Asp 


Val 


Ser 


Gly 


Asn 


Asp 


He 


Gin 


Ser 


He 


5 








420 










4 25 










430 








Asp 


Asp 


He 


Lys 


Leu 


Leu 


Ala 


Pro 


He 


Leu 


Glu 


Gin 


Thr 


Leu 


Glu 


Lys 






435 










440 










445 










Leu 


Arg 
450 


He 


His 


Asp 


Asn 


Pro 
455 


Phe 


Val 


Ala 


Ser 


Ser 
460 


Gly 


Leu 


He 


Leu 


10 


Ser 

AGS 


Pro 


Tyr 


Asp 


Asn 


His 
470 


Leu 


Pro 


Glu 


He 


Lys 
475 


Ala 


Leu 


Leu 


Glu 


Lys 

480 




Glu 


Lys 


Glu 


Lys 


Gin 


Lys 


Lys 


Thr 


Ser 


Val 


Glu 


Tyr 


His 


Pro 


Phe 


Cys 










485 










4 90 










4 95 






Lys 


Val 


Met 


Leu 


Leu 


Gly 


Asn 


His 


Ser 


Ser 


Gly 


Lys 


Thr 


Thr 


Phe 


Leu 


15 








500 










505 










510 








Ser 


Gin 


Tyr 
515 


Asp 


Thr 


Asn 


Tyr 


Thr 
520 


Tyr 


Gin 


Lys 


Asn 


Thr 

525 


His 


Vra 


Leu 




Ser 


He 
530 


Hi^ 


Arg 


Ser 


Asn 


Asn 
535 


Pro 


Asn 


Ala 


He 


Phe 
54 0 


Tyr 


Asp 


Phe 


Gly 


20 


Gly 
54 5 


Gin 


Asp 


Tyr 


Tyr 


His 

550 


Gly 


He 


Tyr 


Gin 


Ala 

555 


Phe 


Phe 


Thr 


Thr 


Gin 
560 




Ser 


Leu 


Tyr 


Leu 


Leu 
565 


Plie 


Trp 


Asp 


Ala 


Lys 
570 


Lys 


Asp 


Arg 


Asn 


Phe 
57 5 


Val 




Ser 


Val 


Asp 


Asp 


Lys 


Glu 


Tyr 


Gin 


Thr 


Leu 


Asn 


Plie 


Asn 


Arg 


Pro 


Tyr 


25 






580 










585 










5 90 








Trp 


Leu 


Gly 


Gin 


He 


Ala 


Tyr 


Ala 


Cys 


Asn 


Arg 


Cys 


1 let 


Ser 


Val 


Gly 






595 










600 










605 










Gly 


Asn 
GIO 


Pre 


Asp 


Gly 


Lys 


Asp 
615 


Thr 


Pro 


Gin 


Thr 


Thr 

62 0 


Asp 


Asp 


Thr 


He 


30 


He 

625 


He 


Gin 


Thr 


His 


Ala 
630 


Asp 


Glu 


Thr 


Gly 


Ala 
635 


Lys 


Gin 


Gin 


Tlir 


Leu 
64 0 




Gly 


Cys 


Ala 


Ala 


Glu 


Asn 


Gly 


Val 


Leu 


Glu 


Glu 


He 


Tyr 


Val 


Ser 


Leu 








645 










650 










655 






Glu 


Pro 


Lys 


Ala 


Asn 


Ser 


Ala 


Val 


His 


Ala 


Leu 


Asn 


Tyr 


Leu 


Asn 


Glu 


35 






660 










665 










670 








Arg 


Val 


Arg 
67 5 


Glu 


Val 


Val 


Ala 


Ser 
680 


Arg 


Ser 


Lys 


Ser 


He 
685 


Gin 


He 


Thr 




Glu 


Lys 
690 


Asp 


Lys 


Gly 


Leu 


Tyr 
695 


Glu 


Ala 


Leu 


Pro 


Thr 
700 


He 


Ala 


Gly 


Asp 


40 


Asn 


Lys 


His 


He 


Pro 


He 


Ser 


Leu 


Glu 


Ala 


Leu 


Ala 


Ala 


Gin 


Leu 


Asn 




7 05 








710 










715 










720 




Ly^ 


Gly 


Arg 


Ala 


Glu 
725 


Asn 


Asp 


Leu 


Tyr 


Thr 
730 


He 


Glu 


Tyr 


Leu 


Gin 
735 


Thr 




Giu 


Leu 


Asn 


Gin 


Leu 


Ser 


Leu 


Arg 


Gly 


Glu 


Va 1 


Leu 


Tyr 


Tyr 


Arg 


Glu 


45 








74 0 










74 5 










750 








Asa 


Glu 


Lys 

755 


Leu 


Asn 


Asn 


Tyr 


Val 
760 


Trp 


Leu 


Asp 


Pro 


Ala 
765 


Ala 


Phe 


Val 




Gin 


Met 
770 


He 


His 


Gly 


Gl u 


He 
775 


Leu 


Gin 


Lys 


Asp 


Asn 
780 


He 


Asn 


Arg 


Gly 


50 


Tl-ir 
7 85 


Val 


Pro 


Lys 


Asp 


He 
7 90 


Phe 


Glu 


Cys 


Lys 


Leu 
795 


His 


Asn 


Leu 


Ser 


Ser 
000 




Gly 


Ser 


He 


Phe 


Glu 


Glu 


Asp 


Gly 


Gin 


Asn 


Gly 


Asn 


Ilet 


He 


Leu 


Gin 










805 










810 










815 






Leu 


Leu 


Leu 


Glu 


Glu 


Leu 


He 


Val 


Tyr 


Glu 


Asp 


Lys 


Asp 


Cys 


Tyr 


Val 


55 








820 










825 










830 








He 


Pro 


Gly 
835 


Tyr 


Leu 


Pro 


Leu 


His 
840 


Ser 


Asp 


Asp 


Glu 


Ala 
845 


Tyr 


Lys 


Trp 




Leu 


Thr 

050 


Leu 


Gly 


Phe 


Glu 


Arg 
855 


Pro 


Asn 


Phe 


Val 


Leu 
860 


Lys 


Phe 


Glu 


Arg 


60 


Phe 
865 


He 


Pro 


Phe 


Gly 


Leu 
870 


He 


Asn 


Gin 


He 


He 
075 


Ala 


Tyr 


Tyr 


Gly 


Arg 
880 




Glu 


Glu 


'31y 


Ala 


Leu 
805 


Lys 


Arg 


Tyr 


Trp 


Arg 
890 


Asp 


Gin 


Val 


He 


Phe 

895 


Thr 


65 


Ala 


Gly 


Arg 


Glu 


I let 


Asp 


Arg 


Gin 


Thr 


Leu 


Glu 


Gin 


Glu 


Glu 


Gl u 


Lys 








900 










905 










910 








Glu 


Gly 


Leu 
915 


Pro 


Lys 


Thr 


Asn 


Ala 
920 


Glu 


Asp 


Tyr 


Gin 


He 
92 5 


Trp 


He 


Lys 




Leu 


Asp 


Phe 


Thr 


Asp 


Leu 


Ala 


He 


Ser 


Val 


Phe 


He 


Lys 


Glu 


Gin 


Arg 


70 




930 










935 










940 










Lys 


Thr 


Ser 


Ala 


Lys 


Asp 
950 


Met 


Gin 


Arg 


Lys 


Glu 

955 


Ala 


Thr 


He 


Leu 


Ser 
960 




Asp 


Met 


Leu 


Asp 


t'let 
965 


Tyr 


Trp 


Asn 


Asn 


He 
970 


Pro 


Pro 


Arg 


Glu 


Gin 

97 5 


He 




Gly 


Asp 


Lys 


Asp 


Thr 


Glu 


Gin 


Thr 


Arg 


Ser 


Thr 


He 


Arg 


Glu 


Thr 


Asn 


75 








980 










985 










990 
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65 



70 
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Arg 


Lys 


Lys 


Arg 


Fro 


He 


Gin 


Asp 


Leu 


Tyr 


Leu 


Ser 


cys 


Ala 


Gin 


Ala 






995 










1000 








1005 






AFP 


hys 


A.-P 


Leu 


Thr 


Glu 


Ser 


His 


Tyr 


He 


His 


Leu 


Gly 


Thr 


Leu 


Asp 




1010 








1015 








1020 






Asp 


Glu 


Sei: 


Lys 


Thr 


Thr 


Ala 


Arg 


He 


Ala 


Ala 


Tyr 


Pro 


Leu 


Lys 


Asn 


1025 








1030 








1035 








Gly 


Val 


He 


Asp 


Lys 


Glu 


Arg 


Val 


Arg 


Glu 


Val 


Ser 


Thr 


Arg 


Pro 


Tyr 










lO^f 










101 


50 








1055 


Ly^ 


His 


L-u 


Ser 


Val 


Asn 


Lys 


Asn 


Leu 


Ala 


Thr 


Ala 


Lys 


Gin 


He 


Phe 








1060 








1065 








1070 




lie 


Ser 


Tyr 


Ser 


Lys 


Glu 


Asp 


Gin 


Thr 


Glu 


Leu 


Glu 


Thr 


Cys 


Leu 


Gin 



1040 



1075 1080 1085 

Phe Phe Lys Pro Leu Glu Lys Asn Gly Gin He Glu He Tyr Tyr Asp 

1090 1095 1100 

Lys Leu Thr Lys Phe Glu Thr Pro He His Pro Glu He Arg Lys Arg 
1105 1110 1115 1120 

He Val Glu Ala Asp Cys He He Ala Leu He Ser Gin Arg Tyr Leu 

1125 1130 1135 

Ala Thr Asp Tyr He Leu Asp His Glu Leu Pro Val Phe Arg Glu Tyr 
20 1140 1145 1150 

Asn L^^■s Thr He Val Pro He Leu He Lys Pro Cys Thr Phe Glu Asp 

1155 1160 1165 

Asp Glu Ftie Leu Arg Glu Lys Tyr Phe Ala Gin Lys Ala Gin He He 
H'^O 1175 1180 

25 Asn Leu Gly Lys Glu Gly Lys Thr He Lys Ala Tyr Asp Ser He Thr 

1185 1190 1195 1200 

Ala Ser Ala His Arg Asp Glu Asn Trp Val Ala Val Val Arg Glu Phe 
1205 1210 1215 

Glu Lys He Leu Arg He Thr Lys Gin Glu Val Asn Thr Asp Glu 
30 1220 1225 1230 

(2) IHFOKIIATION FOR SEQ ID HO: 517 

(i) SEQUEKCE CHARACTERISTICS: 
35 (A) LEIIGTH: 1175 amino acids 

(B) TYPE: amino acid 
{D} TOPOLOGY: linear 



{ii) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAiriSM: Porphyromonas gingival i5 

(ix) FEATURE: 

(A) HAIIE/P:EY: inisc_feature 

(B) LOCATIOII 1 . . ,117 5 

(Ki) SEQUENCE DESCRIPTION: SEQ ID HO: 517 



I let 


Thr 


Trp 


Leu 


He 


Asp 


Phe 


Pro 


Ala 


Leu 


Lys 


Lys 


Leu 


Asp 


Leu 


Ser 


1 








5 










10 








15 




Tyr 


Asn 


Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 








20 










25 








30 






Thr 


Lys 


Leu 


Arg 


Leu 


Arg 


Ser 


Asn 


Gin 


He 


Arg 


Lys 


Leu 


Glu 


Gly 


Leu 






35 










40 










45 






Asp 


Ser 
50 


Leu 


Th r 


Ser 


Leu 


Thr 
55 


Lys 


Leu 


Ser 


Leu 


Ser 
60 


Asp 


Asn 


Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Ala 


Gl u 


Leu 


Tyr 


65 










70 










75 










80 


Leu 


Leu 


Asp 


Asn 


Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 










8 5 










90 








95 




Ser 


Leu 


Ala 


Thr 
100 


Leu 


Glu 


Leu 


Ser 


Gly 
105 


Asn 


Gin 


He 


Arg 


Lys 
110 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Ala 


Thr 


Leu 


Glu 


Leu 


Ser 


^ly 


Asn 






115 










120 










125 






Gin 


He 


Arg 


Ly^ 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Thr 


Lys 




130 










135 










14 0 








Leu 


Arg 


Leu 


Arg 


Ser 


Asn 


Gin 


He 


Ser 


hys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


145 










150 










155 










160 


Leu 


Thr 


Ser 


Leu 


Ala 
165 


Thr 


Leu 


Glu 


Leu 


Ser 
170 


Gly 


Asn 


Gin 


He 


Arg 
175 


Lys 


Leu 


Glu 


Gly 


Leu 
180 


Glu 


Arg 


Leu 


Thr 


Ser 
185 


Leu 


Ala 


Thr 


Leu 


Glu 
190 


Leu 


Ser 
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Gly Asn Gin He Ser Lys Leu Glu Gly Leu Glu Arg Leu Ser Ser Leu 

195 200 205 

Thr Lys Leu Arg Leu Arg Ser Asn Gin He Ser Lys Leu Glu Gly Leu 
210 215 220 

5 Glu Arg Leu Thr Ser Leu Thr Lys Leu Ser Leu Ser Asp Asn Gin He 
225 230 235 240 

Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Glu Leu Tyr 

245 250 255 

Leu Leu Asp Asn Gin He Arg Lys Leu Glu Gly Leu Glu Arg Leu Thr 
10 260 265 270 

Ser Leu Thr Lys Leu Arg Leu Arg Ser Asn Gin He Ser Lys Leu Glu 

275 280 285 

Gly Leu Asp Ser Leu Thr Ser Leu Thr Lys Leu Ser Leu Ser Asp Asn 
290 295 300 

15 Gin He Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Glu 
305 310 315 320 

Leu Tyr Leu Leu Asp Asn Gin He Arg Lys Leu Glu Gly Leu Asp Gly 

325 330 335 

Leu Ala Ser Leu Thr Arg Leu Ser Leu Arg Arg Asn Gin He Ser Lys 
20 340 345 350 

Leu Glu Gly Leu Asp Arg Leu Lys Val Leu Arg Lys Leu Asp Val Ser 

355 360 365 

Gly Asn Asp He Gin Ser He Asp Asp He Lys Leu Leu Ala Pro He 
37 0 37 5 380 

25 Leu Glu Gin Thr Leu Glu Lys Leu Arg He His Asp Asn Pro Phe Val 
385 390 395 400 

Ala Ser Ser Gly Leu He Leu Ser Pre Tyr Asp Asn His Leu Pro Glu 

4 05 ^110 4 15 

Jle Lys Ala Leu Leu Glu Lys Glu L^i'S Glu L-ys Gin Lys Lys Thr Ser 
30 420 425 430 

Val Glu Tyr His Pro Phe Cys Lys Val Met Leu Leu Gly Asn His Ser 

435 440 445 

Ser Gly Lys Thr Thr Phe Leu Ser Gin Tyr Asp Thr Asn Tyr Thr Tyr 
4 50 'J 55 4 60 

35 G\n Lys Asn Thr His Val Leu Ser He His Arg Ser Asn Asn Pro Asn 
^65 4 70 47 5 4 80 

Ala He Phe Tyr Asp Phe Gly Gly Gin Asp Tyr Tyr His Gly He Tyr 

485 490 495 

Gin Ala Phe Phe Thr Thr Gin Ser Leu Tyr Leu Leu Phe Trp Asp Ala 
40 500 505 510 

Lys Lys Asp Arg Asn Phe Val Ser Val Asp Asp Lys Glu Tyr Gin Thr 

515 520 525 

Leu Asn Phe Asn Arg Pro Tyr Trp Leu Gly Gin He Ala Tyr Ala Cys 
530 535 540 

45 Asn Arg Cys Met Ser Val Gly Gly Asn Pro Asp Gly Lys Asp Thr Pro 
545 550 555 560 

Gin Thr Thr Asp Asp Thr He He He Gin Thr His Ala Asp Glu Thr 

565 570 575 

Gly Ala Lys Gin Gin Thr Leu Gly Cys Ala Ala Glu Asn Gly Val Leu 
50 580 585 590 

Glu Glu He Tyr Val Ser Leu Glu Pro Lys Ala Asn Ser Ala Val His 

595 600 605 

Ala Leu Asn Tyr Leu Asn Glu Arg Val Arg Glu Val Val Ala Ser Arg 
610 615 620 

55 Ser Lys Ser He Gin He Thr Glu Lys Asp Lys Giy Leu Tyr Glu Ala 
625 630 635 640 

Leu Pro Thr He Ala Gly Asp Asn Lys His He Pro He Ser Leu Glu 

645 650 655 

Ala Leu Ala Ala Gin Leu Asn Lys Gly Arg Ala Glu Asn Asp Leu Tyr 
60 660 665 670 

Thr He Glu Tyr Leu Gin Thr Glu Leu Asn Gin Leu Ser Leu Arg Gly 

675 680 685 

Glu Val Leu Tyr Tyr Arg Glu Asn Glu Lys Leu Asn Asn Tyr Val Trp 
6 90 6 95 7 00 

65 Leu Asp Pro Ala Ala Phe Val Gin Met He His Gly Glu He Leu Gin 
7 05 710 715 720 

Lys Asp Asn He Asn Arg Gly Thr Val Pro Lys Asp He Phe Glu Cys 

725 730 735 

Lys Leu His Asn Leu Ser Ser Gly Ser He Phe Glu Glu Asp Gly Gin 
70 740 745 750 

Asn Gly Asn Met He Leu Gin Leu Leu Leu Glu Glu Leu He Val Tyr 

755 760 "765 

Glu Asp Lys Asp Cys Tyr Val He Pro Gl-:^' Tyr Leu Pro Leu His Ser 
77 0 77 5 7 80 

75 Asp Asp Glu Ala Tyr Lys Trp Leu Thr Leu Giy Phe Glu Arg Pro Asn 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



70 



785 










790 










7 95 










800 


Phe 


Val 


Leu 


Lys 


Phe 


Glu 


Arg 


Phe 


He 


Pro 


Phe 


Gly 


Leu 


He 


Asn 


Gl n 










805 










810 










815 




lie 


He 


Ala 


Tyr 


Tyr 


Gly 


Arg 


Glu 


Glu 


Gly 


Ala 


Leu 


Lys 


Arg 


Tyr 


Trp 








820 










825 










830 






Arg 


A^p 


Gin 


V a 1 


He 


Fhe 


Thr 


Ala 


Gly 


Arg 


Glu 


net 


Asp 


Arg 


Gin 


Thr 






835 










840 










845 








Leu 


Glu 


Gin 


Glu 


Glu 


Glu 


Lys 


Glu 


Gly 


Leu 


Pro 


Lys 


Thr 


Asn 


Ala 


Glu 




850 










855 










860 










Asp 


Tyr 


Gin 


He 


Trp 


He 


Lys 


Leu 


Asp 


Phe 


Thr 


Asp 


Leu 


Ala 


He 


Ser 


865 










870 










87 5 










88 0 


Va 1 


Phe 


He 


L-ys 


Glu 


Gin 


Arg 


Lys 


Thr 


Ser 


Ala 


Lys 


Asp 


Met 


Gl n 


Arg 










885 










890 










8 95 


Lys 


Glu 


Ala 


Thr 


He 


Leu 


Ser 


Asp 


I let 


Leu 


ASD 


Met 


Tvr 


TrD 


Asn 


As n 








900 










905 










910 






lie 


Pro 


Pro 


Arg 


Glu 


Gl n 


He 


Gly 


Asp 


Lvs 




Th r 


Glu 


Gin 


Thr 


A 

rg 






915 










920 










925 






Ser 


Thr 


He 


Arc 


Glu 


Thr 


Asn 


Arg 


Lys 




Arg 


Pro 


He 


Gin 


Asp 


Leu 














935 










940 








Tyr 


Leu 


Ser 


C"-'s 


Ala 


Gin 


Ala 


Asp 


Lys 


Asp 


Leu 


Thr 


Glu 


Ser 


His 


T'"r 


945 










950 










955 










960 


lie 


His 


Leu 


Gly 


Thr 


Leu 


Asp 


Asp 


Glu 


Ser 


Lys 


Thr 


Thr 


Ala 


Arg 


He 










965 










97 0 










975 




Ala 


Al a 


T-yr 


Pro 


Leu 


Lys 


Asn 


Gly 


Val 


I] e 


Asp 


Lys 


Glu 


Arg 


Val 


Arg 








98 0 










985 










990 




Glu 


Va ]. 


Ser 


Thr 


Arg 


Pro 


Tyr 


Lys 


His 


Leu 


Ser 


Val 


Asn 


Lys 


Asn 


Leu 
















1000 








1005 






Ala 


Tlir 


Ala 


Lys 


Gin 


He 


Phe 


He 


Ser 


Tyr 


Ser 


Lys 


Glu 


Asp 


Gin 


Thr 




101 










1015 








1020 








Glu 


Leu 


Glu 


Thr 


Cys 


Leu 


Gin 


Phe 


Phe 


Lys 


Pro 


Leu 


Glu 


Lys 


Asn 


Gly 


102 5 








1030 








1035 






Gin 


He 


Glu 


He 


Tyr 


Tyr 


Asp 


Lys 


Leu 


Thr 


Lys 


Phe 


Glu 


Thr 


Pro 


He 










1045 








1050 








1055 


Hi^ 


Pro 


Glu 


He 


Arg 


Lys 


Arg 


He 


Val 


Glu 


Ala 


Asp 


Cys 


He 


He 


Ala 








1060 








1065 








107 0 




Leu 


He 


Ser 


Gin 


Arg 


Tyr 


Leu 


Ala 


Thr 


Asp 


Tyr 


He 


Leu 


Asp 


His 


Glu 






107 5 








1080 








1085 






Leu 


Pro 


Val 


Phe 


Arg 


Glu 


Tyr 


Asn 


Lys 


Thr 


He 


Val 


Pro 


He 


Leu 


He 




1090 








1095 








1100 








Ly^ 


Pro 


n\'<^ 


Thr 


Phe 


Glu 


Asp 


Asp 


Glu 


Phe 


Leu 


Arg 


Glu 


Lys 


Tyr 


Phe 


1105 








1110 








1115 




] 


Ala 


Gin 


Lys 


Ala 


Gin 


He 


He 


Asn 


Leu 


Gly 


Lys 


Glu 


Gly 


Lys 


Thr 


He 










1125 








H 30 








1135 


hys 


Ala 


Tyr 


A:fp 


Ser 


He 


Thr 


Ala 


Ser 


Ala 


His 


Arg 


Asp 


Glu 


Asn 


Trp 








1140 








114 


5 








1150 


Val 


Ala 


Val 


Val 


Arg 


Glu 


Phe 


Lys 


Glu 


Lys 


He 


Leu 


Arg 


He 


Thr 


Lys 






115^ 










1160 








1165 






Gin 


Glu 


Va 1 


Asn 


Thr 


Asp 


Glu 





















1040 



1120 



1170 1175 
(2) IMFORI lATIOIJ FOR SEQ ID MO: 518 



(i) SEQUEflCE CHARACTERISTICS: 

(A) LENGTH: 22 9 amino acids 
55 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

60 (iii) HYPOTHETICAL: YES 

(vi) ORIGItlAL SOURCE: 

(A) ORGAMISII: Porphyromonas gingivali£ 

65 (iM) E'EATURE: 

(A) NAilE/KEY: misc_feature 

(B) LOCATION 1 ... 229 



(:-:i} SEQUEIJCE DESCRIPTION: SEQ ID 110:518 



net net Lys Lys Ala Phe Val Phe Val Leu Leu Val Cys Leu Phe Ser 

15 10 15 

Ser Phe Ser Ser Ser Ala Gin Thr Thr Thr Asn Ser Ser Ara Ser Tyr 

2 0 2 5 30^ 

75 Fhe Thr Gly Arg He Glu Lys Val Ser Leu Asn Leu Gly Val Pro Pro 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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10 



15 



20 



25 



35 



40 



55 



60 



65 



70 



75 



WO 99/29870 PCT/AU98/01023 

473/490 







35 










4 0 










45 








Val 


Ser 


Thr 


Glu 


Val 


Trp 


Gly 


llet 


Thr 


His 


Asp 


Ala 


Asn 


Gly 


Leu 


Pro 




50 










55 










60 










Phe 


Glu 


He 


Pro 


He 


Ser 


Phe 


Ser 


Arg 


Phe 


Asn 


Ser 


Gin 


Gly 


Asp 


He 


65 










70 










75 










00 


Ala 


Thr 


Thr 


Tyr 


Tyr 


He 


Ala 


Asn 


Ser 


Glu 


Ala 


Thr 


Leu 


Asn 


Glu 


Trp 








85 










90 










95 




Cys 


A^p 


Tyr 


Ala 


His 


Pro 


Gly 


Gly 


He 


Val 


Arg 


Val 


Glu 


Gly 


Arg 


Phe 








1 00 










105 










110 






Trp 


Lys 


Ilet 


Tlir 


Tyr 


Asn 


He 


Pro 


Thr 


Tyr 


Asn 


Ala 


Val 


cys 


Thr 


Arg 






115 










120 










125 








He 


Thr 


Phe 


Glu 


Asn 


Gin 


Glu 


He 


Glu 


Gly 


Thr 


He 


Val 


Leu 


He 


Pro 




130 










135 










1 4 0 










Lys 


Pro 


Lys 


Val 


Ser 


Leu 


Pro 


His 


Val 


Ser 


Glu 


Ser 


Val 


Pro 


C''S 


He 


145 










150 










155 










160 


Arg 


Thr 


Glu 


Ala 


Gly 


Arg 


Glu 


Phe 


He 


Leu 


Cys 


Glu 


Glu 


Asp 


Asp 


Thr 










165 










170 










175 




Phe 


Va2 


Ser 


His 


Asp 


Gly 


Asn 


Glu 


Val 


Thr 


He 


Gly 


Gly 


Lys 


Pro 


Phe 








180 










185 










190 






Leu 


Leu 


Asn 


Thr 


Asn 


Val 


Lys 


He 


Val 


Gly 


Asp 


Val 


Ser 


Gin 


Lys 


Tyr 






195 










2 00 










205 








Alu 


Val 


Gly 


Val 


Gly 


Glu 


He 


Arg 


Phe 


Leu 


Gin 


He 


cys 


Ala 


Gin 


Thr 




210 










215 










220 










Val 


Ser 


Gin 


G ], 1 1 


Lys 

























(2) IMFORMATIOM FOR SEQ ID 110:519 



(i) SEQUEIICE CHARACTERISTICS: 
30 (A) LEDGTH: 228 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



Hi) IIOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAMISM: Pcrphyromonas gingivalis 



(ix) FEATURE: 

(A) MAI-IE/ KEY: mi sc_f eature 

(B) LOCATION 1 . . .22 8 

45 (;;i) SEQUEIICE DESCRI PTIOl i : SEQ ID )IO:519 



50 



net 


Lys 


Lys 


Ala 


Phe 


Val 


Phe 


Val 


Leu 


Leu 


Val 


—J *- 


Leu 


Phe 


Ser 


Ser 


1 








5 










10 










15 




Phe 


Ssr 


Ser 


Ser 


Ala 


Gin 


Thr 


Thr 


Thr 


Asn 


Ser 


Ser 


Arg 


Ser 


Tyr 


Phe 








2 0 










25 










30 






Thr 


Gly 


Arg 


He 


Glu 


Lys 


Val 


Ser 


Leu 


Asn 


Leu 


Gly 


Val 


Pro 


Pro 


Val 






35 










40 










45 








Ser 


Thr 


Glu 


Val 


Trp 


Gly 


net 


Thr 


His 


Asp 


Ala 


Asn 


Gly 


Leu 


Pro 


Phe 




50 










55 










60 










Glu 


He 


Pro 


He 


Ser 


Phe 


Ser 


Arg 


Phe 


Asn 


Ser 


Gin 


Gly 


Asp 


He 


Ala 


65 










7 0 










75 










80 


Thr 


Thr 


Tyr 


Tyr 


He 


Ala 


Asn 


Ser 


Glu 


Ala 


Thr 


Leu 


Asn 


Glu 


Trp 


Cys 










85 










9 0 










95 




A^p 


Tyr 


Ala 


His 


Pro 


Gly 


Gly 


He 


Val 


Arg 


Val 


Glu 


Gly 


Arg 


Phe 


Trp 








i 00 










105 










110 






Lys 


llet 


Thr 


Tyr 


Asn 


He 


Pro 


Thr 


Tyr 


Asn 


Ala 


Val 


Cys 


Thr 


Arg 


He 






115 










120 










125 








Thr 


Phe 


Glu 


Asn 


Gin 


Glu 


He 


Glu 


Gly 


Thr 


He 


Val 


Leu 


He 


Pro 


Lys 




130 










135 










14 0 










Pro 


Lys 


Val 


Ser 


Leu 


Pro 


His 


Val 


Ser 


Glu 


Ser 


Val 


Pro 


Cys 


He 


Arg 


14 5 










150 










155 










160 


Thr 


Glu 


Ala 


Gly 


Arq 


Glu 


Phe 


He 


Leu 


C* 's 


Glu 


Glu 


Asp 


Asp 


Thr 


Phe 










165 










170 










17 5 




Val 


Ser 


His 


Asp 


Gly 


Asn 


Glu 


Val 


Thr 


He 


Gly 


Gly 


Lys 


Pro 


Phe 


Leu 








180 










185 










1 90 






Leu 


Asn 


Thr 


As II 


Val 


Lys 


He 


Val 


Gly 


Asp 


Val 


Ser 


Gin 


Lys 


Tyr 


Ala 






195 










200 










205 








Val 


Gly 


Val 


Gly 


Glu 


He 


Arg 


Phe 


Leu 


Gin 


He 


Cys 


Ala 


Gin 


Thr 


Val 




210 










215 










220 










Ser 


Gin 


Gin 


Lys 



























BNSDOCID: <WO 9929870A1J_> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



PCT/AU98/01023 



474/ 490 



225 

(2) IMFORMATIOM FOR SEQ ID MO: 520 

5 (1) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH; 540 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIMAL SOURCE: 
15 (A) ORGAN I SH: Porphyromonas gingiva li; 

(i>0 FEATURE: 

(A) MAI IE/KEY: misc_feature 

( B) LOCATIOM 1 ... 540 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(::i) SEQUEMCE DESCRIPTTOM: SEQ ID 110:520 



[let 


Lys 


Thr 


Lys 


Val 


Leu 


Arg 


Lys 


Phe 


Val 


Val 


Ala 


Ala 


Phe 


Ala 


Val 


1 


















10 










15 




Ala 


Thr 


Leu 


Cys 


Pro 


Leu 


Ala 


Gin 


Ala 


Gin 


Thr 


Met 


Gly 


Gly 


Asp 


Asp 








2 0 










"^5 










30 




Val 


Lys 


Va 1 


Val 


Gin 


Tyr 


Asn 


Gin 


Glu 


Lys 


Leu 


Val 


Gin 


Thr 


Arg 


Met 






3 5 










40 










45 






Ser 


Val 


Ala 


Asp 


As n 


Gly 


Trp 


He 


Tyr 


Val 


Met 


Thr 


His 


Ser 


Gly 


Tyr 




50 










55 










60 








Ar^p 


Thr 


Gly 


Asn 


Ser 


Asn 


Val 


Lys 


He 


Phe 


Arg 


Ser 


Lys 


Asp 


Gin 


Gly 


65 










70 










75 








80 


Ala 


Thr 


Tyr 


Gin 


Lys 


Leu 


Arg 


Asp 


Trp 


Asp 


Pro 


Ser 


Asp 


Asp 


Tyr 


Gin 










85 










90 










95 




Phe 


Gin 


Asp 


Phe 


Asp 


He 


Val 


Val 


Thr 


Gly 


Lys 


Asn 


Glu 


Ser 


Asp 


He 








100 










105 










110 




Lys 


lie 


Trp 


Ser 


Val 


Glu 


Leu 


I let 


Asn 


Lys 


Pro 


Gly 


Gly 


Tyr 


Lys 


Ser 






115 










120 










125 








Arg 


Val 


Ala 


Val 


Phe 


Ser 


Arg 


Asp 


Ala 


Asn 


Ala 


Gin 


Asn 


Ala 


Lys 


Leu 




130 










135 










140 








Val 


Tyr 


Lys 


Glu 


Asp 


Phe 


Ser 


Asn 


Val 


Gin 


Leu 


Tyr 


Asp 


Val 


Asp 


He 


14 5 










150 










155 










160 


Ala 


Ser 


Asn 


Tyr 


Arg 


Ser 


Pro 


Ser 


Ser 


Leu 


Asn 


Asn 


Gly 


Gly 


Asn 


Pro 










165 










170 










175 




Phe 


Ala 


Leu 


Ala 


Phe 


Ala 


Tyr 


Thr 


Gly 


Phe 


Asn 


Asn 


Thr 


His 


Lys 


He 








180 










185 










190 




Ser 


Phe 


Val 


Asp 


Tyr 


Val 


Phe 


Ser 


Leu 


Asn 


Gly 


Gly 


Gin 


Asn 


Phe 


Asn 






195 










200 










205 








Lys 


Asn 


Leu 


Leu 


Phe 


Ser 


Gin 


Asp 


Gly 


Glu 


L:-s 


Lys 


He 


Asp 


Lys 


Val 




210 










215 










22 0 










Asp 


Leu 


Ser 


Leu 


Gly 


Ser 


Thr 


Ser 


Glu 


Ser 


Met 


Gly 


His 


Asn 


Ala 


Trp 












230 










2 35 










240 


Pro 


Leu 


Met 


Gly 


Val 


Val 


Phe 


Glu 


Met 


Asn 


Lys 


Gin 


Gly 


Gly 


Lys 


Ser 










245 










250 










255 




Asp 


lie 


Gly 


Phe 


Leu 


Ser 


Asn 


Phe 


Val 


Asp 


Asn 


Asp 


Pro 


Glu 


Phe 


Gin 








260 










265 










270 






Trp 


Ser 


Gly 


Pro 


He 


Lys 


Val 


Ser 


Glu 


Ser 


Asp 


Met 


Ser 


Phe 


Ser 


Pro 






275 










280 










285 








Lys 


He 


Gin 


Met 


Leu 


Leu 


Asp 


Glu 


Asp 


Asn 


Asn 


Thr 


He 


Asn 


Gly 


Glu 




2 90 










295 










300 










Ser 


Cys 


His 


Asn 


Phe 


net 


He 


Thr 


Tyr 


Ser 


Asp 


Tyr 


Asp 


Ser 


Glu 


Tyr 


305 










310 










315 










320 


Ser 


Asp 


Trp 


Asp 


He 


Arg 


Tyr 


Val 


Tyr 


Pro 


Lys 


Lys 


Ser 


Phe 


Lys 


Tyr 










325 










330 










335 




Glu 


Lys 


Gly 


Lys 


Thr 


Pro 


Thr 


Met 


Asp 


Asp 


Leu 


Val 


Glu 


Ala 


Phe 


Leu 








340 










345 










350 






Thr 


Ala 


Ser 


Tyr 


Gin 


Ser 


Glu 


Thr 


Asn 


Ser 


Gly 


Leu 


Gly 


Tyr 


Asp 


Lys 






355 










360 










365 




Asn 


Ala 


Asn 


His 


Tyr 


Leu 


He 


Thr 


Tyr 


Ala 


Lys 


Lys 


Glu 


Glu 


Asn 


Gly 




370 










37 5 










380 










Thr 


Asn 


Thr 


Leu 


Lys 


Tyr 


Arg 


Trp 


Ala 


Asn 


Tyr 


Asp 


Lys 


He 


His 


Asn 


385 










390 










395 








400 


Lys 


Asp 


Leu 


Trp 


Ser 


Asp 


Thr 


Phe 


Thr 


Tyr 


Thr 


Ser 


Ser 


Ala 


Asn 


Ala 










405 










410 










415 




Leu 


Tyr 


Thr 


Pro 


Gin 


Val 


Asp 


He 


Asn 


Pro 


Thr 


Lys 


Gly 


Leu 


Val 





SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_L> 



25 



30 



WO 99/29870 PCT/AU98/01023 

475/490 



45 



50 



55 



60 



65 



70 



75 









420 










425 










430 






TL-p 


Ser 


Trp 


Val 


Glu 


Tyr 


Leu 


Pro 


Gly 


Lys 


Arg 


He 


Val 


Trp 


Ser 


Asp 




435 










440 










445 








Thr 


Gin 


Trp 


Thr 


His 


Ala 




Gl y 


Val 


Gl u 


Asp 


He 


Val 


Met 


Gin 


Glu 




4 50 










455 










4 60 










Gly 


Ser 


II- L 


Ly^ 


Leu 


Tyr 


Pro 


Asn 


Pro 


Ala 


Gin 


Glu 


Tyr 


Ala 


Val 


He. 












470 










475 










4G0 


Ser 


Leu 


Pro 


Thr 


Ala 


Ala 


A^n 


Cys 


Lys 


Ala 


Val 


Val 


Tyr 


Asp 


Met 


Gin 










485 










4 90 










495 




G]y 


Arq 


Val 


Val 


Ala 


Glu 


Ala 


Ser 


Phe 


Ser 


Gl- 


As n 


Glu 


Tyr 


Arg 


Leu 




500 










505 










510 






Asn 


Val 


Gl n 


His 


Leu 


Ala 


Lys 


Gly 


Thr 


Tyr 


ue 


Leu 


Lys 


Val 


Val 


Ser 






515 










520 










525 








Asp 


Thr 


Giu 


Arg 


Phe 


Val 


Glu 


Lys 


Leu 


He 


Val 


Glu 










530 










535 










540 











10 



15 

(2) IN FORMATION FOR SEQ ID NO: 521 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 771 amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 

( vi ) OKIGI NAL SOURCE : 

(A) ORGAItlSIl: Porphyromonas gingiva lis 

(ix) FEATURE: 

{A) NAME /KEY: misc_feature 

(B) LOCATION 1 ... 771 



35 SEQUENCE DESCRIPTION: SEQ ID NO: 521 



40 



Met 


Leu 


Thr 


He 


Arg 


Asn 


Phe 


Leu 


Leu 


Phe 


Cys 


Cys 


Leu 


Ser 


Leu 


He 


1 


















10 










15 




Ala 


Phe 


Ala 


Al a 


Asp 


Ala 


Gin 


Ser 


Ser 


Val 


Ser 


Ser 


Gly 


Arg 


Arg 


Leu 








2 0 








25 










30 






Thr 


Glu 


Tyr 


Val 


Asn 


Pro 


Phe 


He 


Gly 


Thr 


Ala 


Asn 


Tyr 


Gly 


Thr 


Thr 






35 










40 










45 








Asn 


Pro 


Gly 


Ala 


Val 


Leu 


Pro 


Asn 


Gly 


Leu 


I let 


Ser 


Val 


Thr 


Pro 


Phe 




50 








55 










60 










Asn 


Val 


Ser 


Gly 


Ser 


Thr 


Glu 


Asn 


Arg 


Phe 


Asp 


Lys 


Asp 


Ser 


Arg 


Trp 


65 










70 










75 










80 


Trp 


Ser 


Ala 


Pro 


Tyr 


Ser 


Ala 


Asp 


Asn 


Ser 


Tyr 


— J ^ 


He 


Gly 


Phe 


Ser 








85 










9 0 










95 




His 


Val 


Asn 


Leu 


Ser 


Gly 


Val 


Gly 


Cys 


Pro 


Giu 


Leu 


Ser 


Gly 


He 


Leu 








100 










105 










110 






Leu 


Met 


Ala 


Thr 


Ser 


Gly 


Thr 


Phe 


Asp 


Pro 


Asp 


T-yr 


C'-'s 


Cys 


Tyr 


Gly 






115 










120 










125 








Ser 


Ser 


Leu 


Ser 


Arg 


Glu 


Tyr 


Ala 


Arg 


Pro 


Gly 


Glu 


Tyr 


Lys 


Ala 


Val 




130 










135 










14 0 










Leu 


Asp 


Ly^- 


Tyr 


Gly 


He 


Asp 


Ala 


Ala 


Val 


Thr 


Val 


Thr 


Glu 


Arg 


Thr 


145 










150 










155 










160 


Ala 


Leu 


Thr 


Glu 


Phe 


Ala 


Phe 


Pro 


Glu 


Gly 


Glu 


Gly 


His 


He 


Leu 


Leu 










165 










17 0 










175 




Asn 


Leu 


G] y 


Gin 


Ala 


Leu 


Ser 


Asn 


Glu 


Ser 


Gly 


Ala 


Ser 


Val 


Arg 


Phe 








100 










185 










190 






Leu 


Asn 


Asp 


Ser 


Thr 


Val 


Val 


Gly 


Ser 


Arg 


Leu 


Met 


Gly 


Thr 


Phe 


Cys 






195 










200 










205 








Tyr 


Asn 


Fro 


Gin 


Ala 


Val 


Phe 


Arg 


Gin 


Tyr 


Pne 


Val 


Leu 


Gin 


Val 


Ser 


210 










215 










220 










Arq 


Arg 


Prv 


He 


Ser 


Ala 


Gly 


Tyr 


Trp 


Lys 


Lys 


Gin 


Pro 


Pro 


Met 


Thr 


22 5 










230 










235 










24 0 


Val 


Glu 


Ala 


Gin 


Trp 


Asp 


Sei: 


Thr 


Ala 


Gly 




Tyr 


Lys 


Gin 


Tyr 


Asp 










245 










2 50 










255 




Gly 


Tyr 


Lys 


Arg 


Glu 


Met 


Ser 


Gly 


Asp 


Asp 


He 


Gly 


Val 


Arg 


Phe 


Ser 








260 










265 










270 






Phe 


Asn 




Asp 


Gin 


Gly 


Glu 


Lys 


He 


Tyr 


Val 


Arg 


Ser 


Ala 


Val 


Ser 






275 










280 










285 








Phe 


Val 


Ser 


Glu 


Ala 


Asn 


Ala 


Leu 


Tyr 


Asn 


Leu 


Glu 


Ala 


Glu 


Gin 


Gl u 




290 










295 










300 










Glu 


Val 


Phe 


Lys 


Ser 


Val 


Gly 


Gly 


Asn 


Pre 


Ala 




Ala 


Phe 


Ser 


Ala 
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305 310 315 320 

He Arg Ser Arg Ala He Glu Arg Trp Glu Glu Ala Leu Gly Thr Val 

325 330 335 

Glu Veil Glu Gly Gly Thr Pro A^p Glu L^'^ Thr He Phe Tyr Thr Ala 

5 3-10 3^5 350 

Leu Tyr Hi^ Leu Leu He His Pro Asn He Leu Gin Asp Ala Asn Gly 

355 3G0 365 

Glu Tyr Fro Met I let Gly Ser Gly Ly^: Thr Gly Asn Thr Ala His Asp 

370 375 380 

10 Arg Tyr Thr Val Phe Ser Leu Trp Asp Thr Tyr Arg Asn Val His Pro 

385 390 395 400 

Leu Leu Cys Leu Leu Tyr Pro Glu Lys Gin Leu Asp Met Val Arg Thr 

405 410 415 

Leu He Asp I let Tyr Arg Glu Ser Gly Trp Leu Pro Arg Trp Glu Leu 

15 420 425 430 

Tyr Gly Gin Glu Thr Leu Thr Met Glu Gly Asp Pro Ser Leu He Val 

4 35 44 0 4 45 

He Asn Asp Thr Trp Gin Arg Gly Leu Arg Ala Phe Asp Tr^r Ala Thr 

4 50 4 55 4 60 

20 Ala Tyr Glu Ala Met Lys Lys Asn Ala Ser Ser Ala Gly Ala Thr His 

465 47 0 47 5 48 0 

Pro He Arg Pro Asp Asn Asp Asp T'yr Leu Thr Leu Gly Phe Val Pro 

4 85 4 90 4 95 

Leu Arg Glu Gin Tyr Asp Asn Ser Val Ser His Ala Leu Glu Tyr Tyr 

25 500 505 510 

Leu Ala Asp Trp Asn Leu Ser Arg Phe Ala His Ala Leu Gly His Lys 

515 52 0 52 5 

Glu Asp Ala Ala Leu Phe Gly Lys Arg Ser Leu Gly Tyr Arg His Tyr 

530 535 54 0 

30 Tyr Asn Lys Glu Tyr Gly Met Leu Cys Fro Leu Leu Pro Asp Gly Ser 

54 5 550 555 560 

Phe Leu Thr Pro Phe Asp Pro Lys Gin Gly Glu Asn Phe Glu Pro Asn 

565 570 575 

Pro Gly Phe His Glu Gly Ser Ala Tyr Asn Tyr Ala Phe Phe Val Pro 

35 580 585 590 

His Asp He Gin Gly Leu Ala Arg Leu i-let Gly Gly Ala Lys Val Phe 

555 600 605 

Ser Glu Arg Leu Gin Lys Val Phe Asp Glu Gly Tyr Tyr Asp Pro Thr 

610 615 620 

40 Asn Glu Pro Asp He Ala Tyr Pro Tyr Leu Phe Ser Tyr Phe Pro Lys 

625 630 635 640 

Glu Ala Trp Arg Thr Gin Lys Leu Thr Arg Glu Leu He Asp Lys His 

645 650 655 

Phe Cys Asn Ala Pro Asn Gly Leu Pro Gly Asn Asp Asp Ala Gly Thr 

45 660 665 670 

Met Ser A.la Trp Leu Val Tyr Ser Met Leu Gly Phe Tyr Pro Asp Cys 

675 680 685 

Pro Gly Ser Pro Thr Tyr Thr Leu Thr Ser Pro Val Phe Pro Arg Val 

690 695 700 

50 Arg He Arg Leu Asn Pro Gin Tyr Tyr Pro Gin Gly Glu Leu He He 

7 05 710 715 72 0 

Thr Thr Asn Thr Glu Asn Gin Pro Thr Asp Ser He Tyr He His Thr 

725 730 735 

Val Ser Leu Gly Asn Lys Thr Leu Pro His Gly Thr Arg His He Ser 

55 740 745 750 

His Ala Asp Leu Val Arg Cys Gly His Leu Arg Tyr Glu Leu Ser Asn 

755 760 765 
Arg Pro Arg 
770 



60 



(2) IIIFORi lATlOIl FOR SEQ ID 110:52 2 



ii) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 776 amino acids 
65 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii} flOLECULE TYPE: protein 

70 (iii) HYPOTHETICAL: YES 

(vi) ORIGItlAL SOURCE: 

(A) ORGAtiISM: Po rphyromonas gingival is 

75 (i::) FEATURE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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30 (2) IME'ORMATIOIJ FOR SEQ ID MO: 523 

(i) SEQUEIICE CHARACTERISTICS: 

(A) LEtfGTH: 1158 amino acids 

(B) TYPE: amino acid 
35 (D) TOPOLOGV: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGIMAL SOURCE: 

(A) ORGAMISM: Porphyromonas gingiva lis 



( ix ) FEATURE : 
45 {A) MAME/KEY: misc_f eatur- 

(B) LOCATIOH 1...1158 



(xi) SEQUEMCE DESCRIPTION: SEQ ID HO: 523 
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Lys 
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875 
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Leu 


Phe 


Glu 
885 


Thr 
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Phe 


He 


Val 
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Lys 


Asp 


Val 


Asn 
895 


Tyr 


Ser 


Glu 
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Glu 
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Gly 
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He 


Asn 
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Glu 
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He 
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Val 


Gin 
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Val 
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Glu 
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Thr 
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Pro 
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He 


Ser 
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Phe 
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Asp 
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Val 
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Pro 


Leu 
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Gly 
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Gly 
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Cys 


He 
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Asn 
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Tyr 


Phe 


Thr 
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Ser 


Ala 


Thr 
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Val 
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Leu 


Phe 
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He 
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Ala 


Leu 


Ala 


Glu 
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1020 








Gly 
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Ala 


Arg 
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Val 


Trp 


Asp 


He 


Phe 


Asn 
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Ala 


1025 








1030 








1035 
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Val 
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Asp 


Phe 
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Phe 


Arg 


Val 
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Gly 


He 


Ala 


Pro 


Asp 










1045 
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Val 
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Val 
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Val 


Arg 
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Ser 
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Val 
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Val 
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1085 






Glu 


He 


Tyr 
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Val 
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Pro 


Val 
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Thr 
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Glu 
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He 
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He 


Lys 


Trp 


Asp 
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Lys 
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(2) IlfFORIlATIOM FOR SEQ ID 110:52 4 

(i) SEQUEtlCE CHARACTERISTICS: 
50 {A} LENGTH: 961 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAMISM: Pc rphy romonas gingival is 

(ix} FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATIOM 1 , , . 961 

(xi) SEQUEtlCE DESCRIPTIOM: SEQ ID IJO:524 



Met Lys Lys Leu Phe Pro Leu Leu Leu Leu He Leu Ser He Leu Val 

15 10 15 

'^Y^ *31y Lys Lys Glu Lys His Ser Val Thr Glu He Ala Arg Glu 
/O 20 25 30 

Lys Lys Arg He Thr Ala Leu Leu Tyr Glu Lys Glu Leu Pro Thr Asp 

35 40 45 

Ser Val Lys Gin Leu Tyr Glu Asn Ser Val Gin Asn Lys Asn Leu Val 

50 55 60 

/O Gly Gin Met Leu Phe Ala He Glu Val Gly Lys Arg Met Arg Asn Met 



BNSDOCID: <WO 9929e70A1_L> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



PCT/AU98/01023 



481/490 
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Arg 


Arg 


He 
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Al a 
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Al a 


Ser 


Asp 
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175 




Leu 
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Arg 
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Gly 
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Glu 
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255 




Ala 


Thr 


Ala 


Tyr 


Lys 


Leu 


Met 
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Arg 
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Trp 
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Leu 


Asn 


Ser 


Cys 


He 


Pro 


I let 


Ala 


Arg 


He 


Asn 


Leu 


L^-s 


Gin 


Gly 


Asn 






275 










280 
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Glu 


Arg 
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Tyr 
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Phe 


He 
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Ala 


Glu 


Gly 


Thr 


Ala 


L:/s 




2 90 










295 










300 










Glu 


He 


Asn 


Ser 


Thr 


Ser 
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Leu 


He 


Glu 


He 


Tyr 


Asn 


Leu 


Gin 


Tyr 


305 










310 
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Glu 


Asn 


Leu 


Glu 


Arg 


Lys 


Lys 
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Ala 
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Ala 


Phe 
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Ser 
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Lys 


Val 
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Ser 
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Glu 


Arg 
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Ser 


Gin 
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360 
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Glu 
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Lys 
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He 
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Arg 
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Gly 


Lys 


Lys 


His 










4 05 










4 10 
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Asn 


Lys 
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He 


Lys 
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Thr 


Asp 


Lys 


Leu 


Arg 


Ser 


Gly 


Phe 


Phe 


Thr 








A20 










425 
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Gly 


He 


Thr 


His 


Glu 


Phe 


Arg 


Thr 
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He 


Thr 


Val 


He 


Gin 
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Leu 
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440 










445 








Acn 


Glu 
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Asp 
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Ser 


Asp 


Arg 
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4 55 
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Glu 


Leu 


His 
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He 


He 


Asp 


Arg 


Gin 


Ser 


Ser 


His 


Met 


Leu 


Asn 


Leu 


465 










470 










475 
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Val 
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He 
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He 


Arg 
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Gly 


Val 


Ser 


Thr 
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Pro 


Glu 


Trp 


Arg 


Asn 


Gly 


Asp 


He 


Val 


Ser 


Phe 


Val 


Gin 


He 


Leu 


He 






500 
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510 






Asp 


Ser 


Phe 


Ala 
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Tyr 


Ala 


Gin 


Ala 


Gin 


Asp 


He 


Thr 


Leu 


Glu 


Leu 






515 










520 










525 








Gin 


Fro 


Glu 


Ser 


Lys 


Pro 


He 


Val 


Val 


Asp 


Phe 


Val 


Pro 


Ser 


Tyr 


Leu 




530 










535 
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Gin 


Lys 


He 


He 


Ser 


Asn 


Leu 


Leu 


Ser 


Asn 


Ala 


He 


Lys 


Tyr 


Ser 


Leu 


545 










550 
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Ala 


Gly 


Gi- 


Arg 


Val 


Val 


He 


Ser 


Leu 


Ala 


Lys 


Thr 


Lys 


Asn 


Glu 


Lys 










565 










57 0 










57 5 




hs n 


Leu 


ll e 


He 


Arg 


Val 


Ala 


Asp 


Asn 


Gly 


He 


Gly 


He 


Asp 


Lys 


Thr 








580 










585 










590 






Asp 


Gin 


Ala 


His 


He 


Phe 


Asp 


He 


Phe 


Tyr 


Arg 


Gly 


Gin 


Ser 


Ala 


Thr 






595 










600 










605 








Glu 


Lys 


His 


Gly 


Ser 


Gly 


Val 


Gly 


Leu 


Ser 


Phe 


Thr 


Asn 


He 


Leu 


Val 




610 










615 










620 










Glu 


Asn 


Leu 


Arg 


Gly 


Thr 


He 


Lys 


Vol 


Glu 


Ser 


Gl n 


Pro 


Gly 


Lys 


Gly 


G25 










630 










635 










64 0 


Ser 


Ala 


Phe 


Thr 


He 


Ser 


He 


Pro 


Thr 


Gin 


Asn 


Gin 


Ser 


Ser 


Ser 


Ala 










645 










650 










655 




Glu 


He 


Leu 


Pro 


Trp 


Leu 


Fro 


Ser 


Ser 


Asp 


Asp 


He 


Val 


Met 


Pro 


Val 








660 










665 










67 0 







SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDCXIID: <WO 9929870A1_I_ 



wo 99/29870 PCT/AU98/01023 

482/490 



50 



55 



60 
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70 
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He 


Ala 

675 


Pro 


Asp 


Asp 


Ser 


Pro 
680 


Thr 


Ser 


Pro 


Met 


Val 
68 5 


Ala 


Ala 


Leu 


Asn 


His 
690 


Arg 


Phe 


Glu 


Asp 


Glu 
695 


Arg 


Pro 


Thr 


He 


Leu 
700 


Leu 


Val 


Glu 


Asp 


Asn 


Lys 


Asp 


He 


Asn 


Leu 


Leu 


Val 


L^ys 


Leu 


Leu 


Leu 


Cys 


Asp 


Arg 


Tyr 


705 










710 










715 






720 


Asn 


Val 


Leu 


Ser 


Ala 
725 


Ala 


Asn 


Gly 


Lys 


Glu 
7 30 


Gly 


He 


Ala 


Leu 


Ala 
735 


Thr 


Glu 


His 


He 


Pro 


Asp 


He 


He 


He 


Thr 


Asp 


He 


Met 


Met 


Pro 


He 


Met 








7^0 










745 








750 






Asp 


Gly 


He 

755 


Glu 


net 


Thr 


He 


Arg 
760 


Met 


Lys 


Gin 


Ser 


Pro 
765 


Leu 


Leu 


Cys 


His 


He 


Pro 


li- 


Val 


Ala 


Leu 


Thr 


Ala 


Lys 


Ser 


Thr 


Glu 


Gin 


Asp 


Arg 




770 










775 










780 








Leu 


Glu 


Gly 


ne 


Lys 


Ser 


Gly 


Val 


Val 


Ser 


Tyr 


Leu 


Cys 


Lys 


Pro 


Phe 


785 










790 










795 










800 


Ser 


Pro 


Glu 


Glu 


Leu 


Leu 


Met 


Arg 


He 


Glu 


Gin 


Leu 


Leu 


Lys 


Asp 


Arg 










805 










810 










015 


Glu 


Leu 


Leu 


Lys 


Lys 


Phe 


Tyr 


Met 


Gin 


Lys 


Leu 


Met 


Leu 


Asp 


Arg 


Lys 








820 










825 










8 30 




Pro 


Glu 


Gl u 


Glu 


Pro 


Gin 


Pro 


He 


Asp 


Asp 


Ser 


Ser 


Met 


Gin 


Phe 


Leu 






835 










840 








845 








Leu 


Ala 
850 


Ala 


Lys 


Asp 


Ala 


Val 
855 


Ser 


Gly 


Gly 


He 


Lys 
860 


Gin 


Asn 


Pro 


Asp 


Phe 


Ser 


Ala 


Gin 


Asp 


Leu 


Ala 


Glu 


Lys 


Met 


C'-s 


Met 


Ser 


Pro 


Ser 


Gl n 


865 










870 










875 










880 


Leu 


Asn 


Arg 


Lys 


Leu 
885 


Thr 


Ser 


Val 


Val 


Gly 
890 


C" "S 


Ser 


Thr 


He 


Gly 
895 


Tyr 


He 


Gin 


Gin 


He 
900 


Lys 


He 


Lys 


Leu 


Ala 
905 


Cys 


Lys 


Leu 


Leu 


Ala 
910 


Asp 


Glu 


Ser 


Lys 


Asn 


He 


Ser 


Asp 


He 


Ser 


He 


Glu 


Ala 
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Phe 


Ser 


Asp 


Pro 
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920 
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Tyr 


Phe 
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Arg 


Thr 


Phe 


Lys 


Arg 


Tyr 
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Cys 


Ser 
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Ser 




930 










935 
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40 (2) III FORMAT I ON FOR SEQ ID 110:525 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 556 amino acids 

(B) TYPE: amino acid 
45 (D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) MA]'1E/P:EY: misc_feature 

(B) LOCATION 1 ... 556 

(xi) SEQUENCE DESCRIPTION: SEQ ID J]0: 525 
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Ala 
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He 


Leu 


Ala 
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Gly 


Leu 
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Val 
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Val 
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Pro 


Phe 
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He 


Pro 


He t 


Glu 


Leu 


He 
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7 5 
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Lys 


Glu 


Leu 


Ala 


Lys 
85 


Asn 


Asp 


Lys 


Val 


He 

90 


Thr 


He 


Val 


Ala 


Seu 
95 


Glu 
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Thr 


Val 


He 


Thr 
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Thr 
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Ser 


Gly 


Val 


Asn 
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SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_L> 



wo 99/29870 PCT/AU98/01 023 

483/ 490 

Arg Asp Tyr Ttir Gly Trp Phe Ala Met Tyr Asp Thr Asn Lys Val Gly 

130 135 140 

Leu VaJ Asp Phe He Tyr Asn Arg Pro Arg Pro Asn Asp Asp Glu Phe 
145 150 155 160 

5 Pro Lys Tyr Glu Ala Gin Tyr Leu Gly He Glu Met Phe Gly Met Lys 
165 170 175 

Leu Lys Gin Thr Gly Gly Asn Tyr Met Thr Asp Gly Tyr Gly Ser Ala 

180 185 190 

Val Gin Ser His He Ala Tyr Thr Glu A^n Ser Ser Leu Ser Gin Ala 
10 19b 200 205 

Gin Val Asn Gin Lys Met Lys Asp Tyr Leu Gly He Thr His His Asp 

210 215 2 20 

Val Val Gin Asp Pro Asn Gly Glu Tyr He Asn His Val Asp Cys Trp 
225 230 235 240 

15 Gly Lys Tyr Leu Ala Pro Asn Lys He Leu He Arg Lys Val Pro Asp 

245 250 255 

Asn His Pro Gin His Gin Ala Leu Glu Asp Met Ala Ala Tyr Phe Ala 

260 265 270 

Ala Gin Thr Cys Ala Trp Gly Thr Lys Tyr Glu Val Tyr Arg Ala Leu 
20 275 280 285 

Ala Thr Asn Glu Gin Pro Tyir Thr Asn Ser Leu He Leu Asn Asn Arg 

2 90 295 3 00 

Val Phe Val Pro Val Asn Gly Pro Ala Ser Val Asp Asn Asp Ala Leu 
305 310 315 320 

25 Asn Val Tyr Lys Thr Ala Met Pro Gly Tyr Glu He He Gly Val Lys 

325 330 335 

Gly Ala Ser Gly Thr Pro Trp Leu Gly Thr Asp Ala Leu His Cys Arg 

340 345 350 

Thr His Glu Val Ala Asp Lys Gly Tyr Leu Tyr He Lys His Tyr Pro 
30 355 360 365 

He Leu Gly Glu Gin Ala Gly Pro Asp Tyr Lys He Glu Ala Asp Val 

370 375 380 

Val Sez Cys Ala Asn Ala Thr He Ser Pro Val Gin Cys Tyr Tyr Arg 
385 390 395 400 

35 He Asn Gly Ser Gly Ser Phe Lys Ala Ala Asp Met Thr Met Glu Set 

405 410 415 

Thr Gly His Tyr Thr Tyr Ser Phe Thr Gly Leu Asn Lys Asn Asp Lys 

420 425 *^30 

Val Glu Tyr Tyr He Ser Ala Ala Asp Asn Ser Gly Arg Lys Glu Thr 
40 435 440 445 

Tyr Pro Phe He Gly Glu Pro Asp Pro Phe Lys Phe Thr Cys Met Asn 

450 455 460 

Glu Thr Asn Thr Cys Thr Val Thr Gly Ala Ala Lys Ala Leu Arg Ala 
465 470 475 480 

45 Trp Phe Asn Ala Gly Arg Ser Glu Leu Ala Val Ser Val Ser Leu Asn 

4 85 4 90 4 95 

He Ala Gly Thr Tyr Arg He Lys Leu Tyr Asn Thr Ala Giy Glu Glu 

500 505 510 

Val Ala Ala Met Thr Lys Glu Leu Val Ala Gly Thr Ser Val Phe Ser 
50 515 520 525 

Met Asp Val Tyr Ser Gin Ala Pro Gly Thr Tyr Val Leu Val Val Glu 

530 535 540 

Gly Asn Gly He Arg Glu Thr Met Lys He Leu Lys 
545 550 555 



55 



(2) IIIFORHATIOH FOR SEQ ID MO: 52 6 



(i) 3EQUEIICE CHARACTERISTICS: 

(Al LENGTH: 4 28 amino acids 
60 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

65 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

70 ( ix ) FEATURE : 

{A) UAME/KEY: misc_f eat u re 

(B) LOCATION 1. . .428 



75 



(::i) SEQUEiJCE DESCRIPTION: SEQ ID NO: 526 



BNSDOCID: <WO 9929e70A1_L> 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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WO 99/29870 PCT/AU98/01023 

484/ 490 
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95 


A^n 


Thr 


Thr 


Arg 


Leu 


Gly 


Cys 


Arg 


Ser 


Thr 


Gly 


Ala 


Thr 


Ala 


Val 


Asp 








100 










105 










110 






Val 


ThL- 


Lys 


Asn 


Pro 


Asn 


Leu 


Thr 


Tyr 


Leu 


Al a 


Cys 


Pro 


Lys 


Asn 


Asn 






115 










120 










125 








Leu 


Lys 


S el- 


Leu 


Asp 


Leu 


Thr 


Gin 


Asn 


Pro 


Lys 


Leu 


Leu 


Arg 


Val 


Trp 




130 










135 










140 








Cys 


Asp 


se r 


Asn 


Glu 


He 


Glu 


Ser 


Leu 


Asp 


Leu 


Ser 


Gly 


Asn 


Pro 


Al a 


145 










150 










155 










160 



10 



15 



20 

Leu He He Leu Gly Cys Asp Arg Asn Lys Leu Thr Glu Leu Lys Thr 

165 170 175 

Asp Asn Asn Pro Lys Leu Ala Ser Leu Trp Cys Ser Asp Asn Asn Leu 
leo 185 190 

25 Thr Glu Leu Glu Leu Ser Ala Asn Pro Arg Leu Asn Asp Leu Trp Cys 
195 200 " 2 05 

Phe Gly Asn Arg He Thr Lys Leu Asp Leu Ser Ala Asn Pro Leu Leu 

210 215 220 

Val Thr Leu Trp Cys Ser Asp Asn Glu Leu Ser Thr Leu Asp Leu Ser 
30 225 230 235 240 

Lys Asn Ser Asp Val Ala Tyr Leu Trp Cys Ser Ser Asn Lys Leu Thr 

245 250 255 

Ser Leu Asn Leu Ser Gly Val Lys Gly Leu Ser Val Leu Val Cys His 
260 265 270 

35 Ser Asn Gin He Ala Gly Glu Glu Met Thr Lys Val Val Asn Ala Leu 
275 280 285 

Pro Thr Leu Ser Pro Gly Ala Gly Ala Gin Ser Lys Phe Val Val Val 

290 295 300 

Asp Leu Lys Asp Thr Asp Glu Lys Asn He Cys Thr Val Lys Asp Val 
40 305 310 315 320 

Glu Lys Ala Lys Ser Lys Asn Trp Arg Val Phe Asp Phe Asn Gly Asp 

325 330 335 

Ser Asp Asn Met Leu Pro Tyr Glu Gly Ser Pro Thr Ser Asn Leu Ala 
3^0 345 350 

45 Val Asp Ala Pro Thr Val Arg He Tyr Pro Asn Pro Val Gly Arg Tyr 
355 360 365 

Ala Leu Val Glu He Pro Glu Ser Leu Leu Gly Gin Glu Ala Ala Leu 

370 375 380 

Tyr Asp Met Asn Gly Val Lys Val Tyr Ser Phe Ala Val Glu Ser Leu 
50 385 390 395 400 

Arg Gin Asn He Asp Leu Thr His Leu Pro Asp Gly Thr Tyr Phe Phe 

405 ^10 415 

Arg Leu Asp Asn Tyr Ttir Thr Lys Leu He Lys Gin 
4 2 0 4 2 5 



(2) II I FORMATION FOR SEQ ID MO: 527 

(i) SEQUEtlCE CHARACTERISTICS: 

(A) LENGTH: 310 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

65 (iii) HYPOTHETICAL: YES 

(vi) ORIGIIIAL SOURCE: 

(A) ORGAMISM: Porphyromonas gingiva lis 

70 (ix) FEATURE: 

(A) MAME/KEY: misc_feature 
{ B) LOCATION 1 . . , 310 



75 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <W0 9929870A1_L> 



wo 99/29870 



PCT/AU98/01023 



485/490 



[•let Arg Lys Thr He He Phe Cys Leu Leu Leu Ala Leu Phe Gly Cys 
15 30 15 

Ser Trp Ala Gin Glu Arg Val Asp Glu Lys Val Phe Ser Ala Gly Thr 
20 25 30 

5 Ser He Plie Arg Gly He Leu Glu Lys Val Lys Ala Pro Leu Met Tyr 
35 40 45 

Gly Asp Arg Glu Val Trp Gly Met Ala Arg Ala Ser Glu Asp Phe Phe 

50 ' 55 60 

Phe He Leu Pro Val Thr Asp Asp Leu Thr Pro Val Leu Phe Tyr Asn 
10 65 70 75 80 

Arg Leu Thr Asn Glu Pro Cys Phe Val Ser Asp Gin Gly He Thr Glu 

05 90 95 

Tyr Phe Lys Phe Ala Gin Glu Gly Asp Tyr He Glu Val Glu Gly Ser 
100 105 110 

15 Ser Val Phe Met Ala Asn Leu Leu Tyr Tyr Arg Phe Phe Pro Thr Arg 
115 120 " 125 

He Thr Ser Tyr Asn Ala Pro He Glu Gly Val Val Ser Lys Thr Gly 

130 135 1^0 

Asn Pro Ala Phe Thr He Pro Met Leu Pro Gly Val Ser Asp Cys He 
20 US 150 155 160 

Glu He Ser Asn Asn Arg Lys Val Phe Leu Thr Asn Gin Leu Gly Val 

165 170 175 

Val Asti He Thr Asp Gly tlet Glu Pro Pro He He Ala Gly Val Ser 
leo 105 190 

25 Ala Ser Tyr Gly Ser Ser Val Arg Val Tyr Gly His Val Ser Gin Arg 
195 2 00 2 05 

Trp Asp He He Gly His Cys Tyr Leu Asp He Tyr Pre Thr Asn Cys 

210 215 220 

Tyr Pro Leu Ser Thr Lys Pro Val Ala Gly Asp Asp Glu Val Phe Val 
30 225 230 235 240 

Lys Gin Gin Gly Arg Gin He Glu He Asp Ser Asn Ser Fro He Val 

245 250 255 

Gin Val Val Val Tyr Asp Leu Glu Gly Lys Ser Val Phe Arg Lys Ara 
260 265 270 

35 Met Thr Glu Asn Ala Tyr Thr Leu Ser Phe Arg Ala Pro Met Leu Gly 
275 280 285 

Phe Met Thr He Met He Glu Thr Gin Asn Ser He He Asn Lys Lys 

290 295 300 

Leu Asn Val Thr Gin Leu 
40 305 310 

(2) IIIFORtlATIOll FOR SEQ ID l!O:520 

(i) SEQUEtJCE CHARACTERISTICS: 
45 (A) LENGTH: 4 05 amino acids 

(B) TVPE: aiuino acid 
( D ) TO POLOG Y : linear 



50 



55 



(ii) MOLECULE TYPE: protein 

Hii) HYPOTHETICAL: YES 

(vi) ORIGItfAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
( D) LOCATION 1 ... 405 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID JIO:528 





Met 


Lys 


Lys 


Thr 


Thr 


He 


He 


Ser 


Leu 


He 


Val 


Phe 


Gly 


Ala 


Ph- 




1 


















10 










is 


65 


Ala 


Ala 


Val 


Gly 


Gin 


Thr 


Lys 


Asp 


Asn 


Ser 


Ser 


Tyr 


Lys 


Pro 


Phe 








20 










25 










30 






Lys 


Glu 


Asp 
35 


He 


Ala 


Gly 


Gly 


Val 
40 


Tyr 


Ser 


Leu 


Pro 


Thr 
45 


Gin 


Asn 




Ala 


Gin 


Lys 


Asp 


Asn 


Ala 


Glu 


Trp 


Leu 


Jjeu 


Thr 


Ala 


Thr 


Val 


Ser 


70 




50 










55 










60 








Asn 
65 


Gin 


Ser 


Ala 


Asp 


Thr 
70 


His 


Phe 


He 


Phe 


Asp 
75 


Glu 


Asn 


Asn 


Arg 




He 


Ala 


A eg 


Asp 


He 
85 


Lys 


Ala 


Asn 


Gly 


Val 
90 


Arg 


Lys 


Ser 


Thr 


Asp 

95 


75 


He 


T-^-r 


Tyr 


Asp 


Ala 


Asn 


Gly 


Arg 


He 


Ser 


His 


Val 


Asp 


Leu 


Tyr 








100 










105 










110 





30 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1_I_> 



wo 99/29870 



PCT/AU98/01023 



486/ 490 

Ser Pile Ser Gly Gly Glu Fro Ala Leu A^p Thr Arg Phe Lys Tyr Thr 

115 120 125 

Tyr Asp A^p Glu Gly Lys Met Thr Val Arg Glu Val Phe Met Leu Val 
130 135 140 

5 Met Asp Pro Asn Thr Pro lie Ser Arg Leu Glu Tyr His Tyr Asp Ala 
145 150 155 160 

Gin Gly Arg Leu Thr His Trp lie Ser Phe Ala Phe Gly Ala Glu Ser 

165 170 175 

Gin Lys Asn Thr Tyr His Tyr Asn Glu Lys Gly Leu Leu Val Ser Glu 
10 180 185 190 

Val Leu Ser Asn Ala Met Gly Thr Thr Tyr Ser Asp Thr Gly Lys Tlir 

195 200 205 

Glu Tyr Ser Tyr Asp Asp Ala Asp Asn Met Val Lys Ala Glu Tyr Phe 
210 215 22 0 

15 Val Val Gin Gin Gly Lys Ala Trp Gin Val Leu Lys Arg Glu Glu Tyr 
225 230 235 240 

Thr Tyr Glu Asp Asn He Cys He Gin Tyr Leu Ala He Asn Gly Thr 

245 250 255 

Asp Thr Lys Val Tyr Lys Arg Asp He Glu Ser Asp Lys Ser He Ser 
20 260 265 270 

Ala Asn Val He Asp He Pro Ser Met Pro Glu Gin Thr Trp Pro Asn 

27 5 2 80 2 85 

Met Tyr Gly Phe Asn Ala Lys Arg Leu Lys Glu Thr Tyr Ser Ser Tyr 
290 295 300 

25 Glu Gly Asp Val Ala Thr Pro He Phe Asp Tyr He Tyr Thr Tyr Lys 
305 310 315 320 

Ala Leu Thr Ser Met Ala Thr Pro Ser Thr Glu Ala Gin Val Ala Val 

325 330 335 

Tyr Leu Asn Pro Ser Thr Asp Arg Leu Val He Leu Ala Asn Gly He 
30 340 34b 350 

Thr His Leu Ser Met Tyr Asp Leu Gin Gly Lys Leu He Arg Asp Cys 

355 360 365 

Ala Leu Ser Gly Asp Lys Val Glu Met Gly Val Gly Ser Leu Thr Lys 

370 375 380 

Gly Thr Tyr Leu Leu Lys Val Asn Thr Asp Gin Gly Ala Phe Val Arg 
385 390 395 ^00 

Lys Val Val He Arg 
405 



35 



40 (2) IllFORMATION FOR SEQ ID MO: 52 9 

(i) SLQUEIICE CHARACTERISTICS: 

(A) LEHGTH: 2037 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRAIIDEDWESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: Dt I7\ (genomic) 
50 (iii) HYPOTHETICAL: MO 

(iv) AMTI- SENSE: I JO 



55 



60 



(vi) ORIGIIJAL SOURCE: 

(A) ORGAliISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) MAI-IE/ KEY: mi sc_f eature 

(B) LOCATION 1 . . . 2 037 

(>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 52 9 



GTAATGGACT ACAAACTCAC TTCTCGATTC AAGCCCACGG GCGACCAGCC GGAAGCCATT 60 

CGCCAACTCG TACAGGGCAT CAACGAJ\GGG ATGCCGGCTC AGACGCTGCT CGGCGTAACG 120 

bb GGTTCGGGCA AAACTTTTAC GGTGGCTAAC GTGGTGGCGG CGGTCAATCG TCCGACCCTT 180 

GTCCTGAGTC ACAACAAGAC CTTGGCAGCA CAGCTATACG GAGAGTTCAA AGCCTTCTTC 2 4 0 

CCCGAGAATG CGGTGGAGTA TTTCGTCAGC TACTACGACT ACTATCAGCC CGAGGCCTAC 30^' 

CTCCCCGTCA CAGACACCTA TATCGAAAAG GACATGGCCA TCAA.CGCGGA GATCGAAAAA 360 

CTGCGATTGA GGGCCACGGC TTCGCTCCTG TCAGGGCGGA AAGATGTGCT TGTGGTCAGC 4-^0 

7U TCCGTATCCT GTCTCTACGG TATGGCCAAT CCTGAAGCTT TTTCCGAAAA GGTGATCAGC 4 80 

CTGCACACGG GACAAAGGGC AGACAGGGAT CATTTTATCC GCCTGCTGGT AGAGAGCTAC 54 0 

TACACGAACA ATAAAGTAGA GTTCGAGAGC GGCAACTTCC GTGTCAAAGG CGACAGCGTG 60O 

GACATATTCC CCGCCGTAGA AGGTTATGAC GGCGTGGCAT ACAGGGTGGA GTTTTGGGAT 660 

GGAGAGGTCG AGCGGCTGAG TACCTTCGAT CCGCGAACGG GACGGGAATA CGGCCTGCTG l-^n 

TCGGAGCTGA AGATATA'i'CC GGCCAATCTC TTCGTGACGA CTAAGGAGCA GGTGGATCGG IQC^ 



75 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 

BNSDOCID: <WO 9929870A1J_> 



wo 99/29870 



PCT/AU98/01023 



487/490 



GCAGTAGGGA AAATCGATGT GGATCTGGGC 
AAACCATATG AAGCCAAACG CTTGTATGAG 
GAGTTGGGTT ATTGTTCCGG TATAGAGAAC 
GGCGAACGTC CTTTCTGTCT GTTGGATTAT 
5 GAAAGCCATG 'I'AACGATACC GCAGATACGT 

GAGAATCTGG TCGAATACGG ATTCCGCCTG 
TTCGACGAGT TCGAAGCTCT CACCCCCCGG 
TATGAGCTGA ACAGAAGCGA AGGCGTGATC 
CTGGATCCCA TCATCGACGT CAAGCCGACG 

10 ATAGCACGCT GCATCGAAAA GAAAGAGCGC 
GCAGAGGAGC TTAGCGAATA CCTGCTACGC 
GATGTGGACA CGCTGGAGCG TGTGCGTATC 
GCACTCATCG GGGTG7KATCT GCTCCGCGAA 
GCTATTCTGG ATGCGGATT^ GGAAGGATTC 

15 GCAGGACGTG CCGCCCGGCA CATTCATGGG 

GACAGTATGC AGCTCACCAT GGACGAGACT 
AACGAAGCGC ACGGCATCAC CCCCCAACAG 
GAAGGCGATG TGTCGGCCTT GCAA.TCCGAT 
AGCATGGTGG CTGCCGATCC TTTGGCCGAC 

20 ATTGCTTCGA CCAAGAAGCA AATGCTGGCA 
GCACGACTTC GGGACGAAGC CGCACGATTG 



GCACAGGTCG ATTTTCTGAA AGAAATCGGC 840 
CGGGTCACGT ATGACTTGGA AATGATCCGT 900 
TATTCGCGCT ACTTCGACGG CCGTGACGCG 960 

TTCCCGGAGG ATTTCCTGTT GGTCATAGAC 1020 

GCCATGTACG GAGGCGATCG TTCGCGCAAG 1080 

CCTGCCGCTC TCGACAATCG GCCGCTTCGC 114 0 

ACCCTTTATA TCAGTGCCAC GCCTGCCGAC 12 00 

GTCGAGCAGC TGATCCGTCC GACCGGACTG 12 60 

GCAAACCAAG TGGACGATCT GATGGAGGAG 1320 

GTACTGGTAA CGACCCTGAC CAAACGTATG 13 80 

CACGGTATCA GCACCGGCTA CATACACAGC 1-^40 

ATGG7\A.GACC TGCGCAAGGG GGTCTACGAT 1500 

GGATTGGACT TGCCGGAAGT TTCGCTTGTG 1560 

CTGCGCTCGC ATCGTTCGCT CACGCAGACT 1620 

CGTGTCATCT TCTACGCGGA CAAGATCACC 1680 

GCACGCCGAC GCGCAAAGCA ACTGGCCTAC 17 4 0 

ATAGTGAAGA ACAGTGCTGC CATTTGGGGA 18 00 

ACAGAATCCG GTGCGTACAT AGAAGAGAGC 18 60 

TATCTGAGCA AACCCAAGCT GGAAGCACTC 1920 

GCAGCCAAAG AGCTGGACTT TCTGGAAGCG 19 80 

GAAAAGAAAC TGGAGCAACT CACAGCC 2037 



25 



(2) lUFORtiATIOII FOR SEQ ID 110:53 0 



(i) SEQUEtlCE CHARACTERISTICS: 

(A) LENGTH: 2034 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIIDEDUESS : double 
30 (D) TOPOLOGY: circular 



35 



40 



(ii) IIOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: (lO 

(iv) AllTI-SEMSE: MO 

(vi) ORIGIMAL SOURCE: 

(A) ORGAfJISM: Porphyromonas gingival is 



(ix) FEATURE: 

(A) MAilE/KEY: mi s c_f ea tu re 

(B) LOCATIOIJ 1 . . .2034 

45 SEQUEMCE DESCRIPTIOW: SEQ ID UO:530 

ATGGACTACA AACTCACTTC TCGATTCAAG CCCACGGGCG ACCAGCCGGA AGCCATTCGC 60 

CAACTCGTAC AGGGCATCAA CGAAGGGATG CCGGCTCAGA CGCTGCTCGG CGTAACGGGT 120 

TCGGGC7W\A. CTTTTACGGT GGCTAACGTG GTGGCGGCGG TCAATCGTCC GACCCTTGTC 180 

50 CTGAGTCACA ACAAGACCTT GGCAGCACAG CTATACGGAG AGTTCAAAGC CTTCTTCCCC 24 0 

GAGAATGCGG TGGAGTATTT CGTCAGCTAC TACGACTACT ATCAGCCCGA GGCCTACCTC 300 

CCCGTCACAG ACACCTATAT CGAAAAGGAC ATGGCCATCA ACGCGGAGAT CGAAAAACTG 360 

CGATTGAGGG CCACGGCTTC GCTCCTGTCA GGGCGG.AAAG ATGTGCTTGT GGTCAGCTCC 4 20 

GTATCCTGTC TCTACGGTAT GGCCAATCCT GAAGCTTTTT CCGAAAAGGT GATCAGCCTG 4 80 

55 CACACGGGAC AAAGGGCAGA CAGGGATCAT TTTATCCGCC TGCTGGTAGA GAGCTACTAC 54 0 

ACGAACAATA AAGTAGAGTT CGAGAGCGGC AACTTCCGTG TCAAAGGCGA CAGCGTGGAC 6 00 

ATATTCCCCG CCGTAGAAGG TTATGACGGC GTGGCATACA GGGTGGAGTT TTGGGATGGA 6 60 

GAGGTCGAGC GGCTGAGTAC CTTCGATCCG CGAACGGGAC GGGAATACGG CCTGCTGTCG 7 20 

GAGCTGAAGA TATATCCGGC CA/^.TCTCTTC GTGACGACTA AGGAGCAGGT GGATCGGGCA 7 80 

60 GTAGGGAf^j^"^ TCGATGTGGA TCTGGGCGCA CAGGTCGATT TTCTGAAAGA AATCGGCAAA 840 

CCATATG.AAG CCAAACGCTT GTATGAGCGG GTCACGTATG ACTTGGAAAT GATCCGTGAG 900 

TTGGGTTATT GTTCCGGTAT AGAGAACTAT TCGCGCTACT TCGACGGCCG TGACGCGGGC 960 

GAACGTCCTT TCTGTCTGTT GGATTATTTC CCGGAGGATT TCCTGTTGGT CATAGACGAA. 1020 

ASCChTGT^JK CGATACCGCA GATACGTGCC ATGTACGGAG GCGATCGTTC GCGCAAGGAG 1080 

65 AATCTGGTCG A.ATACGGATT CCGCCTGCCT GCCGCTCTCG ACAATCGGCC GCTTCGCTTC 114 0 

GACGAGTTCG AAGCTCTCAC CCCCCGGACC CTTTATATCA GTGCCACGCC TGCCGACTAT 12 00 

GAGCTGAACA GAAGCGAAGG 03TGATCGTC GAGCAGCTGA TCCGTCCGAC CGGACTGCTG 12 60 

GATCCCATCA TCGACGTCAA GCCGACGGCA AACCAAGTGG ACGATCTGAT GGAGGAGATA 1320 

GCACGCTGCA TCGAA.AAGAA AGAGCGCGTA CTGGTAACGA CCCTGACCAA ACGTATGGCA 1380 

70 GAGGAGCTTA GCGAATACCT GCTACGCCAC GGTATCAGCA CCGGCTACAT ACACAGCGAT 1440 

GTGGACACGC TGGAGCGTGT GCGTATCATG GAAGACCTGC GCA-^GGGGGT CTACGATGCA 1500 

CTCATCGGGG TGAATCTGCT CCGCGAAGGA TTGGACTTGC CGGAAGTTTC GCTTGTGGCT 1560 

ATTCTGGATG CGGATAAGGA AGGATTCCTG CGCTCGCATC GTTCGCTCAC GCAGACTGCA 1620 

GGACGTGCCG CCCGGCACAT TCATGGGCGT GTCATCTTCT ACGCGGACAA GATCACCGAC 16 90 

75 AGTATGCAGC TCACCATGGA CGAGACTGCA CGCCGACGCG CAAAGCAACT GGCCTACA.i\C 17 4 0 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCID: <WO 9929870A1_I_> 




wo 99/29870 



PCT/AU98/01023 



488/ 490 



GAAGCGCACG GCATCACCCC 
GG CGATGTGT CGGCCTTGCA 
ATGGTGGCTG CCGATCCTTT 
GCTTCGACCA AGAAGCAAAT 
CGACTTCGGG ACGAAGCCGC 



CCAACAGATA GTGAAGAACA GTGCTGCCAT TTGGGGAGAA 
ATCCGATACA GAATCCGGTG CGTACATAGA AGAGAGCAGC 
GGCCGACTAT CTGAGCAAAC CCAAGCTGGA AGCACTCATT 
GCTGGCAGCA GCCAAAGAGC TGGACTTTCT GGAAGCGGCA 
ACGATTGGAA AAGAAACTGG AGCAACTCAC AGCC 



1800 
1860 
1920 
1980 
203^ 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



(2) IlIFORilATIOtl FOR SEQ ID !IO:531 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 619 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIHAL SOURCE: 

(A) ORGAMISIl: Porphyromona s gingival is 

(ix) FEATURE: 

(A) NAIIE/KEY: mi£c_f eature 

(E) LOCATIOM 1 ... 679 



SEQUEUCE DESCRI PTIOll : SEQ ID 110:531 



V a 1 


Met 


A£-p 


Tyr 


Lys 


Leu 


Thr 


Ser 


Arg 


Phe 


Lys 


Pro 


Thr 


Gl y 


Asp 


Gin 


1 








5 










1 0 










1 5 




Pro 


G 1 u 


Al a 


He 


Arg 


Gl n 


Leu 


Va 1 


Gin 


Gly 


He 


Asn 


Glu 


Gly 


1 let 


Pro 








2 0 










25 










30 






Al a 


Gl n 


Tnr 


Leu 


Leu 


Gly 


Val 


Thr 


Gly 


Ser 


Gly 


Lys 


Thr 


Phe 


Thr 


Va 1 






35 










4 0 










4 5 








n 1 =. 


Asn 


Val 


Val 


MX d 


/\ J. d 


Val 


As n 


Arg 


Pro 


Thr 


Leu 


Val 


Leu 


Ser 


His 




50 




















6 0 










Asn 


Lys 


Thr 


Leu 


Ala 


Ala 


Gin 


Leu 


Tyr 


Gly 


Glu 


Phe 


L^'s 


Ala 


Phe 


Pile 


65 










70 










7 5 










80 


Pro 


Glu 


Asn 


Ala 


Val 


Glu 


Tyr 


Phe 


Val 


Ser 


Tyr 


Tyr 


Asp 


Tyr 


Tyr 


Gin 










85 










90 










95 




Pro 


Glu 


Ala 


Tyr 


Leu 


Pro 


Val 


Thr 


Asp 


Thr 


Tyr 


He 


Glu 


Lys 


Asp 


Met 








100 










105 










110 






Ala 


He 


Asn 


Al a 


Glu 


He 


Glu 


Lys 


Leu 


Arg 


Leu 


Arg 


Ala 


Thr 


Ala 


Ser 






115 










120 










125 








Leu 


Leu 


Ser 


Gly 


Arg 


Lys 


Asp 


Val 


Leu 


Val 


Val 


Ser 


Ser 


Val 


Ser 


Cys 




130 










135 










140 










Leu 


Tyr 


Gly 


Met 


Ala 


Asn 


Pro 


Glu 


Ala 


Pi.e 


Ser 


Glu 


Lys 


Val 


He 


Ser 


145 










150 










155 










160 


Leu 


His 


Tlir 


Gly 


Gin 


Arg 


Ala 


Asp 


Arg 


Asp 


His 


Phe 


He 


Arg 


Leu 


Leu 










165 










170 










175 




Vai 


Glu 


Ser 


Tyr 


Tyr 


Thr 


Asn 


Asn 


Lys 


Val 


Glu 


Phe 


Glu 


Ser 


Gly 


Asn 








180 










105 










190 






Phe 


Arg 


Val 


Lys 


Gly 


Asp 


Ser 


Val 


Asp 


He 


Phe 


Pro 


Ala 


Val 


Glu 


Gly 






195 










200 










205 








Tyr 


Asp 


Gly 


Val 


Ala 


Tyr 


Arg 


Val 


Glu 


Phe 


Trp 


Asp 


Gly 


Glu 


Val 


Glu 




210 










215 










220 










Arg 


Leu 


Ser 


Thr 


Phe 


Asp 


Pro 


Arg 


Thr 


Gly 


Arg 


Glu 


Tyr 


Gly 


Leu 


Leu 


225 










230 










235 








240 


Ser 


Glu 


Leu 


Lys 


He 


Tyr 


Pro 


Ala 


Asn 


Leu 


Phe 


Val 


Thr 


Thr 


Lys 


Glu 










245 










2 50 










255 




Gin 


Val 


Asp 


Arg 


Ala 


Val 


Gly 


Lys 


He 


Asp 


Val 


Asp 


Leu 


Gly 


Ala 


Gin 








260 










265 










270 






Val 


Asp 


Phe 


Leu 


Lys 


Glu 


He 


Gly 


Lys 


Pro 


Tyr 


Glu 


Ala 


Lys 


Arg 


Leu 






275 










280 










285 








Tyr 


Glu 


Arg 


Val 


Thr 


Tyr 


Asp 


Leu 


Glu 


Met 


He 


Arg 


Glu 


Leu 


Gly 


Tyr 




290 










295 










300 










Cys 


Ser 


Gly 


He 


Glu 


Asn 


Tyr 


Ser 


Arg 


Tyr 


Phe 


Asp 


Gly 


Arg 


Asp 


Ala 


305 










310 










315 










320 


Gly 


Glu 


Arg 


Pro 


Phe 


Cys 


Leu 


Leu 


Asp 


Tyr 


Phe 


Pro 


Gl u 


Asp 


Phe 


Leu 










325 










330 










335 




Leu 


Val 


He 


Asrp 


Glu 


Ser 


His 


Val 


Thr 


He 


Pro 


Gin 


He 


Arg 


Ala 


Met 








340 










345 










350 






Tyr 


Gly Gly Asp 


Arg 


Ser 


Arg 


Lys 


Glu 


Asn 


Leu 


Val 


Glu 


Tyr 


Gly 


Phe 






355 










360 










365 






Arg 


Leu 


Pro 


Ala 


Ala 


Leu 


Asp 


Asn 


Arg 


Pro 


Leu 


Arg 


Phe 


Asp 


Glu 


Phe 



370 375 380 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



BNSDOCIO; <WO 9929e70A1_L> 



wo 99/29870 



PCT/AU98/01023 



489/ 490 



50 



60 



65 



70 



75 



Glu 


Ala 


Leu 


Thr 


Pro 


Arg 


Thr 


Leu 


Tyr 


He 


Ser 


Ala 


Thr 


Pro 


Ai a 


Asp 


385 










390 










395 










4 00 


Tyr 


Glu 


Leu 


Asn 


Arci 


Ser 


Glu 


Gly 


Val 


He 


Val 


Glu 


Gin 


Leu 


He 


Arg 










4 05 
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40 (2) INFORMATION FOR SEQ ID 110:532 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LEIIGTH: 678 amino acids 

(B) TVPB:: amino acid 
45 (D) TOPOLOGV: linear 

(ii) MOLECULE TYPE: protein 

till) HYPOTHETICAL : VES 



(vL) ORIGINAL SOURCE: 

(A) ORGANISM : Porphyromonas gingival is 



(ix) FEATURE: 
55 (A) NAME/KEY: mi sc_ fea ture 

(B) LOCATION 1. . .678 



ixi) SEQUE13CE DESCRIPTION: SEQ ID NO: 532 
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